8 Drainage

The numerical summaries in Chapter 6 raise some interesting questions. One is “How is drainageArea defined? Is it the watershed drained by the current dam extending upriver to the next dam? Or is a dam’s drainageArea inclusive of the whole up-river watershed? The exploration below confirms the latter.

8.1 Dams with the largest drainage areas

First: data quality. There seems to be some errors in the data. In the top 20 dams in terms of drainageArea (mi^2), some seem too small (nidStorage, nidHeight and surfaceArea) to drain such a large area (Table 8.1).

In particular:

FL75010 see also Google Maps
OK22289 on the Oklahoma River see also Google Maps which is “claiming” the entire drainage area of the Oklahoma River, which like Soo locks in Michigan, distorts.
MI00650 Soo Locks on Lake Superior as discussed in Chapter 6 Numbers.

Show the code

dta_for_plot <- dams |>
  st_drop_geometry() |>
  select(name, otherNames, state, big_dam, nidId, nidStorage, nidHeight, damLength, surfaceArea, maxDischarge, drainageArea)
  

dta_for_plot |>
  slice_max(order_by = drainageArea,
            n = 20) |>
  # select(-damLength) |>
  arrange(desc(drainageArea)) |>
  gt() |>
  tab_options(table.font.size = 11) |>
  tab_header(md("**Top 20 dams in NID in terms of drainageArea**")) |>
  sub_missing() |>
  fmt_number(columns = c(nidStorage, nidHeight, surfaceArea, maxDischarge, drainageArea),
             decimals = 0)

Table 8.1

Top 20 dams in NID in terms of drainageArea
name	otherNames	state	big_dam	nidId	nidStorage	nidHeight	damLength	surfaceArea	maxDischarge	drainageArea
Eastern Avenue	Maps	Oklahoma	FALSE	OK22289	1,680	13	300	142	48,000	8,300,000
Old River	Sidney A. Murray, Jr.	Louisiana	TRUE	LA00598	—	134	1004	—	—	1,250,000
St. Road 846 Land Trust Imp	—	Florida	FALSE	FL75010	350	9	10000	72	10	721,640
Reed Bingham Park Lake Dam	—	Georgia	FALSE	GA01591	3,812	16	2000	295	—	385,000
Long Shoals Lake Dam	—	North Carolina	FALSE	NC00372	160	13	350	30	990	302,720
Robert Moses - St. Lawrence	Robert Moses - Robert H. Saunders Dam	New York	TRUE	NY00678	803,000	167	1600	37,500	—	300,000
Laguna Diversion	Laguna Diversion Reservoir	Arizona	FALSE	AZ10315	1,600	43	4780	—	487,000	287,000
Grand Falls Dam	Grand Falls Dam	Missouri	FALSE	MO20006	401	15	—	50	—	275,000
Oahe Dam	Lake Oahe	South Dakota	TRUE	SD01095	23,600,000	245	9300	376,000	300,000	243,490
Bonneville Locks and Dam	Lake Bonneville	Oregon	TRUE	OR00001	537,000	197	2477	20,600	1,600,000	240,000
The Dalles Lock and Dam	Lake Celilo	Oregon	TRUE	OR00002	330,000	200	8735	11,200	2,290,000	237,000
John Day Lock and Dam	Lake Umatilla	Oregon	TRUE	OR00011	2,530,000	230	5900	55,000	2,250,000	226,000
McNary Lock and Dam	Lake Wallula	Oregon	TRUE	OR00616	1,350,000	220	7365	38,800	2,200,000	214,000
Olmsted Locks and Dam	—	Illinois	FALSE	IL50745	728,000	67	3575	19,000	—	203,000
Parker Dam	Lake Havasu	Arizona	TRUE	AZ10312	180,000	320	856	20,390	314,000	182,700
Soo Locks	Superior	Michigan	FALSE	MI00650	277,540,000	17	968	20,255,000	68,000	181,000
Garrison Dam	Lake Sakakawea	North Dakota	TRUE	ND00145	26,000,000	210	11300	133,000	827,000	180,940
Retamal Diversion Dam	—	Texas	FALSE	TX07037	6,000	36	172	—	30,000	176,112
Anzalduas Diversion Dam	—	Texas	FALSE	TX07036	16,400	23	524	1,287	130,000	176,112
12th Street Intake Dam	—	West Virginia	FALSE	WV08308	150	10	217	35	—	172,160

Show the code

dta_for_model <- dams |>
  st_drop_geometry() |>
  select(nidId, big_dam, nidStorage, nidHeight, surfaceArea, drainageArea, maxDischarge) |>
  filter(!is.na(nidStorage),
         !is.na(drainageArea)) |>
  filter(!nidId %in% c("FL75010", "OK22289", "MI00650"))

n_dta_for_model <- nrow(dta_for_model)

top100_storage <- dta_for_model |>
  slice_max(order_by = nidStorage,
            n = 100)

top100_drainage <- dta_for_model |>
  slice_max(order_by = drainageArea,
            n = 100)

dta_for_plot <- inner_join(top100_storage,
                           top100_drainage |>
                             select(nidId),
                           by = "nidId")

n_top100_drainage_storage <- nrow(dta_for_plot)

So let’s let’s remove them and look at the 54,443 dams in the NID that are not missing any values for nidStorage, nidHeight, surfaceArea, and drainageArea.

In Figure 8.1 Panel B the 12 larger points represent dams that are in both the top 100 dams by drainageArea and by nidStorage.

Show the code

p1 <- dta_for_model |>
  ggplot() +
  geom_point(aes(nidStorage, drainageArea, color = big_dam),
             size = 0.5, alpha = 0.4,
             na.rm = TRUE,
             show.legend = FALSE) +
  scale_x_log10(labels = label_number(scale_cut = cut_short_scale())) +
  scale_y_log10(labels = label_number(scale_cut = cut_short_scale())) +
  scale_color_viridis_d(end = 0.8) +
  # guides(color = guide_legend(override.aes = list(size = 3))) +
  labs(
    subtitle = glue("A: Bi-modal distribution evident in dams with small nidStorage",
                    "\nSee visual vertical artifacts at ~100 and ~1000 acre-ft:\non the small end are storage amounts rough estimates?")
  )

my_drainageArea <- 70000
my_nidStorage <- 100000

p2 <- dta_for_model |>
  filter(drainageArea > my_drainageArea,
         nidStorage > my_nidStorage) |>
  ggplot() +
  geom_point(data = dta_for_plot,
             aes(nidStorage, drainageArea, color = big_dam),
             size = 3, alpha = 0.4,
             na.rm = TRUE) +
  geom_point(aes(nidStorage, drainageArea, color = big_dam),
             size = 0.5, alpha = 0.4,
             na.rm = TRUE) +
  scale_x_log10(labels = label_number(scale_cut = cut_short_scale())) +
  scale_y_log10(labels = label_number(scale_cut = cut_short_scale())) +
  scale_color_viridis_d(end = 0.8) +
  guides(color = guide_legend(position = "inside",
                              override.aes = list(size = 3))) +
  theme(legend.position.inside = c(0.2, 0.85)) + # c(0.85, 0.2)) +
  labs(
    subtitle = glue("B: Zooming in on the upper right: {n_top100_drainage_storage} dams", 
                    "\n are in top 100 in both drainage and storage",
                    "\nShowing drainageArea > {my_drainageArea}, nidStorage > {my_nidStorage}"),
    y = NULL
  )

p1 + p2 +
  plot_annotation(
    title = "Dam drainage area (mi^2) by storage (acre-ft)",
    caption = my_caption
  )

Show the code

top_dams <- dams |>
  select(name, otherNames, nidId, big_dam, primaryOwnerTypeId, primaryPurposeId, nidStorage, nidHeight, surfaceArea, drainageArea) |>
  inner_join(
    dta_for_plot |>
      select(nidId),
    by = "nidId"
  ) |>
  mutate(drainageArea_m2 = drainageArea * 2.59e+6, # convert to meters
         drainage_radius = sqrt(drainageArea_m2 / pi), # area = pi * r^2
         drainage_circle = st_buffer(geom, dist = drainage_radius),
         drainage_area_check = st_area(drainage_circle),
         ptc_of_defined_area = drainage_area_check / drainageArea_m2
  )

top_dams |>
  st_drop_geometry() |>
  arrange(desc(drainageArea)) |>
  select(name, otherNames, nidId, primaryOwnerTypeId, primaryPurposeId, nidStorage, nidHeight, surfaceArea, drainageArea) |>
  gt() |>
  tab_options(table.font.size = 11) |>
  tab_header(md(glue("**There are {n_top100_drainage_storage} dams in NID in the top 100", 
                      " for both storage capacity and drainageArea**"))) %>%
  fmt_number(columns = c(nidStorage, nidHeight, surfaceArea, drainageArea),
             decimals = 0)

Table 8.2

There are 12 dams in NID in the top 100 for both storage capacity and drainageArea
name	otherNames	nidId	primaryOwnerTypeId	primaryPurposeId	nidStorage	nidHeight	surfaceArea	drainageArea
Oahe Dam	Lake Oahe	SD01095	Federal	Flood Risk Reduction	23,600,000	245	376,000	243,490
John Day Lock and Dam	Lake Umatilla	OR00011	Federal	Navigation	2,530,000	230	55,000	226,000
Garrison Dam	Lake Sakakawea	ND00145	Federal	Hydroelectric	26,000,000	210	133,000	180,940
Hoover Dam	Lake Mead	NV10122	Federal	Hydroelectric	30,237,000	730	162,700	167,800
Falcon Dam	NA	TX00024	Federal	Flood Risk Reduction	3,177,000	175	115,400	159,270
Amistad Dam	NA	TX02296	Federal	Flood Risk Reduction	5,128,000	287	89,000	123,134
Glen Canyon Dam	Lake Powell	AZ10307	Federal	Hydroelectric	25,025,826	710	160,784	107,417
Grand Coulee Dam	Franklin D. Roosevelt Lake	WA00262	Federal	Flood Risk Reduction	9,715,346	550	82,300	75,117
Keystone Dam	Keystone Lake	OK10309	Federal	Flood Risk Reduction	1,672,613	121	22,420	74,506
Brownlee	Brownlee	ID00056	Private	Hydroelectric	1,470,000	395	14,621	72,800
Fort Peck Dam	Fort Peck Lake	MT00025	Federal	Flood Risk Reduction	19,100,000	256	93,000	57,725
Painted Rock Dam	Painted Rock Reservoir	AZ10002	Federal	Flood Risk Reduction	4,831,500	181	1	50,800

In the NID data dictionary, drainage area is defined as follows:

Drainage area of the dam, in square miles, which is defined as the area that drains to a particular point (in this case, the dam) on a river or stream.

Thus when one dam is downriver of another, the downriver dam drainage area includes the upriver dam’s drainage area. In Figure 8.2 this is visible in the drainage areas of dams on the Columbia River (Figure 8.3). It’s also evident in the dams on the Missouri River in Montana, North Dakota, and South Dakota (Figure 8.4).

The top dams are all west of the Mississippi River.

Show the code

n_top_dams <- nrow(top_dams)

ggplot() +
  geom_sf(data = state_boundaries_sf,
          color = "grey50",
          fill = NA,
          alpha = 0.01) +
  geom_sf(data = rivers_us,
          color = "darkblue",
          linewidth = 0.1,
          fill = NA,
          alpha = 0.3) +
  geom_sf(data = top_dams |>
            st_set_geometry("drainage_circle"), #|>
            # st_crop(bbox_continental),
          color = "dodgerblue", fill = "dodgerblue",
          alpha = 0.2) +
  geom_sf(data = top_dams |>
            mutate(n_primaryPurposeId = n(),
                   .by = primaryPurposeId) |>
            mutate(primaryPurposeId = glue("{primaryPurposeId} (n={n_primaryPurposeId})"),
                   primaryPurposeId = fct_reorder(primaryPurposeId, -n_primaryPurposeId)),
          aes(color = primaryPurposeId),
          size = 3,
          alpha = 0.75) +
  guides(color = guide_legend(override.aes = list(size = 3),
                              position = "inside")) +
  theme(axis.text = element_blank(),
        axis.ticks = element_blank(),
        axis.title = element_blank(),
        legend.position.inside = c(0.15, 0.15)) +
  labs(
    title = glue("There are {n_top_dams} top dams in in the National Inventory of Dams"),
    subtitle = glue("Defined as the top 100 in both drainage area and storage area.",
                    " Circles show size of drainage area, not actual drainage area"),
    caption = my_caption_nid_nws
  )

Zooming into the parts of the Columbia River basin in Washington and Oregon…

Show the code

columbia_river_watershed <- st_read(here("data/raw/usgs/WBD_17_HU2_GDB/WBD_17_HU2_GDB.gdb"),
                                    layer = "WBDHU4",
                                    quiet = TRUE)

# col_layers <- st_layers(here("data/raw/usgs/WBD_17_HU2_GDB/WBD_17_HU2_GDB.gdb"))
# library(nhdplusTools)
# 
# # -------------------------------------------------------------
# # 1. Find the COMID (hydrologic ID) for the Columbia River outlet
# # -------------------------------------------------------------
# # Use the approximate mouth coordinates of the Columbia River
# # columbia_outlet <- data.frame(
# #   lon = -123.97,
# #   lat = 46.25
# # )
# 
# columbia_outlet <- c(
#   lon = -123.97,
#   lat = 46.25
# )
# # 
# # # Get COMID for the river outlet
# comid <- discover_nhdplus_id(st_sfc(st_point(columbia_outlet),
#                                     crs = "NAD83")
# )
# # 
# # # -------------------------------------------------------------
# # # 2. Get the upstream watershed (all catchments draining to that COMID)
# # # -------------------------------------------------------------
# # # get_vaa_names() # shows available attributes
# # columbia_watershed <- get_nhdplus(#AOI = comid, 
# #   comid = comid,
# #   realization = "catchment")
# 
# # # Alternatively, to get the complete upstream network (larger area):
# columbia_network <- get_nhdplus(comid = comid, realization = "catchment")
# 
# flowline <- navigate_nldi(list(featureSource = "comid", 
#                                featureID = comid), 
#                           mode = "upstreamTributaries", 
#                           distance_km = 9000)
# # 
# # -------------------------------------------------------------
# # 3. Convert to sf and ensure consistent CRS
# # -------------------------------------------------------------
# columbia_sf <- st_as_sf(columbia_watershed)

# -------------------------------------------------------------
# 4. Plot with ggplot
# -------------------------------------------------------------

my_rivers <- st_join(columbia_river_watershed, rivers_us,
                     join = st_touches)

# my_rivers <- st_crop(rivers_us, columbia_river_watershed)

wa_or_bbox <- st_bbox(state_boundaries_sf  |>
                        filter(state_abb %in% c("WA", "OR"))
                      )

ggplot() +
  geom_sf(data = state_boundaries_sf |>
            filter(state_abb %in% c("WA", "OR")),
          fill = NA) +
  geom_sf(data = columbia_river_watershed |>
            st_crop(wa_or_bbox),
          fill = "skyblue", 
          color = "firebrick", 
          size = 0.1,
          alpha = 0.2) +
   geom_sf(data = my_rivers |>
            # filter(pname == "COLUMBIA R") |>
            st_crop(wa_or_bbox),
          color = "darkblue",
          linewidth = 0.5,
          fill = NA,
          alpha = 0.3) +
  geom_sf(data = rivers_us |>
            filter(PNAME == "COLUMBIA R") |>
            st_crop(wa_or_bbox),
          color = "darkblue",
          linewidth = 1,
          fill = NA,
          alpha = 0.3) +
  coord_sf() +
  labs(
    title = "Columbia River Watershed",
    # subtitle = "Retrieved using nhdplusTools and plotted with ggplot2",
    caption = "Source: USGS National Hydrography Dataset (NHDPlus)"
  ) +
  theme_minimal()

# # -------------------------------------------------------------
# # 1. Get the USGS Watershed Boundary Dataset (WBD)
# # -------------------------------------------------------------
# # HUC2 = 17 corresponds to the Columbia River Basin
# columbia_huc <- "17"
# 
# # Retrieve the HUC2 polygon for the Columbia River Basin
# columbia_basin <- subset_wbd(huc = columbia_huc, type = "huc2")
# 
# # -------------------------------------------------------------
# # 2. Check and transform CRS if needed
# # -------------------------------------------------------------
# columbia_basin <- st_transform(columbia_basin, 4326)
# 
# # -------------------------------------------------------------
# # 3. Plot with ggplot
# # -------------------------------------------------------------
# ggplot() +
#   geom_sf(data = columbia_basin, fill = "lightblue", color = "darkblue", size = 0.4) +
#   coord_sf() +
#   labs(
#     title = "Columbia River Basin (HUC2 = 17)",
#     subtitle = "Watershed Boundary Dataset (WBD)",
#     caption = "Source: USGS / nhdplusTools"
#   ) +
#   theme_minimal()

Show the code

dta_for_plot <- dams |>
  select(nidId, riverName, drainageArea, geom) |>
  filter(str_squish(str_to_lower(riverName)) == "columbia river") |>
  mutate(drainageArea_m2 = drainageArea * 2.59e+6, # convert to meters
         drainage_radius = sqrt(drainageArea_m2 / pi), # area = pi * r^2
         drainage_circle = st_buffer(geom, dist = drainage_radius),
         drainage_area_check = st_area(drainage_circle),
         ptc_of_defined_area = drainage_area_check / drainageArea_m2,
         spotlight = (drainageArea_m2 == max(drainageArea_m2) | 
                        drainageArea_m2 == min(drainageArea_m2))
  )
         
n_dams_columbia <- nrow(dta_for_plot)

wa_or_bbox <- st_bbox(state_boundaries_sf  |>
                        filter(state_abb %in% c("WA", "OR"))
                      )

ggplot() +
  geom_sf(data = state_boundaries_sf  |>
            filter(state_abb %in% c("WA", "OR")),
          color = "grey50",
          fill = NA,
          alpha = 0.01) +
  geom_sf(data = rivers_us |>
            filter(PNAME == "COLUMBIA R") |>
            st_crop(wa_or_bbox),
          color = "darkblue",
          linewidth = 1,
          fill = NA,
          alpha = 0.3) +
  geom_sf(data = rivers_us |>
            st_crop(wa_or_bbox),
          color = "darkblue",
          linewidth = 0.1,
          fill = NA,
          alpha = 0.3) +
  geom_sf(data = dta_for_plot |>
            filter(spotlight) |>
            st_set_geometry("drainage_circle"), 
            # st_crop(bbox_continental),
          color = "dodgerblue", fill = "dodgerblue",
          alpha = 0.05) +
  geom_sf(data = dta_for_plot,
          aes(color = spotlight),
          show.legend = FALSE,
          alpha = 1) +
  scale_color_manual(values = c("black", "firebrick")) +
  theme(axis.text = element_blank(),
        axis.ticks = element_blank(),
        axis.title = element_blank(),
        legend.position.inside = c(0.15, 0.15)) +
  labs(
    title = glue("Largest drainage area is dam most down-river;\nsmallest is most up-river"),
    subtitle = glue("There are {n_dams_columbia} dams on the Columbia River according to the National Inventory of Dams.",
                    "\nCircles show size of drainage area, not actual drainage area."),
    caption = my_caption_nid_nws
  )

8.2 The Mississippi river system

Show the code

rivers_fname <- here("data/raw/nws/rs16my07") # subset of US river data 
rivers_us <- st_read(rivers_fname,
              crs = "NAD83",
              quiet = TRUE) |>
  clean_names()

miss <- rivers_us |>
  filter(str_detect(pname, "^MISSISSIPPI")) |>
  select(rf1_150_id, huc, seg, seqno, lev, pname, geometry)

miss_and_tribs <- rivers_us |>
  filter(str_detect(pname, "^(MISSISSIPPI|MISSOURI|OHIO|ARKANSAS|ILLINOIS|TENNESSEE)"))

dams_on_miss <- dams |>
  filter(str_detect(riverName, "^MISSISSIPPI")) |>
  filter(!nidId %in% c("MO40057")) |>
  filter(drainageArea > 0) |>
  mutate(drainageArea_m2 = drainageArea * 2.59e+6, # convert to meters
         drainage_radius = sqrt(drainageArea_m2 / pi), # area = pi * r^2
         drainage_circle = st_buffer(geom, dist = drainage_radius),  # shouldn't need a factor on dist?
         drainage_area_check = st_area(drainage_circle),
         ptc_of_defined_area = drainage_area_check / drainageArea_m2
  )

dams_on_miss_flow_order <- dams_on_miss |>
  arrange(desc(latitude)) |>
  mutate(idx = row_number())

dams_on_miss_flow_order_no_drainageArea <- dams |>
  filter(str_detect(riverName, "^MISSISSIPPI")) |>
  arrange(desc(latitude)) |>
  mutate(idx = row_number()) |>
  filter(is.na(drainageArea))

dams_on_miss_and_tribs <- dams |>
  filter(str_detect(riverName, "^(MISSISSIPPI|MISSOURI|OHIO|ARKANSAS|ILLINOIS|TENNESSEE)")) |>
  filter(!nidId %in% c("PA00714", "MO40057")) |>
  filter(!nidId %in% c("MO40057")) |>
  filter(drainageArea > 0) |>
  mutate(drainageArea_m2 = drainageArea * 2.59e+6, # convert to meters
         drainage_radius = sqrt(drainageArea_m2 / pi), # area = pi * r^2
         drainage_circle = st_buffer(geom, dist = drainage_radius),  # shouldn't need a factor on dist?
         drainage_area_check = st_area(drainage_circle),
         ptc_of_defined_area = drainage_area_check / drainageArea_m2
  )

The Mississippi River and its tributaries drain the largest portion of the continental US. The Mississippi has been (mostly) “tamed” through many dams and levees.

Figure 8.4 shows graphically that, in general, the drainageArea increases at dams downriver. Some exceptions are visible on the Missouri river, which in some cases might be dams on tributaries close to the Missouri River.

Show the code

ggplot() +
  geom_sf(data = continental_us_border,
          color = "grey50",
          fill = NA,
          linewidth = 0.1,
          alpha = 0.1) +
  geom_sf(data = miss_and_tribs,
          color = "darkblue",
          linewidth = 0.1,
          fill = NA,
          alpha = 0.3) +
  geom_sf(data = dams_on_miss_and_tribs |>
            st_set_geometry("drainage_circle"),
          color = NA, #"dodgerblue",
          fill = "dodgerblue",
          alpha = 0.05) +
  geom_sf(data = dams_on_miss_and_tribs,
          size = 0.5,
          color = "firebrick") +
  theme(axis.text = element_blank(),
        axis.ticks = element_blank(),
        axis.title = element_blank(),
        legend.position.inside = c(0.15, 0.15)) +
  labs(
    title = glue("The Mississippi River and its major tributaries"),
    # subtitle = str_wrap(nws_river_data_description, 120),
    caption = my_caption_nid_nws
  )

Figure 8.4: Map of the Mississippi River and its major tributaries (NWS data)

Looking only at the Mississippi River (Figure 8.5) and related Figure 8.6, it’s clear that drainage area increases for dams downriver.

Show the code

ggplot() +
  geom_sf(data = continental_us_border,
          color = "grey50",
          fill = NA,
          linewidth = 0.1,
          alpha = 0.1) +
  geom_sf(data = miss_and_tribs,
          color = "darkblue",
          linewidth = 0.1,
          fill = NA,
          alpha = 0.3) +
  geom_sf(data = dams_on_miss |>
            st_set_geometry("drainage_circle"),
          color = NA, #"dodgerblue",
          fill = "dodgerblue",
          alpha = 0.1) +
  geom_sf(data = dams_on_miss,
          size = 0.5,
          color = "firebrick") +
  geom_sf(data = dams_on_miss_flow_order_no_drainageArea,
          size = 4,
          shape = "X",
          color = "firebrick") +
  theme(axis.text = element_blank(),
        axis.ticks = element_blank(),
        axis.title = element_blank(),
        legend.position.inside = c(0.15, 0.15)) +
  labs(
    title = glue("The Mississippi River"),
    subtitle = "Dams without drainageArea data shown with 'X'",
    caption = my_caption_nid_nws
  )

Figure 8.5: Map of dams on the Mississippi River with drainage area

Figure 8.6 presents the same data:

Show the code

dams_on_miss_flow_order |>
  ggplot() +
  geom_point(aes(idx, drainageArea),
             na.rm = TRUE) +
  scale_x_continuous(expand = expansion(mult = c(0.01, 0.04)),
                     breaks = c(1, 1:4*10)) +
  scale_y_continuous(labels = label_number(scale_cut = cut_short_scale()),
                     expand = expansion(mult = c(0.01, 0.04))) +
  labs(
    title = "Dams on the Mississippi\ndrainage area increases down-river",
    x = "Index north to south (1 is northern-most dam)",
    y = "Drainage area (mi^2)",
    caption = my_caption_nid_nws
  )

Figure 8.6: Dams on the Mississippi increase in drainage area down-river