Introduction to Baseflow and Streamflow Prediction

The accuracy of continental-scale hydrological models in predicting streamflow, particularly in dry regions, hinges on their ability to simulate various water balance components, including surface and subsurface runoff, soil moisture retention, as well as groundwater recharge and discharge. Runoff, which includes both surface and subsurface components, plays a critical role in the distribution of water between immediate streamflow and groundwater recharge, impacting baseflow generation. In dry regions, where precipitation is sparse, subsurface runoff becomes especially relevant, as it often contributes to baseflow—a key element in sustaining streamflow during dry periods. Therefore, examining both surface and subsurface runoff provides insight into how different physical process representations in models affect baseflow estimates.

Baseflow Example
Figure 1. Baseflow computed using the method suggested by Van Dijk (2010) for GRDC 6731165 station data to compare with Beck et al (2013).

Importance of Understanding Baseflow Generation

Baseflow plays a significant role in sustaining streamflow during dry periods, yet traditional Land Surface Models (LSMs) struggle to accurately represent groundwater recharge, soil moisture retention, and infiltration processes. Overestimation of the Baseflow Index (BFI) results in unrealistic low-flow simulations, highlighting the need for improved hydrological modeling approaches.

In dry regions where precipitation is intermittent and highly variable, baseflow sustains river flows during dry periods. However, traditional LSMs frequently overestimate baseflow, leading to inaccurate predictions of low-flow conditions and groundwater recharge rates. This study hypothesizes that improving the physical representation of baseflow processes in LSMs will lead to better streamflow predictions. By refining soil water retention schemes, macropore flow mechanisms, and surface ponding processes, we aim to enhance the ability of LSMs to capture baseflow dynamics accurately.

BFI Results
Figure 2. BFI at 390 gauges across the southwestern region (a) derived from the USGS streamflow as well as those from the model experiments (b) NWM; (c) CH; (d) VGM; (e) DPM; and (f) VGM0..

Key Findings

1. Soil Water Retention Curves Impact Baseflow Representation - Comparing the recharge and subsurface runoff for CH and VG models revealed that the CH model exhibits higher recharge and greater subsurface runoff than the VG model. Since subsurface runoff is a primary source of baseflow, the higher values in the CH model led to an overestimation of baseflow. Moreover, the VG model showed higher surface runoff than the CH model, meaning less water contributes to baseflow. Therefore, the CH model's tendency to overestimate baseflow and the Baseflow Index (BFI) can be attributed to its higher simulated recharge and subsurface runoff.

In low BFI regions, the higher surface runoff produced by VG results in less water infiltrating into the deep soil and recharging the groundwater. Hence, VG could generate lower baseflow and BFI in these regions. In contrast, CH shows more recharge into groundwater and consequently higher BFI.

2. Role of Soil Macropores and Surface Ponding - The presence of soil macropores in DPM experiment facilitated rapid infiltration and preferential flow through the unsaturated zone, groundwater recharge, and thus baseflow (Mohammed, Cey et al. 2021). In our study, DPM tends to overestimate BFI in low-BFI regions whereas better captured high BFI values. The presence of macropores increased drainage from the surface to the root zone, potentially reducing surface layer moisture retention and increasing groundwater recharge and discharge. This enhances predictions in wet, high-BFI regions but may produce unrealistic results in low-BFI areas. Hence calibration of the soil macropore volume fraction, which is parameterized as a linear function of soil organic matter, is critical to achieve more realistic results. Also, the relationship between soil macropore volume fraction and soil organic matter and other coarse materials (gravels, stones) are worth further investigation.

Including a ponding threshold, as seen in the VGM scenario with a 50 mm maximum ponding depth, is crucial for improving baseflow generation, especially in high-BFI regions. This configuration allows more water to remain on the surface longer for infiltration before running off, which enhances infiltration and groundwater recharge and positively impacts baseflow generation.

3. Hydraulic Parameterization Influences Streamflow Predictions - Saturated hydraulic conductivity and the soil water retention curve parameters affect infiltration rate at the soil surface, soil moisture movement, recharge into groundwater, and streamflow generation, especially in timing and peak of the streamflow (ML’s KGE). However, we could not observe significant effect of hydraulic parameter on generated BFI. Accurate estimation of these parameters can greatly improve streamflow predictions but is very challenging due to the complex soil texture, structure, and presence of coarse materials (See (Gupta, Papritz et al. 2022)) . ML-derived parameters showed a 20% improvement KGE, better matching observed streamflow patterns than traditional lookup tables used by the NWM. However, the limited geographic and climatic distribution of the training dataset contributing in generating ML parameters, may affect its generalization, potentially leading to biases in the predicted streamflow.

4. Precipitation Data Selection Matters - The choice of precipitation datasets also significantly impacts the accuracy of baseflow predictions. Our results indicated that using the IMERG dataset improves the accuracy of BFI predictions in regions where NLDAS-2 tends to overestimate BFI. The IMERG experiments successfully captured lower BFI values in regions where the VGM configuration with AORC and NLDAS-2 precipitation overestimate them, particularly in the coastal regions of the California River Basin.

KGE Improvement
Figure 3 KGE Improvement of ML against NWM. ML was chosen since it outperformed other hydrological process and hydraulic parameter scenarios.

Implications for Climate Science and Water Management

These findings improve streamflow forecasting, hydrological modeling, and climate resilience strategies, particularly in arid regions.

1. More Accurate Streamflow Forecasts for Water ManagementImproved baseflow modeling enables better drought predictions and water allocation strategies, particularly in water-scarce regions.

2. Enhanced Climate Models and Hydrological Forecasting Refining baseflow and soil moisture dynamics enhances climate models, improving seasonal water availability forecasts.

3. Flood and Drought Preparedness Better representation of surface infiltration, macropore flow, and groundwater recharge leads to more reliable flood risk assessments and drought response planning.

4. Optimized Hydrological Model Development Implementing machine learning-driven soil parameters and refined precipitation datasets could significantly enhance streamflow predictions in operational hydrological models.

Conclusion & Future Directions

This study emphasizes the critical role of baseflow generation processes in streamflow prediction accuracy, especially in arid regions of the southwestern US. Our modeling results suggest that the streamflow prediction skill are sensitive to how baseflow generation is represented in terms of model physics, associated hydraulic parameters, and precipitation forcing data. Using a Noah-MP with enhanced hydrology, we show that the choice of hydrological processes, hydraulic parameters, and precipitation datasets significantly affects streamflow prediction accuracy over dry southwestern US.

The Van-Genuchten hydraulic scheme is more effective than the Brooks-Corey in modeling baseflow and BFI, particularly in dry regions where the soil is naturally dry, and the BFI is low. This scheme reduced the BFI overestimation produced by the Brooks-Corey with the CH hydraulic parameters (by the Noah-MP look-up table) and NWM with calibrated hydraulic parameters by better capturing groundwater recharge and discharge processes. Additionally, with the machine learning-derived soil water retention curve parameters, VGM significantly improves the streamflow predictions, offering a better match with the observed streamflow compared to the look-up table and pedotransfer functions. In general, our finding implies improving the baseflow in large-scale models like Noah-MP leads to better prediction of streamflow as observed in VGM configuration.

The study also highlights the importance of incorporating soil macropores, DPM experiment, and ponding depth thresholds, VGM and VGM0 experiments, in modeling, as these factors greatly influence infiltration, percolation, recharge and baseflow generation. A ponding depth greater than zero increases BFI by allowing more water to infiltrate, especially in wet regions. Additionally, the presence of macropores enhances drainage from the surface to the root zone, increasing baseflow and BFI. However, the benefits of these features vary by region. While uncalibrated macropore fraction improve predictions in high-BFI areas, they may lead to overestimations of baseflow in low-BFI regions.

Furthermore, the choice of precipitation dataset was shown to be crucial, with the IMERG dataset offering more accurate baseflow predictions in regions where traditional datasets like NLDAS-2 tended to overestimate BFI. Indeed, heavy precipitation facilitates the infiltration into deeper soil and groundwater recharge. This finding suggests that high-resolution precipitation data is essential for improving the accuracy of streamflow predictions in areas with complex hydrological conditions.

Overall, the study demonstrates that careful selection of hydrological processes (soil hydraulic schemes), hydraulic parameters, and precipitation datasets can significantly enhance the performance of hydrologic models in predicting streamflow, particularly in arid regions. These findings provide valuable insights for future research and model development, emphasizing the need to optimize model configurations before calibration to achieve more reliable streamflow predictions.

Reference

Mohammad A. Farmani, Ahmad A. Tavakoly, Ali Behrangi, et al. Improving Streamflow Predictions in the Arid Southwestern United States Through Understanding of Baseflow Generation Mechanisms. ESS Open Archive . November 27, 2024. DOI: 10.22541/essoar.173272456.69006273/v1