Title: | Retrieving, Organizing, and Analyzing Estuary Monitoring Data |
---|---|
Description: | Tools for retrieving, organizing, and analyzing environmental data from the System Wide Monitoring Program of the National Estuarine Research Reserve System <https://cdmo.baruch.sc.edu/>. These tools address common challenges associated with continuous time series data for environmental decision making. |
Authors: | Marcus W. Beck [aut, cre], Kimberly Cressman [ctb] |
Maintainer: | Marcus W. Beck <[email protected]> |
License: | CC0 |
Version: | 2.5.1 |
Built: | 2024-10-14 05:49:04 UTC |
Source: | https://github.com/fawda123/SWMPr |
Aggregate a metabolism attribute from swmpr data by a specified time period and method
aggremetab(swmpr_in, ...) ## S3 method for class 'swmpr' aggremetab(swmpr_in, by = "weeks", na.action = na.pass, alpha = 0.05, ...)
aggremetab(swmpr_in, ...) ## S3 method for class 'swmpr' aggremetab(swmpr_in, by = "weeks", na.action = na.pass, alpha = 0.05, ...)
swmpr_in |
input swmpr object |
... |
additional arguments passed to other methods |
by |
chr string or numeric value specifying aggregation period. If chr string, must be |
na.action |
function for treating missing data, default |
alpha |
numeric indicating alpha level of confidence interval for aggregated data |
The function summarizes metabolism data by averaging across set periods of observation. Confidence intervals are also returned based on the specified alpha level. It is used within plot_metab
function to view summarized metabolism results. Data can be aggregated by 'years'
, 'quarters'
, 'months'
, or 'weeks'
for the supplied function, which defaults to the mean
. The method of treating NA values for the user-supplied function should be noted since this may greatly affect the quantity of data that are returned.
Returns an aggregated metabolism data.frame
if the metabolism
attribute of the swmpr object is not NULL
. Upper and lower confidence limits are also provided if the aggregation period was specified as a character string.
aggregate
, aggreswmp
, ecometab
, plot_metab
## Not run: ## import water quality and weather data data(apadbwq) data(apaebmet) ## qaqc, combine wq <- qaqc(apadbwq) met <- qaqc(apaebmet) dat <- comb(wq, met) ## estimate metabolism res <- ecometab(dat) ## change aggregation period and alpha aggremetab(res, by = 'months', alpha = 0.1) ## use a moving window average of 30 days aggremetab(res, by = 30) ## use a left-centered window instead aggremetab(res, by = 30, sides = 1) ## End(Not run)
## Not run: ## import water quality and weather data data(apadbwq) data(apaebmet) ## qaqc, combine wq <- qaqc(apadbwq) met <- qaqc(apaebmet) dat <- comb(wq, met) ## estimate metabolism res <- ecometab(dat) ## change aggregation period and alpha aggremetab(res, by = 'months', alpha = 0.1) ## use a moving window average of 30 days aggremetab(res, by = 30) ## use a left-centered window instead aggremetab(res, by = 30, sides = 1) ## End(Not run)
Aggregate swmpr data by specified time period and method
aggreswmp(swmpr_in, ...) ## S3 method for class 'swmpr' aggreswmp( swmpr_in, by, FUN = function(x) mean(x, na.rm = TRUE), params = NULL, aggs_out = FALSE, plot = FALSE, na.action = na.pass, ... )
aggreswmp(swmpr_in, ...) ## S3 method for class 'swmpr' aggreswmp( swmpr_in, by, FUN = function(x) mean(x, na.rm = TRUE), params = NULL, aggs_out = FALSE, plot = FALSE, na.action = na.pass, ... )
swmpr_in |
input swmpr object |
... |
additional arguments passed to other methods |
by |
chr string of time period for aggregation one of |
FUN |
aggregation function, default |
params |
names of parameters to aggregate, default all |
aggs_out |
logical indicating if |
plot |
logical to return a plot of the summarized data, default |
na.action |
function for treating missing data, default |
The function aggregates parameter data for a swmpr object by set periods of observation and a user-supplied function. It is most useful for aggregating noisy data to evaluate trends on longer time scales, or to simply reduce the size of a dataset. Data can be aggregated by 'years'
, 'quarters'
, 'months'
, 'weeks'
, 'days'
, or 'hours'
for the supplied function, which defaults to the mean
. A swmpr object is returned for the aggregated data, although the datetimestamp vector will be converted to a date object if the aggregation period is a day or longer. Days are assigned to the date vector if the aggregation period is a week or longer based on the round method for IDate
objects. This approach was used to facilitate plotting using predefined methods for Date and POSIX objects.
The method of treating NA values for the user-supplied function should be noted since this may greatly affect the quantity of data that are returned (see the examples). Finally, the default argument for na.action
is set to na.pass
for swmpr objects to preserve the time series of the input data.
Returns an aggregated swmpr object. QAQC columns are removed if included with input object. If aggs_out = TRUE
, the original swmpr
object is returned with the datetimestamp
column formatted for the first day of the aggregation period from by
. A ggplot
object of boxplot summaries is returned if plot = TRUE
.
## Not run: ## get data, prep data(apacpwq) dat <- apacpwq swmpr_in <- subset(qaqc(dat), rem_cols = TRUE) ## get mean DO by quarters aggreswmp(swmpr_in, 'quarters', params = c('do_mgl')) ## get a plot instead aggreswmp(swmpr_in, 'quarters', params = c('do_mgl'), plot = T) ## plots with other variables p <- aggreswmp(swmpr_in, 'months', params = c('do_mgl', 'temp', 'sal'), plot = T) p library(ggplot2) p + geom_boxplot(aes(fill = var)) + theme(legend.position = 'none') ## get variance of DO by years, remove NA when calculating variance ## omit NA data in output fun_in <- function(x) var(x, na.rm = TRUE) aggreswmp(swmpr_in, FUN = fun_in, 'years') ## End(Not run)
## Not run: ## get data, prep data(apacpwq) dat <- apacpwq swmpr_in <- subset(qaqc(dat), rem_cols = TRUE) ## get mean DO by quarters aggreswmp(swmpr_in, 'quarters', params = c('do_mgl')) ## get a plot instead aggreswmp(swmpr_in, 'quarters', params = c('do_mgl'), plot = T) ## plots with other variables p <- aggreswmp(swmpr_in, 'months', params = c('do_mgl', 'temp', 'sal'), plot = T) p library(ggplot2) p + geom_boxplot(aes(fill = var)) + theme(legend.position = 'none') ## get variance of DO by years, remove NA when calculating variance ## omit NA data in output fun_in <- function(x) var(x, na.rm = TRUE) aggreswmp(swmpr_in, FUN = fun_in, 'years') ## End(Not run)
Import current station records from the CDMO starting with the most current date
all_params(station_code, Max = 100, param = NULL, trace = TRUE)
all_params(station_code, Max = 100, param = NULL, trace = TRUE)
station_code |
chr string of station, 7 or 8 characters |
Max |
numeric value for number of records to obtain from the current date |
param |
chr string for a single parameter to return, defaults to all parameters for a station type. |
trace |
logical indicating if import progress is printed in console |
This function retrieves data from the CDMO through the web services URL. The computer making the request must have a registered IP address. Visit the CDMO web services page for more information: https://cdmo.baruch.sc.edu/webservices.cfm. Function is the CDMO equivalent of exportAllParamsXMLNew
but actually uses all_params_dtrng
, which is a direct call to exportAllParamsDateRangeXMLNew
.
Returns a swmpr object, all available parameters including QAQC columns
all_params_dtrng
, single_param
## Not run: ## all parameters for a station, most recent all_params('hudscwq') ## End(Not run)
## Not run: ## all parameters for a station, most recent all_params('hudscwq') ## End(Not run)
Get station records from the CDMO within a date range
all_params_dtrng(station_code, dtrng, param = NULL, trace = TRUE, Max = NULL)
all_params_dtrng(station_code, dtrng, param = NULL, trace = TRUE, Max = NULL)
station_code |
chr string of station, 7 or 8 characters |
dtrng |
two element chr string, each of format MM/DD/YYYY |
param |
chr string for a single parameter to return, defaults to all parameters for a station type. |
trace |
logical indicating if import progress is printed in console |
Max |
numeric indicating maximum number of records to return |
This function retrieves data from the CDMO through the web services URL. The computer making the request must have a registered IP address. Visit the CDMO web services page for more information: https://cdmo.baruch.sc.edu/webservices.cfm. This function is the CDMO equivalent of exportAllParamsDateRangeXMLNew
.
Download time may be excessive for large requests.
Returns a swmpr object, all parameters for a station type (nutrients, water quality, or meteorological) or a single parameter if specified. QAQC columns are not provided for single parameters.
## Not run: ## get all parameters within a date range all_params_dtrng('apaebwq', c('01/01/2013', '02/01/2013')) ## get single parameter within a date range all_params_dtrng('apaebwq', c('01/01/2013', '02/01/2013'), param = 'do_mgl') ## End(Not run)
## Not run: ## get all parameters within a date range all_params_dtrng('apaebwq', c('01/01/2013', '02/01/2013')) ## get single parameter within a date range all_params_dtrng('apaebwq', c('01/01/2013', '02/01/2013'), param = 'do_mgl') ## End(Not run)
An example nutrient dataset for Apalachicola Bay Cat Point station. The data are a swmpr
object that have been imported into R from csv files using the import_local
function. The raw data were obtained from the CDMO data portal but can also be accessed from a zip file created for this package. See the source below. The help file for the import_local
function describes how the data can be imported from the zip file. Attributes of the dataset include names
, row.names
, class
, station
, parameters
, qaqc_cols
, date_rng
, timezone
, and stamp_class
.
apacpnut
apacpnut
A swmpr
object and data.frame
with 215 rows and 13 variables:
datetimestamp
POSIXct
po4f
num
f_po4f
chr
nh4f
num
f_nh4f
chr
no2f
num
f_no2f
chr
no3f
num
f_no3f
chr
no23f
num
f_no23f
chr
chla_n
num
f_chla_n
chr
https://s3.amazonaws.com/swmpexdata/zip_ex.zip
data(apacpnut)
data(apacpnut)
An example water quality dataset for Apalachicola Bay Cat Point station. The data are a swmpr
object that have been imported into R from csv files using the import_local
function. The raw data were obtained from the CDMO data portal but can also be accessed from a zip file created for this package. See the source below. The help file for the import_local
function describes how the data can be imported from the zip file. Attributes of the dataset include names
, row.names
, class
, station
, parameters
, qaqc_cols
, date_rng
, timezone
, and stamp_class
.
apacpwq
apacpwq
A swmpr
object and data.frame
with 70176 rows and 25 variables:
datetimestamp
POSIXct
temp
num
f_temp
chr
spcond
num
f_spcond
chr
sal
num
f_sal
chr
do_pct
num
f_do_pct
chr
do_mgl
num
f_do_mgl
chr
depth
num
f_depth
chr
cdepth
num
f_cdepth
chr
level
num
f_level
chr
clevel
num
f_clevel
chr
ph
num
f_ph
chr
turb
num
f_turb
chr
chlfluor
num
f_chlfluor
chr
https://s3.amazonaws.com/swmpexdata/zip_ex.zip
data(apacpwq)
data(apacpwq)
An example water quality dataset for Apalachicola Bay Dry Bar station. The data are a swmpr
object that have been imported into R from csv files using the import_local
function. The raw data were obtained from the CDMO data portal but can also be accessed from a zip file created for this package. See the source below. The help file for the import_local
function describes how the data can be imported from the zip file. Attributes of the dataset include names
, row.names
, class
, station
, parameters
, qaqc_cols
, date_rng
, timezone
, and stamp_class
.
apadbwq
apadbwq
A swmpr
object and data.frame
with 70176 rows and 25 variables:
datetimestamp
POSIXct
temp
num
f_temp
chr
spcond
num
f_spcond
chr
sal
num
f_sal
chr
do_pct
num
f_do_pct
chr
do_mgl
num
f_do_mgl
chr
depth
num
f_depth
chr
cdepth
num
f_cdepth
chr
level
num
f_level
chr
clevel
num
f_clevel
chr
ph
num
f_ph
chr
turb
num
f_turb
chr
chlfluor
num
f_chlfluor
chr
https://s3.amazonaws.com/swmpexdata/zip_ex.zip
data(apadbwq)
data(apadbwq)
An example weather dataset for Apalachicola Bay East Bay station. The data are a swmpr
object that have been imported into R from csv files using the import_local
function. The raw data were obtained from the CDMO data portal but can also be accessed from a zip file created for this package. See the source below. The help file for the import_local
function describes how the data can be imported from the zip file. Attributes of the dataset include names
, row.names
, class
, station
, parameters
, qaqc_cols
, date_rng
, timezone
, and stamp_class
.
apaebmet
apaebmet
A swmpr
object and data.frame
with 70176 rows and 23 variables:
datetimestamp
POSIXct
atemp
num
f_atemp
chr
rh
num
f_rh
chr
bp
num
f_bp
chr
wspd
num
f_wspd
chr
maxwspd
num
f_maxwspd
chr
wdir
num
f_wdir
chr
sdwdir
num
f_sdwdir
chr
totpar
num
f_totpar
chr
totprcp
num
f_totprcp
chr
cumprcp
num
f_cumprcp
chr
totsorad
num
f_totsorad
chr
https://s3.amazonaws.com/swmpexdata/zip_ex.zip
data(apaebmet)
data(apaebmet)
Calculate oxygen mass transfer coefficient using equations in Thiebault et al. 2008. Output is used to estimate the volumetric reaeration coefficient for ecosystem metabolism.
calckl(temp, sal, atemp, wspd, bp, height = 10)
calckl(temp, sal, atemp, wspd, bp, height = 10)
temp |
numeric for water temperature (C) |
sal |
numeric for salinity (ppt) |
atemp |
numeric for air temperature (C) |
wspd |
numeric for wind speed (m/s) |
bp |
numeric for barometric pressure (mb) |
height |
numeric for height of anemometer (meters) |
This function is used within the ecometab
function and should not be used explicitly.
Returns numeric value for oxygen mass transfer coefficient (m d-1).
Ro KS, Hunt PG. 2006. A new unified equation for wind-driven surficial oxygen transfer into stationary water bodies. Transactions of the American Society of Agricultural and Biological Engineers. 49(5):1615-1622.
Thebault J, Schraga TS, Cloern JE, Dunlavey EG. 2008. Primary production and carrying capacity of former salt ponds after reconnection to San Francisco Bay. Wetlands. 28(3):841-851.
Flag observations above/below detection limits
cens_id(swmpr_in, ...) ## S3 method for class 'swmpr' cens_id(swmpr_in, flag_type = "both", select = NULL, ...)
cens_id(swmpr_in, ...) ## S3 method for class 'swmpr' cens_id(swmpr_in, flag_type = "both", select = NULL, ...)
swmpr_in |
input swmpr object |
... |
optional arguments passed to or from other methods |
flag_type |
chr string indicating the flag type to return, must be one of |
select |
chr string of parameters to keep, defaults to all, |
Censored observations are identified in swmpr objects using the CDMO flags -4 or -5, indicating outside the low or high sensor range, respectively. Additional codes are identified including A (-2007) or SUL (2007-) for above and B (-2007), SBL (2007-), SCB (2007-, calculated) for below detection limits. The QAQC columns are searched for all parameters and replaced with the appropriate value indicating the detection limit as defined by flag_type
. The default argument flag_type = 'both'
will recode the QAQC columns as -1, 0, or 1 indicating below, within, or above the detection limit. Setting flag_type = 'below'
or 'above'
will convert the columns to TRUE
/FALSE
values indicating observations beyond the detection limit (either above or below, TRUE
) or within the normal detection range FALSE
.
The output includes additional columns similar to those for QAQC flags, such that the column names for censored flags include a c_
prefix before the parameter name. Note that the function will of course not work if already processed with qaqc
. QAQC columns are retained for additional processing.
The user should refer to the metadata or visually examine the observed data to identify the actual limit, which may change over time.
Returns a swmpr object with additional columns for censored flag values and the appropriate flag type based on the input arguments. Censored flag columns are named with a c_
prefix.
## get data data(apacpnut) dat <- apacpnut ## convert all qaqc columns to censored flags, -1 below, 0 within, 1 above cens_id(dat) ## T/F for above or within, note that none are above cens_id(dat, flag_type = 'above') ## T/F for below or within cens_id(dat, flag_type = 'below')
## get data data(apacpnut) dat <- apacpnut ## convert all qaqc columns to censored flags, -1 below, 0 within, 1 above cens_id(dat) ## T/F for above or within, note that none are above cens_id(dat, flag_type = 'above') ## T/F for below or within cens_id(dat, flag_type = 'below')
Combine swmpr data types for a station by common time series
comb(...) ## S3 method for class 'swmpr' comb(..., timestep = 15, differ = NULL, method = "union") ## Default S3 method: comb(..., date_col, timestep = 15, differ = NULL, method = "union")
comb(...) ## S3 method for class 'swmpr' comb(..., timestep = 15, differ = NULL, method = "union") ## Default S3 method: comb(..., date_col, timestep = 15, differ = NULL, method = "union")
... |
input time series data objects, from one to many |
timestep |
numeric value of time step to use in minutes, passed to |
differ |
numeric value defining buffer for merging time stamps to standardized time series, passed to |
method |
chr string indicating method of combining data. Use |
date_col |
chr string indicating name of the date column |
The comb
function is used to combine multiple swmpr objects into a single object with a continuous time series at a given step. The timestep
function is used internally such that timestep
and differ
are accepted arguments for comb
.
The function requires one or more swmpr objects as input as separate, undefined arguments. The remaining arguments must be called explicitly since an arbitrary number of objects can be used as input. In general, the function combines data by creating a master time series that is used to iteratively merge all swmpr objects. The time series for merging depends on the value passed to the method
argument. Passing 'union'
to method
will create a time series that is continuous starting from the earliest date and the latest date for all input objects. Passing 'intersect'
to method
will create a time series that is continuous from the set of dates that are shared between all input objects. Finally, a seven or eight character station name passed to method
will merge all input objects based on a continuous time series for the given station. The specified station must be present in the input data. Currently, combining data types from different stations is not possible, excluding weather data which are typically at a single, dedicated station.
Returns a combined swmpr object
## get wq and met data as separate objects for the same station swmp1 <- apacpnut swmp2 <- apaebmet ## combine nuts and wq data by union, set timestep to 120 minutes ## Not run: comb(swmp1, swmp2, timestep = 120, method = 'union') ## End(Not run)
## get wq and met data as separate objects for the same station swmp1 <- apacpnut swmp2 <- apaebmet ## combine nuts and wq data by union, set timestep to 120 minutes ## Not run: comb(swmp1, swmp2, timestep = 120, method = 'union') ## End(Not run)
Decompose data into trend, cyclical (e.g., daily, annual), and random components using decompose
and ts
decomp(dat_in, ...) ## S3 method for class 'swmpr' decomp( dat_in, param, type = "additive", frequency = "daily", start = NULL, ... ) ## Default S3 method: decomp( dat_in, param, date_col, type = "additive", frequency = "daily", start = NULL, ... )
decomp(dat_in, ...) ## S3 method for class 'swmpr' decomp( dat_in, param, type = "additive", frequency = "daily", start = NULL, ... ) ## Default S3 method: decomp( dat_in, param, date_col, type = "additive", frequency = "daily", start = NULL, ... )
dat_in |
input data object |
... |
arguments passed to |
param |
chr string of swmpr parameter to decompose |
type |
chr string of |
frequency |
chr string or numeric vector indicating the periodic component of the input parameter. Only |
start |
numeric vector indicating the starting value for the time series given the frequency. Only required if |
date_col |
chr string of the name of the date column |
This function is a simple wrapper to the decompose
function. The decompose
function separates a time series into additive or multiplicative components describing a trend, cyclical variation (e.g., daily or annual), and the remainder. The additive decomposition assumes that the cyclical component of the time series is stationary (i.e., the variance is constant), whereas a multiplicative decomposition accounts for non-stationarity. By default, a moving average with a symmetric window is used to filter the cyclical component. Alternatively, a vector of filter coefficients in reverse time order can be supplied (see decompose
).
The decompose
function requires a ts object with a specified frequency. The decomp
function converts the input swmpr vector to a ts object prior to decompose
. This requires an explicit input defining the frequency in the time series required to complete a full period of the parameter. For example, the frequency of a parameter with diurnal periodicity would be 96 if the time step is 15 minutes (24 hours * 60 minutes / 15 minutes). The frequency of a parameter with annual periodicity at a 15 minute time step would be 35040 (365 days * 24 hours * 60 minutes / 15 minutes). For simplicity, chr strings of 'daily'
or 'annual'
can be supplied in place of numeric values. A starting value of the time series must be supplied in the latter case. Use of the setstep
function is required to standardize the time step prior to decomposition.
Note that the decompose
function is a relatively simple approach and alternative methods should be investigated if a more sophisticated decomposition is desired.
Returns a decomposed.ts object
M. Kendall and A. Stuart (1983) The Advanced Theory of Statistics, Vol. 3, Griffin. pp. 410-414.
## get data data(apadbwq) swmp1 <- apadbwq ## subset for daily decomposition dat <- subset(swmp1, subset = c('2013-07-01 00:00', '2013-07-31 00:00')) ## decomposition and plot test <- decomp(dat, param = 'do_mgl', frequency = 'daily') plot(test) ## dealing with missing values dat <- subset(swmp1, subset = c('2013-06-01 00:00', '2013-07-31 00:00')) ## this returns an error ## Not run: test <- decomp(dat, param = 'do_mgl', frequency = 'daily') ## End(Not run) ## how many missing values? sum(is.na(dat$do_mgl)) ## use na.approx to interpolate missing data dat <- na.approx(dat, params = 'do_mgl', maxgap = 10) ## decomposition and plot test <- decomp(dat, param = 'do_mgl', frequency = 'daily') plot(test)
## get data data(apadbwq) swmp1 <- apadbwq ## subset for daily decomposition dat <- subset(swmp1, subset = c('2013-07-01 00:00', '2013-07-31 00:00')) ## decomposition and plot test <- decomp(dat, param = 'do_mgl', frequency = 'daily') plot(test) ## dealing with missing values dat <- subset(swmp1, subset = c('2013-06-01 00:00', '2013-07-31 00:00')) ## this returns an error ## Not run: test <- decomp(dat, param = 'do_mgl', frequency = 'daily') ## End(Not run) ## how many missing values? sum(is.na(dat$do_mgl)) ## use na.approx to interpolate missing data dat <- na.approx(dat, params = 'do_mgl', maxgap = 10) ## decomposition and plot test <- decomp(dat, param = 'do_mgl', frequency = 'daily') plot(test)
Decompose monthly SWMP time series into grandmean, annual, seasonal, and event series as described in Cloern and Jassby 2010.
decomp_cj(dat_in, ...) ## S3 method for class 'swmpr' decomp_cj( dat_in, param, vals_out = FALSE, event = TRUE, type = c("add", "mult"), center = c("mean", "median"), ... ) ## Default S3 method: decomp_cj( dat_in, param, date_col, vals_out = FALSE, event = TRUE, type = c("add", "mult"), center = c("mean", "median"), ... )
decomp_cj(dat_in, ...) ## S3 method for class 'swmpr' decomp_cj( dat_in, param, vals_out = FALSE, event = TRUE, type = c("add", "mult"), center = c("mean", "median"), ... ) ## Default S3 method: decomp_cj( dat_in, param, date_col, vals_out = FALSE, event = TRUE, type = c("add", "mult"), center = c("mean", "median"), ... )
dat_in |
input data object |
... |
additional arguments passed to or from other methods |
param |
chr string of variable to decompose |
vals_out |
logical indicating of numeric output is returned, default is |
event |
logical indicating if an 'events' component should be determined |
type |
chr string indicating the type of decomposition, either additive ( |
center |
chr string indicating the method of centering, either |
date_col |
chr string indicating the name of the date column which should be a date or POSIX object. |
This function is a simple wrapper to the decompTs
function in the archived wq package, also described in Cloern and Jassby (2010). The function is similar to decomp.swmpr
(which is a wrapper to decompose
) with a few key differences. The decomp.swmpr
function decomposes the time series into a trend, seasonal, and random components, whereas the current function decomposes into the grandmean, annual, seasonal, and events components. For both functions, the random or events components, respectively, can be considered anomalies that don't follow the trends in the remaining categories.
The decomp_cj
function provides only a monthly decomposition, which is appropriate for characterizing relatively long-term trends. This approach is meant for nutrient data that are obtained on a monthly cycle. The function will also work with continuous water quality or weather data but note that the data are first aggregated on the monthly scale before decomposition. Use the decomp.swmpr
function to decompose daily variation.
A ggplot
object if vals_out = FALSE
(default), otherwise a monthly time series matrix of class ts
.
Cloern, J.E., Jassby, A.D. 2010. Patterns and scales of phytoplankton variability in estuarine-coastal ecosystems. Estuaries and Coasts. 33:230-241.
## get data data(apacpnut) dat <- apacpnut dat <- qaqc(dat, qaqc_keep = NULL) ## decomposition of chl, values as data.frame decomp_cj(dat, param = 'chla_n', vals_out = TRUE) ## decomposition of chl, ggplot decomp_cj(dat, param = 'chla_n') ## decomposition changing arguments passed to decompTs decomp_cj(dat, param = 'chla_n', type = 'mult') ## Not run: ## monthly decomposition of continuous data data(apacpwq) dat2 <- qaqc(apacpwq) decomp_cj(dat2, param = 'do_mgl') ## using the default method with a data frame dat <- data.frame(dat) decomp_cj(dat, param = 'chla_n', date_col = 'datetimestamp') ## End(Not run)
## get data data(apacpnut) dat <- apacpnut dat <- qaqc(dat, qaqc_keep = NULL) ## decomposition of chl, values as data.frame decomp_cj(dat, param = 'chla_n', vals_out = TRUE) ## decomposition of chl, ggplot decomp_cj(dat, param = 'chla_n') ## decomposition changing arguments passed to decompTs decomp_cj(dat, param = 'chla_n', type = 'mult') ## Not run: ## monthly decomposition of continuous data data(apacpwq) dat2 <- qaqc(apacpwq) decomp_cj(dat2, param = 'do_mgl') ## using the default method with a data frame dat <- data.frame(dat) decomp_cj(dat, param = 'chla_n', date_col = 'datetimestamp') ## End(Not run)
The function decomposes a time series into a long-term mean, annual, seasonal and "events" component. The decomposition can be multiplicative or additive, and based on median or mean centering. Function and documentation herein are from archived wq package.
decompTs( x, event = TRUE, type = c("add", "mult"), center = c("mean", "median") )
decompTs( x, event = TRUE, type = c("add", "mult"), center = c("mean", "median") )
x |
a monthly time series vector |
event |
whether or not an "events" component should be determined |
type |
the type of decomposition, either multiplicative ("mult") or additive ("add") |
center |
the method of centering, either median or mean |
The rationale for this simple approach to decomposing a time series, with examples of its application, is given by Cloern and Jassby (2010). It is motivated by the observation that many important events for estuaries (e.g., persistent dry periods, species invasions) start or stop suddenly. Smoothing to extract the annualized term, which can disguise the timing of these events and make analysis of them unnecessarily difficult, is not used.
A multiplicative decomposition will typically be useful for a biological community- or population-related variable (e.g., chlorophyll-a) that experiences exponential changes in time and is approximately lognormal, whereas an additive decomposition is more suitable for a normal variable. The default centering method is the median, especially appropriate for series that have large, infrequent events.
If event = TRUE
, the seasonal component represents a recurring monthly pattern and the events component a residual series. Otherwise, the seasonal component becomes the residual series. The latter is appropriate when seasonal patterns change systematically over time.
A monthly time series matrix with the following individual time series:
original |
original time series |
annual |
annual mean series |
seasonal |
repeating seasonal component |
events |
optionally, the residual or "events" series |
Cloern, J.E. and Jassby, A.D. (2010) Patterns and scales of phytoplankton variability in estuarine-coastal ecosystems. Estuaries and Coasts 33, 230–241.
Estimate ecosystem metabolism using the Odum open-water method. Estimates of daily integrated gross production, total respiration, and net ecosystem metabolism are returned.
ecometab(dat_in, ...) ## S3 method for class 'swmpr' ecometab(dat_in, depth_val = NULL, metab_units = "mmol", trace = FALSE, ...) ## Default S3 method: ecometab( dat_in, tz, lat, long, depth_val = NULL, metab_units = "mmol", trace = FALSE, ... )
ecometab(dat_in, ...) ## S3 method for class 'swmpr' ecometab(dat_in, depth_val = NULL, metab_units = "mmol", trace = FALSE, ...) ## Default S3 method: ecometab( dat_in, tz, lat, long, depth_val = NULL, metab_units = "mmol", trace = FALSE, ... )
dat_in |
Input data object, see details for required time series |
... |
arguments passed to other methods |
depth_val |
numeric value for station depth (m) if time series is not available |
metab_units |
chr indicating desired units of output for oxygen, either as mmol or grams |
trace |
logical indicating if progress is shown in the console |
tz |
chr string for timezone, e.g., 'America/Chicago' |
lat |
numeric for latitude |
long |
numeric for longitude (negative west of prime meridian) |
Input data include both water quality and weather time series, which are typically collected with independent instrument systems. For SWMP data, this requires merging water quality and meteorology swmpr
data objects using the comb
function (see examples). For the default method not using SWMP data, the input data.frame
must have columns named datetimestamp
(date/time column, as POSIXct
object), do_mgl
(dissolved oxygen, mg/L), depth
(depth, m), atemp
(air temperature, C), sal
(salinity, psu), temp
(water temperature, C), wspd
(wind speed, m/s), and bp
(barometric pressure, mb).
The open-water method is a common approach to quantify net ecosystem metabolism using a mass balance equation that describes the change in dissolved oxygen over time from the balance between photosynthetic and respiration processes, corrected using an empirically constrained air-sea gas diffusion model (see Ro and Hunt 2006, Thebault et al. 2008). The diffusion-corrected DO flux estimates are averaged separately over each day and night of the time series. The nighttime average DO flux is used to estimate respiration rates, while the daytime DO flux is used to estimate net primary production. To generate daily integrated rates, respiration rates are assumed constant such that hourly night time DO flux rates are multiplied by 24. Similarly, the daytime DO flux rates are multiplied by the number of daylight hours, which varies with location and time of year, to yield net daytime primary production. Respiration rates are subtracted from daily net production estimates to yield gross production rates. The metabolic day is considered the 24 hour period between sunsets on two adjacent calendar days.
Areal rates for gross production and total respiration are based on volumetric rates normalized to the depth of the water column at the sampling location, which is assumed to be well-mixed, such that the DO sensor is reflecting the integrated processes in the entire water column (including the benthos). Water column depth is calculated as the mean value of the depth variable across the time series in the swmpr
object. Depth values are floored at one meter for very shallow stations and 0.5 meters is also added to reflect the practice of placing sensors slightly off of the bottom. A user-supplied depth value can also be passed to the depth_val
argument, either as a single value that is repeated or as a vector equal in length to the number of rows in the input data. An accurate depth value should be used as this acts as a direct scalar on metabolism estimates.
The air-sea gas exchange model is calibrated with wind data either collected at, or adjusted to, wind speed at 10 m above the surface. The metadata should be consulted for exact height. The value can be changed manually using a height
argument, which is passed to calckl
.
A minimum of three records are required for both day and night periods to calculate daily metabolism estimates. Occasional missing values for air temperature, barometric pressure, and wind speed are replaced with the climatological means (hourly means by month) for the period of record using adjacent data within the same month as the missing data.
All DO calculations within the function are done using molar units (e.g., mmol O2 m-3). The output can be returned as mass units by setting metab_units = 'grams'
(i.e., 1mol = 32 g O2, which multiplies all estimates by 32 g mol-1/1000 mg/g). Input data must be in standard mass units for DO (mg L-1).
The specific approach for estimating metabolism with the open-water method is described in Caffrey et al. 2013 and references cited therein.
A data.frame
of daily integrated metabolism estimates is returned. If a swmpr
object is passed to the function, this data.frame
is added to the metab
attribute and the original object is returned. See the examples for retrieval from a swmpr
object. The metabolism data.frame
contains the following:
date
The metabolic day, defined as the 24 hour period starting at sunrise (calculated using metab_day
)
DOF_d
Mean DO flux during day hours, mmol m-2 hr-1. Day hours are calculated using the metab_day
function.
D_d
Mean air-sea gas exchange of DO during day hours, mmol m-2 hr-1
DOF_n
Mean DO flux during night hours, mmol m-2 hr-1
D_n
Mean air-sea gas exchange of DO during night hours, mmol m-2 hr-1
Pg
Gross production, mmol m-2 d-1, calculated as ((DOF_d - D_d) - (DOF_n - D_n)) * day hours
Rt
Total respiration, mmol m-2 d-1, calculated as (DOF_n - D_n) * 24
NEM
Net ecosytem metabolism, mmol m-2 d-1, calculated as Pg + Rt
Caffrey JM, Murrell MC, Amacker KS, Harper J, Phipps S, Woodrey M. 2013. Seasonal and inter-annual patterns in primary production, respiration and net ecosystem metabolism in 3 estuaries in the northeast Gulf of Mexico. Estuaries and Coasts. 37(1):222-241.
Odum HT. 1956. Primary production in flowing waters. Limnology and Oceanography. 1(2):102-117.
Ro KS, Hunt PG. 2006. A new unified equation for wind-driven surficial oxygen transfer into stationary water bodies. Transactions of the American Society of Agricultural and Biological Engineers. 49(5):1615-1622.
Thebault J, Schraga TS, Cloern JE, Dunlavey EG. 2008. Primary production and carrying capacity of former salt ponds after reconnection to San Francisco Bay. Wetlands. 28(3):841-851.
calckl
for estimating the oxygen mass transfer coefficient used with the air-sea gas exchange model, comb
for combining swmpr
objects, metab_day
for identifying the metabolic day for each observation in the time series, plot_metab
for plotting the results, and aggremetab
for aggregating the metabolism attribute.
## Not run: ## import water quality and weather data, qaqc data(apadbwq) data(apaebmet) wq <- qaqc(apadbwq) met <- qaqc(apaebmet) ## combine dat <- comb(wq, met) ## output units in grams of oxygen res <- ecometab(dat, metab_units = 'grams') attr(res, 'metabolism') ## manual input of integration depth ## NA values must be filled dat_fill <- na.approx(dat, params = 'depth', maxgap = 1e6) depth <- dat_fill$depth res <- ecometab(dat, metab_units = 'grams', depth_val = depth) attr(res, 'metabolism') ## use the default method for ecometab with a generic data frame ## first recreate a generic object from the sample data cols <- c('datetimestamp', 'do_mgl', 'depth', 'atemp', 'sal', 'temp', 'wspd', 'bp') dat <- data.frame(dat) dat <- dat[, cols] res <- ecometab(dat, tz = 'America/Jamaica', lat = 29.67, long = -85.06) res ## End(Not run)
## Not run: ## import water quality and weather data, qaqc data(apadbwq) data(apaebmet) wq <- qaqc(apadbwq) met <- qaqc(apaebmet) ## combine dat <- comb(wq, met) ## output units in grams of oxygen res <- ecometab(dat, metab_units = 'grams') attr(res, 'metabolism') ## manual input of integration depth ## NA values must be filled dat_fill <- na.approx(dat, params = 'depth', maxgap = 1e6) depth <- dat_fill$depth res <- ecometab(dat, metab_units = 'grams', depth_val = depth) attr(res, 'metabolism') ## use the default method for ecometab with a generic data frame ## first recreate a generic object from the sample data cols <- c('datetimestamp', 'do_mgl', 'depth', 'atemp', 'sal', 'temp', 'wspd', 'bp') dat <- data.frame(dat) dat <- dat[, cols] res <- ecometab(dat, tz = 'America/Jamaica', lat = 29.67, long = -85.06) res ## End(Not run)
Get gradient default colors for plots
gradcols(col_vec = NULL)
gradcols(col_vec = NULL)
col_vec |
chr string of plot colors to use. Any color palette from RColorBrewer can be used as a named input. Palettes from grDevices must be supplied as the returned string of colors for each palette. |
This is a convenience function for retrieving a color palette. Palettes from RColorBrewer will use the maximum number of colors. The default palette is 'Spectral'.
A character vector of colors in hexidecimal notation.
Plot a histogram showing the distribution of a swmpr parameter
## S3 method for class 'swmpr' hist(x, ...)
## S3 method for class 'swmpr' hist(x, ...)
x |
input swmpr object |
... |
other arguments passed to |
The swmpr method for histograms is a convenience function for the default histogram function. Conventional histogram methods also work well since swmpr objects are also data frames. The input data must contain only one parameter.
## get data data(apadbwq) dat <- subset(apadbwq, select = 'do_mgl') ## histogram using swmpr method hist(dat) ## change axis labels, plot title hist(dat, xlab = 'Dissolved oxygen', main = 'Histogram of DO') ## plot using default method hist(dat$do_mgl)
## get data data(apadbwq) dat <- subset(apadbwq, select = 'do_mgl') ## histogram using swmpr method hist(dat) ## change axis labels, plot title hist(dat, xlab = 'Dissolved oxygen', main = 'Histogram of DO') ## plot using default method hist(dat$do_mgl)
Import local data that were obtained from the CDMO through the zip downloads feature
import_local( path, station_code, trace = FALSE, collMethd = c("1", "2"), keep_qaqcstatus = FALSE )
import_local( path, station_code, trace = FALSE, collMethd = c("1", "2"), keep_qaqcstatus = FALSE )
path |
chr string of full path to .csv files with raw data, can be a zipped or unzipped directory where the former must include the .zip extension |
station_code |
chr string of station to import, typically 7 or 8 characters including wq, nut, or met extensions, may include full name with year, excluding file extension |
trace |
logical indicating if progress is sent to console, default |
collMethd |
chr string of nutrient data to subset. 1 indicates monthly, 2 indicates diel. Default is both diel and monthly data. |
keep_qaqcstatus |
logical indicating if the |
The function is designed to import local data that were downloaded from the CDMO outside of R. This approach works best for larger data requests, specifically those from the zip downloads feature in the advanced query section of the CDMO. The function may also work using data from the data export system, but this feature has not been extensively tested. The downloaded data will be in a compressed folder that includes multiple .csv files by year for a given data type (e.g., apacpwq2002.csv, apacpwq2003.csv, apacpnut2002.csv, etc.). The import_local function can be used to import files directly from the compressed folder or after the folder is decompressed. In the former case, the requested files are extracted to a temporary directory and then deleted after they are loaded into the current session. An example dataset is available online to illustrate the format of the data provided through the zip downloads feature. See the link below to access these data. All example datasets included with the package were derived from these raw data.
Occasionally, duplicate time stamps are present in the raw data. The function handles duplicate entries differently depending on the data type (water quality, weather, or nutrients). For water quality and nutrient data, duplicate time stamps are simply removed. Note that nutrient data often contain replicate samples with similar but not duplicated time stamps within a few minutes of each other. Replicates with unique time stamps are not removed but can be further processed using rem_reps
. Weather data prior to 2007 may contain duplicate time stamps at frequencies for 60 (hourly) and 144 (daily) averages, in addition to 15 minute frequencies. Duplicate values that correspond to the smallest value in the frequency column (15 minutes) are retained.
If keep_qaqcstatus = TRUE
, the historical
and provisionalplus
columns are retained in the output. These two columns include integer values as 0 or 1. From the CDMO web page, a value of 0 in the historical
column indicates that the data have not been through final QAQC by the CDMO. A value of 1 indicates that the data have been through final tertiary review at the CDMO and posted as the final authoritative data. A value of 0 in the provisionalplus
column indicates that the data have been through the automated flagging process (primary QAQC) only and have not been checked by the Reserve. A value of 1 in the provisionalplus
column indicates that the data have been through secondary QAQC at the Reserve using Excel macros (provided by the CDMO) to further QAQC the data.
Zip download request through CDMO: https://cdmo.baruch.sc.edu/aqs/zips.cfm
Example dataset: https://s3.amazonaws.com/swmpexdata/zip_ex.zip
Returns a swmpr object with all parameters and QAQC columns for the station. The full date range in the raw data are also imported.
all_params
, all_params_dtrng
, rem_reps
, single_param
## Not run: ## this is the path for csv example files, decompressed path <- 'C:/this/is/my/data/path' ## import, do not include file extension import_local(path, 'apaebmet') ## this is the path for csv example files, zipped folder path <- 'C:/this/is/my/data/path.zip' ## import, do not include file extension import_local(path, 'apaebmet') ## End(Not run)
## Not run: ## this is the path for csv example files, decompressed path <- 'C:/this/is/my/data/path' ## import, do not include file extension import_local(path, 'apaebmet') ## this is the path for csv example files, zipped folder path <- 'C:/this/is/my/data/path.zip' ## import, do not include file extension import_local(path, 'apaebmet') ## End(Not run)
Create a map of all the stations in a reserve
map_reserve(nerr_site_id, text_sz = 6, text_col = "black", f = 0.2)
map_reserve(nerr_site_id, text_sz = 6, text_col = "black", f = 0.2)
nerr_site_id |
chr string of the reserve to map, first three characters used by NERRS or vector of stations to map using the first five characters |
text_sz |
numeric value for text size of station names, passed to |
text_col |
chr string for text color of station names, passed to |
f |
numeric passed to |
This function is a simple wrapper to functions in the ggmap package which returns a map of all of the stations at a NERRS reserve. The f
argument may have to be chosen through trial and error depending on the spatial extent of the reserve. A local data file included with the package is used to get the latitude and longitude values of each station. The files includes only active stations as of January 2015.
A ggplot
object for plotting.
## Not run: map_reserve('jac') ## End(Not run)
## Not run: map_reserve('jac') ## End(Not run)
Identify metabolic days in a time series based on sunrise and sunset times for a location and date. The metabolic day is considered the 24 hour period between sunsets for two adjacent calendar days. The function calls the getSunlightTimes
function from the suncalc package.
metab_day(dat_in, ...) ## Default S3 method: metab_day(dat_in, tz, lat, long, ...)
metab_day(dat_in, ...) ## Default S3 method: metab_day(dat_in, tz, lat, long, ...)
dat_in |
data.frame |
... |
arguments passed to or from other methods |
tz |
chr string for timezone, e.g., 'America/Chicago' |
lat |
numeric for latitude |
long |
numeric for longitude (negative west of prime meridian) |
This function is only used within ecometab
and should not be called explicitly.
Linearly interpolate gaps in swmpr data within a maximum size
## S3 method for class 'swmpr' na.approx(object, params = NULL, maxgap, ...)
## S3 method for class 'swmpr' na.approx(object, params = NULL, maxgap, ...)
object |
input swmpr object |
params |
is chr string of swmpr parameters to smooth, default all |
maxgap |
numeric vector indicating maximum gap size to interpolate where size is numer of records, must be explicit |
... |
additional arguments passed to other methods |
A common approach for handling missing data in time series analysis is linear interpolation. A simple curve fitting method is used to create a continuous set of records between observations separated by missing data. A required argument for the function is maxgap
which defines the maximum gap size for interpolation. The ability of the interpolated data to approximate actual, unobserved trends is a function of the gap size. Interpolation between larger gaps are less likely to resemble patterns of an actual parameter, whereas interpolation between smaller gaps may be more likely to resemble actual patterns. An appropriate gap size limit depends on the unique characteristics of specific datasets or parameters.
Returns a swmpr object. QAQC columns are removed if included with input object.
data(apadbwq) dat <- qaqc(apadbwq) dat <- subset(dat, select = 'do_mgl', subset = c('2013-01-22 00:00', '2013-01-26 00:00')) # interpolate, maxgap of 10 records fill1 <- na.approx(dat, params = 'do_mgl', maxgap = 10) # interpolate maxgap of 30 records fill2 <- na.approx(dat, params = 'do_mgl', maxgap = 30) # plot for comparison par(mfrow = c(3, 1)) plot(fill1, col = 'red', main = 'Interpolation - maximum gap of 10 records') lines(dat) plot(fill2, col = 'red', main = 'Interpolation - maximum gap of 30 records') lines(dat)
data(apadbwq) dat <- qaqc(apadbwq) dat <- subset(dat, select = 'do_mgl', subset = c('2013-01-22 00:00', '2013-01-26 00:00')) # interpolate, maxgap of 10 records fill1 <- na.approx(dat, params = 'do_mgl', maxgap = 10) # interpolate maxgap of 30 records fill2 <- na.approx(dat, params = 'do_mgl', maxgap = 30) # plot for comparison par(mfrow = c(3, 1)) plot(fill1, col = 'red', main = 'Interpolation - maximum gap of 10 records') lines(dat) plot(fill2, col = 'red', main = 'Interpolation - maximum gap of 30 records') lines(dat)
Plot multiple SWMP time series on the same y-axis, aka overplotting
overplot(dat_in, ...) ## S3 method for class 'swmpr' overplot( dat_in, select = NULL, subset = NULL, operator = NULL, ylabs = NULL, xlab = NULL, cols = NULL, lty = NULL, lwd = NULL, pch = NULL, type = NULL, ... ) ## Default S3 method: overplot( dat_in, date_var, select = NULL, ylabs = NULL, xlab = NULL, cols = NULL, lty = NULL, lwd = NULL, inset = -0.15, cex = 1, xloc = "top", yloc = NULL, pch = NULL, type = NULL, ... )
overplot(dat_in, ...) ## S3 method for class 'swmpr' overplot( dat_in, select = NULL, subset = NULL, operator = NULL, ylabs = NULL, xlab = NULL, cols = NULL, lty = NULL, lwd = NULL, pch = NULL, type = NULL, ... ) ## Default S3 method: overplot( dat_in, date_var, select = NULL, ylabs = NULL, xlab = NULL, cols = NULL, lty = NULL, lwd = NULL, inset = -0.15, cex = 1, xloc = "top", yloc = NULL, pch = NULL, type = NULL, ... )
dat_in |
input data object |
... |
additional arguments passed to |
select |
chr string of variable(s) to plot, passed to |
subset |
chr string of form 'YYYY-mm-dd HH:MM' to subset a date range. Input can be one (requires operator) or two values (a range). Passed to |
operator |
chr string specifiying binary operator (e.g., '>', '<=') if subset is one date value, passed to |
ylabs |
chr string of labels for y-axes, default taken from |
xlab |
chr string of label for x-axis |
cols |
chr string of colors to use for lines |
lty |
numeric indicating line types, one value for all or values for each parameter |
lwd |
numeric indicating line widths, one value for all or values for each parameter, used as |
pch |
numeric for point type of points are used |
type |
character string indicating |
date_var |
chr string of the name for the datetimestamp column, not required for |
inset |
numeric of relative location of legend, passed to |
cex |
numeric of scale factor for legend, passed to |
xloc |
x location of legend, passed to |
yloc |
y location of legend, passed to |
One to many SWMP parameters can be plotted on the same y-axis to facilitate visual comparison. This is commonly known as overplotting. The building blocks of this function include plot
, legend
, axis
, and mtext
.
An R plot created using base graphics
## import data data(apacpwq) dat <- qaqc(apacpwq) ## plot overplot(dat) ## a truly heinous plot overplot(dat, select = c('depth', 'do_mgl', 'ph', 'turb'), subset = c('2013-01-01 0:0', '2013-02-01 0:0'), lwd = 2) ## Not run: ## change the type argument if plotting discrete and continuous data swmp1 <- apacpnut swmp2 <- apaebmet dat <- comb(swmp1, swmp2, timestep = 120, method = 'union') overplot(dat, select = c('chla_n', 'atemp'), subset = c('2012-01-01 0:0', '2013-01-01 0:0'), type = c('p', 'l')) ## End(Not run)
## import data data(apacpwq) dat <- qaqc(apacpwq) ## plot overplot(dat) ## a truly heinous plot overplot(dat, select = c('depth', 'do_mgl', 'ph', 'turb'), subset = c('2013-01-01 0:0', '2013-02-01 0:0'), lwd = 2) ## Not run: ## change the type argument if plotting discrete and continuous data swmp1 <- apacpnut swmp2 <- apaebmet dat <- comb(swmp1, swmp2, timestep = 120, method = 'union') overplot(dat, select = c('chla_n', 'atemp'), subset = c('2012-01-01 0:0', '2013-01-01 0:0'), type = c('p', 'l')) ## End(Not run)
Finds dissolved oxygen concentration in equilibrium with water-saturated air. Function and documentation herein are from archived wq package.
oxySol(t, S, P = NULL)
oxySol(t, S, P = NULL)
t |
numeric for temperature, degrees C |
S |
numeric for salinity, on the Practical Salinity Scale |
P |
numeric for pressure, atm |
Calculations are based on the approach of Benson and Krause (1984), using Green and Carritt's (1967) equation for dependence of water vapor partial pressure on t
and S
. Equations are valid for temperature in the range 0-40 C and salinity in the range 0-40.
Dissolved oxygen concentration in mg/L at 100% saturation. If P = NULL
, saturation values at 1 atm are calculated.
Benson, B.B. and Krause, D. (1984) The concentration and isotopic fractionation of oxygen dissolved in fresh-water and seawater in equilibrium with the atmosphere. Limnology and Oceanography 29, 620-632.
Green, E.J. and Carritt, D.E. (1967) New tables for oxygen saturation of seawater. Journal of Marine Research 25, 140-147.
Get parameter column names for each parameter type
param_names(param_type = c("nut", "wq", "met"))
param_names(param_type = c("nut", "wq", "met"))
param_type |
chr string specifying |
This function is used internally within several functions to return a list of the expected parameters for a given parameter type: nutrients, water quality, or meteorological. It does not need to be called explicitly.
Returns a named list of parameters for the param_type
. The parameter names are lower-case strings of SWMP parameters and corresponding qaqc names ('f_'
prefix)
Parsing function for objects returned from CDMO web services
parser(resp_in, parent_in = "data")
parser(resp_in, parent_in = "data")
resp_in |
web object returned from CDMO server, response class from httr package |
parent_in |
chr string of parent nodes to parse |
This function parses XML objects returned from the CDMO server, which are further passed to swmpr
. It is used internally by the data retrieval functions, excluding import_local
. The function does not need to be called explicitly.
Returns a data.frame
of parsed XML nodes
Plot gross production, total respiration, and net ecosystem metabolism for a swmpr object.
plot_metab(swmpr_in, ...) ## S3 method for class 'swmpr' plot_metab( swmpr_in, by = "months", alpha = 0.05, width = 10, pretty = TRUE, ... )
plot_metab(swmpr_in, ...) ## S3 method for class 'swmpr' plot_metab( swmpr_in, by = "months", alpha = 0.05, width = 10, pretty = TRUE, ... )
swmpr_in |
input swmpr object |
... |
arguments passed to or from other methods |
by |
chr string describing aggregation period, passed to |
alpha |
numeric indicating alpha level for confidence intervals in aggregated data. Use |
width |
numeric indicating width of top and bottom segments on error bars |
pretty |
logical indicating use of predefined plot aesthetics |
A plot will only be returned if the metabolism
attribute for the swmpr
object is not NULL
. Daily metabolism estimates are also aggregated into weekly averages. Accepted aggregation periods are 'years'
, 'quarters'
, 'months'
, 'weeks'
, and 'days'
(if no aggregation is preferred).
By default, pretty = TRUE
will return a ggplot
object with predefined aesthetics. Setting pretty = FALSE
will return the plot with minimal modifications to the ggplot
object. Use the latter approach for easier customization of the plot.
A ggplot
object which can be further modified.
## Not run: ## import water quality and weather data data(apadbwq) data(apaebmet) ## qaqc, combine wq <- qaqc(apadbwq) met <- qaqc(apaebmet) dat <- comb(wq, met) ## estimate metabolism res <- ecometab(dat) ## plot plot_metab(res) ## change alpha, aggregation period, widths plot_metab(res, by = 'quarters', alpha = 0.1, widths = 0) ## plot daily raw, no aesthetics plot_metab(res, by = 'days', pretty = FALSE) ## note the difference if aggregating with a moving window average plot_metab(res, by = 30) ## End(Not run)
## Not run: ## import water quality and weather data data(apadbwq) data(apaebmet) ## qaqc, combine wq <- qaqc(apadbwq) met <- qaqc(apaebmet) dat <- comb(wq, met) ## estimate metabolism res <- ecometab(dat) ## plot plot_metab(res) ## change alpha, aggregation period, widths plot_metab(res, by = 'quarters', alpha = 0.1, widths = 0) ## plot daily raw, no aesthetics plot_metab(res, by = 'days', pretty = FALSE) ## note the difference if aggregating with a moving window average plot_metab(res, by = 30) ## End(Not run)
A line for a single year is plotted over ribbons ofquantiles for historical data.
plot_quants(swmpr_in, ...) ## S3 method for class 'swmpr' plot_quants( swmpr_in, paramtoplot, yr, yrstart, yrend, yaxislab = NULL, yrcolor = "red3", bgcolor1 = "lightgray", bgcolor2 = "gray65", maintitle = NULL, ... )
plot_quants(swmpr_in, ...) ## S3 method for class 'swmpr' plot_quants( swmpr_in, paramtoplot, yr, yrstart, yrend, yaxislab = NULL, yrcolor = "red3", bgcolor1 = "lightgray", bgcolor2 = "gray65", maintitle = NULL, ... )
swmpr_in |
input swmpr object. |
... |
additional arguments passed to or from other methods |
paramtoplot |
chr string of parameter to plot |
yr |
numeric of year to feature as a line on the plot |
yrstart |
numeric of year to begin range of comparison data |
yrend |
numeric of year to end range of comparison data |
yaxislab |
chr string for y-axis albel. Default is |
yrcolor |
chr string of line color for year of interest |
bgcolor1 |
chr string of color for outer 50% of data range |
bgcolor2 |
chr string of color for middle 50% of data range. |
maintitle |
chr string of plot title. Default pastes together site name, parameter name, year to feature, and range of years to use for comparison, e.g. 'GNDBHWQ 2017 Daily Average Temp overlaid on 2006-2016 daily averages'. |
The plot is based on aggregates of daily average values for the entire time series. Quantiles (min, 25%, 75%, max) for each individual calendar day (01/01, 01/02, ... 12/31) are used to generate a ribbon plot of historical data and the selected year in yr
is plotted as a line over the ribbon for historical context.
required packages: dplyr, lubridate, ggplot2, tibble
A a ggplot2
object.
Kimberly Cressman, Marcus Beck
# qaqc dat <- qaqc(apacpwq) # generate a plot of salinity for 2013 overlaid on 2012-2013 data plot_quants(dat, 'sal', yr = 2013, yrstart = 2012, yrend = 2013) # change some of the defaults plot_quants(dat, 'sal', yr = 2013, yrstart = 2012, yrend = 2013, bgcolor1 = 'lightsteelblue2', bgcolor2 = 'lightsteelblue4', yaxislab = 'Salinity (psu)')
# qaqc dat <- qaqc(apacpwq) # generate a plot of salinity for 2013 overlaid on 2012-2013 data plot_quants(dat, 'sal', yr = 2013, yrstart = 2012, yrend = 2013) # change some of the defaults plot_quants(dat, 'sal', yr = 2013, yrstart = 2012, yrend = 2013, bgcolor1 = 'lightsteelblue2', bgcolor2 = 'lightsteelblue4', yaxislab = 'Salinity (psu)')
Plot graphical summaries of SWMP data for individual parameters, including seasonal/annual trends and anomalies
plot_summary(swmpr_in, ...) ## S3 method for class 'swmpr' plot_summary( swmpr_in, param, colsleft = c("lightblue", "lightgreen"), colsmid = "lightblue", colsright = c("lightblue", "lightgreen", "tomato1"), base_size = 11, years = NULL, plt_sep = FALSE, sum_out = FALSE, fill = c("none", "monoclim", "interp"), ... )
plot_summary(swmpr_in, ...) ## S3 method for class 'swmpr' plot_summary( swmpr_in, param, colsleft = c("lightblue", "lightgreen"), colsmid = "lightblue", colsright = c("lightblue", "lightgreen", "tomato1"), base_size = 11, years = NULL, plt_sep = FALSE, sum_out = FALSE, fill = c("none", "monoclim", "interp"), ... )
swmpr_in |
input swmpr object |
... |
additional arguments passed to other methods |
param |
chr string of variable to plot |
colsleft |
chr string vector of length two indicating colors for left plots |
colsmid |
chr string vector of length one indicating colors for middle plots |
colsright |
chr string vector of length three indicating colors for right plots |
base_size |
numeric for text size |
years |
numeric vector of starting and ending years to plot, default all |
plt_sep |
logical if a list is returned with separate plot elements |
sum_out |
logical if summary data for the plots is returned |
fill |
chr string indicating if missing monthly values are left as is ( |
This function creates several graphics showing seasonal and annual trends for a given swmp parameter. Plots include monthly distributions, monthly anomalies, and annual anomalies in multiple formats. Anomalies are defined as the difference between the monthly or annual average from the grand mean. Monthly anomalies are in relation to the grand mean for the same month across all years. All data are aggregated for quicker plotting. Nutrient data are based on monthly averages, wheras weather and water quality data are based on daily averages. Cumulative precipitation data are based on the daily maximum.
Individual plots can be obtained if plt_sep = TRUE
. Individual plots for elements one through six in the list correspond to those from top left to bottom right in the combined plot.
Summary data for the plots can be obtained if sum_out = TRUE
. This returns a list with three data frames with names sum_mo
, sum_moyr
, and sum_mo
. The data frames match the plots as follows: sum_mo
for the top left, bottom left, and center plots, sum_moyr
for the top right and middle right plots, and sum_yr
for the bottom right plot.
Missing values can be filled using the long-term average across years for each month (fill = 'monoclim'
) or as a linear interpolation between missing values using na.approx
(fill = 'interp'
). The monthly average works well for long gaps, but may not be an accurate representation of long-term trends, i.e., real averages may differ early vs late in the time series if a trend exists. The linear interpolation option is preferred for small gaps.
A graphics object (Grob) of multiple ggplot
objects, otherwise a list of individual ggplot
objects if plt_sep = TRUE
or a list with data frames of the summarized data if sum_out = TRUE
.
## import data data(apacpnut) dat <- qaqc(apacpnut) ## plot plot_summary(dat, param = 'chla_n', years = c(2007, 2013)) ## get individaul plots plots <- plot_summary(dat, param = 'chla_n', years = c(2007, 2013), plt_sep = TRUE) plots[[1]] # top left plots[[3]] # middle plots[[6]] # bottom right ## get summary data plot_summary(dat, param = 'chla_n', year = c(2007, 2013), sum_out = TRUE)
## import data data(apacpnut) dat <- qaqc(apacpnut) ## plot plot_summary(dat, param = 'chla_n', years = c(2007, 2013)) ## get individaul plots plots <- plot_summary(dat, param = 'chla_n', years = c(2007, 2013), plt_sep = TRUE) plots[[1]] # top left plots[[3]] # middle plots[[6]] # bottom right ## get summary data plot_summary(dat, param = 'chla_n', year = c(2007, 2013), sum_out = TRUE)
Create a wind rose from met data
plot_wind(swmpr_in, ...) ## S3 method for class 'swmpr' plot_wind( swmpr_in, years = NULL, angle = 45, width = 1.5, breaks = 5, paddle = FALSE, grid.line = 10, max.freq = 30, cols = "GnBu", annotate = FALSE, main = NULL, type = "default", between = list(x = 1, y = 1), par.settings = NULL, strip = NULL, ... )
plot_wind(swmpr_in, ...) ## S3 method for class 'swmpr' plot_wind( swmpr_in, years = NULL, angle = 45, width = 1.5, breaks = 5, paddle = FALSE, grid.line = 10, max.freq = 30, cols = "GnBu", annotate = FALSE, main = NULL, type = "default", between = list(x = 1, y = 1), par.settings = NULL, strip = NULL, ... )
swmpr_in |
input swmpr object |
... |
arguments passed to or from other methods |
years |
numeric of years to plot, defaults to most recent |
angle |
numeric for the number of degrees occupied by each spoke |
width |
numeric for width of paddles if |
breaks |
numeric for the number of break points in the wind speed |
paddle |
logical for paddles at the ends of the spokes |
grid.line |
numeric for grid line interval to use |
max.freq |
numeric for the scaling used to set the maximum value of the radial limits (like zoom) |
cols |
chr string for colors to use for plotting, can be any palette R recognizes or a collection of colors as a vector |
annotate |
logical indicating if text is shown on the bottom of the plot for the percentage of observations as 'calm' and mean values |
main |
chr string for plot title, defaults to station name and year plotted |
type |
chr string for temporal divisions of the plot, defaults to whole year. See details. |
between |
list for lattice plot options, defines spacing between plots |
par.settings |
list for optional plot formatting passed to |
strip |
list for optional strip formatting passed to |
This function is a convenience wrapper to windRose
. Most of the arguments are taken directly from this function.
The type
argument can be used for temporal divisions of the plot. Options include the entire year (type = "default"
), seasons (type = "season"
), months (type = "month"
), or weekdays (type = "weekday"
). Combinations are also possible (see windRose
).
A wind rose plot
Kimberly Cressman, Marcus Beck
plot_wind(apaebmet)
plot_wind(apaebmet)
Plot a time series of parameters in a swmpr object
## S3 method for class 'swmpr' plot(x, type = "l", ...) ## S3 method for class 'swmpr' lines(x, ...)
## S3 method for class 'swmpr' plot(x, type = "l", ...) ## S3 method for class 'swmpr' lines(x, ...)
x |
input swmpr object |
type |
chr string for type of plot, default |
... |
other arguments passed to |
The swmpr method for plotting is a convenience function for plotting a univariate time series. Conventional plotting methods also work well since swmpr objects are also data frames. See the examples for use with different methods.
## get data data(apadbwq) swmp1 <- apadbwq ## subset dat <- subset(swmp1, select = 'do_mgl', subset = c('2013-07-01 00:00', '2013-07-31 00:00')) ## plot using swmpr method, note default line plot plot(dat) ## plot using formula method plot(do_mgl ~ datetimestamp, dat) ## plot using default, add lines plot(dat, type = 'n') lines(dat, col = 'red')
## get data data(apadbwq) swmp1 <- apadbwq ## subset dat <- subset(swmp1, select = 'do_mgl', subset = c('2013-07-01 00:00', '2013-07-31 00:00')) ## plot using swmpr method, note default line plot plot(dat) ## plot using formula method plot(do_mgl ~ datetimestamp, dat) ## plot using default, add lines plot(dat, type = 'n') lines(dat, col = 'red')
QAQC filtering for SWMP data obtained from retrieval functions, local and remote
qaqc(swmpr_in, ...) ## S3 method for class 'swmpr' qaqc(swmpr_in, qaqc_keep = "0", trace = FALSE, ...)
qaqc(swmpr_in, ...) ## S3 method for class 'swmpr' qaqc(swmpr_in, qaqc_keep = "0", trace = FALSE, ...)
swmpr_in |
input swmpr object |
... |
arguments passed to or from other methods |
qaqc_keep |
character string of qaqc flags to keep, default |
trace |
logical for progress output on console, default |
The qaqc function is a simple screen to retain values from the data with specified QAQC flags, described online: https://cdmo.baruch.sc.edu/data/qaqc.cfm. Each parameter in the swmpr data typically has a corresponding QAQC column of the same name with the added prefix 'f_'. Values in the QAQC column specify a flag from -5 to 5. Generally, only data with the '0' QAQC flag should be used, which is the default option for the function. Data that do not satisfy QAQC criteria are converted to NA
values. Additionally, simple filters are used to remove obviously bad values, e.g., wind speed values less than zero or pH values greater than 12. Erroneous data entered as -99 are also removed. Processed data will have QAQC columns removed, in addition to removal of values in the actual parameter columns that do not meet the criteria.
The data are filtered by matching the flag columns with the character string provided by qaqc_keep
. A single combined string is created by pasting each element together using the '|' operator, then using partial string matching with grepl
to compare the actual flags in the QAQC columns. Values that can be passed to the function are those described online: https://cdmo.baruch.sc.edu/data/qaqc.cfm.
Returns a swmpr object with NA
values for records that did not match qaqc_keep
. QAQC columns are also removed.
## Not run: ## get data data(apadbwq) dat <- apadbwq ## retain only '0' and '-1' flags qaqc(dat, qaqc_keep = c('0', '-1')) ## retain observations with the 'CSM' error code qaqc(dat, qaqc_keep = 'CSM') ## End(Not run)
## Not run: ## get data data(apadbwq) dat <- apadbwq ## retain only '0' and '-1' flags qaqc(dat, qaqc_keep = c('0', '-1')) ## retain observations with the 'CSM' error code qaqc(dat, qaqc_keep = 'CSM') ## End(Not run)
Summary of the number of observations with a given QAQC flag for each parameter column of a swmpr object
qaqcchk(swmpr_in) ## S3 method for class 'swmpr' qaqcchk(swmpr_in)
qaqcchk(swmpr_in) ## S3 method for class 'swmpr' qaqcchk(swmpr_in)
swmpr_in |
input swmpr object |
Viewing the number of observations for each parameter that are assigned to a QAQC flag may be useful for deciding how to process the data qith qaqc. The qaqcchk
function can be used to view this information. Consult the online documentation for a description of each QAQC flag: https://cdmo.baruch.sc.edu/data/qaqc.cfm
Returns a data.frame
with columns for swmpr parameters and row counts indicating the number of observations in each parameter assigned to a flag value.
## get data data(apadbwq) dat <- apadbwq ## view the number observations in each QAQC flag qaqcchk(dat)
## get data data(apadbwq) dat <- apadbwq ## view the number observations in each QAQC flag qaqcchk(dat)
Remove replicates in SWMP nutrient data to keep approximate monthly time step
rem_reps(swmpr_in, ...) ## S3 method for class 'swmpr' rem_reps(swmpr_in, FUN = function(x) mean(x, na.rm = TRUE), ...)
rem_reps(swmpr_in, ...) ## S3 method for class 'swmpr' rem_reps(swmpr_in, FUN = function(x) mean(x, na.rm = TRUE), ...)
swmpr_in |
input swmpr object |
... |
arguments passed to other methods |
FUN |
function to combine values, defaults to mean |
Raw nutrient data obtained from the CDMO will usually include replicate samples that are taken within a few minutes of each other. This function combines nutrient data that occur on the same day. The datetimestamp
column will always be averaged for replicates, but the actual observations will be combined based on the user-supplied function which defauls to the mean. Other suggested functions include the median
, min
, or max
. The entire function call including treatment of NA
values should be passed to the FUN
argument (see the examples). The function is meant to be used after qaqc
processing, although it works with a warning if QAQC columns are present.
Returns a swmpr object for nutrient data with no replicates.
## get nutrient data data(apacpnut) swmp1 <- apacpnut # remove replicate nutrient data rem_reps(swmp1) # use different function to aggregate replicates func <- function(x) max(x, na.rm = TRUE) rem_reps(swmp1, FUN = func)
## get nutrient data data(apacpnut) swmp1 <- apacpnut # remove replicate nutrient data rem_reps(swmp1) # use different function to aggregate replicates func <- function(x) max(x, na.rm = TRUE) rem_reps(swmp1, FUN = func)
Create a continuous time vector at set time step for a swmpr object
setstep(dat_in, ...) ## S3 method for class 'swmpr' setstep(dat_in, timestep = 15, differ = NULL, ...) ## Default S3 method: setstep(dat_in, date_col, timestep = 15, differ = NULL, ...)
setstep(dat_in, ...) ## S3 method for class 'swmpr' setstep(dat_in, timestep = 15, differ = NULL, ...) ## Default S3 method: setstep(dat_in, date_col, timestep = 15, differ = NULL, ...)
dat_in |
input data object |
... |
arguments passed to or from other methods |
timestep |
numeric value of time step to use in minutes. Alternatively, a chr string indicating |
differ |
numeric value defining buffer for merging time stamps to standardized time series, defaults to one half of the timestep |
date_col |
chr string for the name of the date/time column, e.g., |
The setstep function formats a swmpr object to a continuous time series at a given time step. This function is not necessary for most stations but can be useful for combining data or converting an existing time series to a set interval. The first argument of the function, timestep
, specifies the desired time step in minutes starting from the nearest hour of the first observation. The second argument, differ
, specifies the allowable tolerance in minutes for matching existing observations to user-defined time steps in cases where the two are dissimilar. Values for differ
that are greater than one half the value of timestep are not allowed to prevent duplication of existing data. Likewise, the default value for differ is one half the time step. Rows that do not match any existing data within the limits of the differ argument are not discarded. Output from the function can be used with subset
and to create a time series at a set interval with empty data removed.
Returns a data object for the specified time step
## Not run: ## import data data(apaebmet) dat <- apaebmet ## convert time series to two hour invervals ## tolerance of +/- 30 minutes for matching existing data setstep(dat, timestep = 120, differ = 30) ## convert a nutrient time series to a continuous time series ## then remove empty rows and columns data(apacpnut) dat_nut <- apacpnut dat_nut <- setstep(dat_nut, timestep = 60) subset(dat_nut, rem_rows = TRUE, rem_cols = TRUE) ## End(Not run)
## Not run: ## import data data(apaebmet) dat <- apaebmet ## convert time series to two hour invervals ## tolerance of +/- 30 minutes for matching existing data setstep(dat, timestep = 120, differ = 30) ## convert a nutrient time series to a continuous time series ## then remove empty rows and columns data(apacpnut) dat_nut <- apacpnut dat_nut <- setstep(dat_nut, timestep = 60) subset(dat_nut, rem_rows = TRUE, rem_cols = TRUE) ## End(Not run)
Get stations records from the CDMO for a single parameter starting with the most current date
single_param(station_code, param, Max = 100, trace = TRUE)
single_param(station_code, param, Max = 100, trace = TRUE)
station_code |
chr string of station, 7 or 8 characters |
param |
chr string for a single parameter to return. |
Max |
numeric value for number of records to obtain from the current date |
trace |
logical indicating if import progress is printed in console |
This function retrieves data from the CDMO through the web services URL. The computer making the request must have a registered IP address. Visit the CDMO web services page for more information: https://cdmo.baruch.sc.edu/webservices.cfm. This function is the CDMO equivalent of exportSingleParamXML
.
Returns a swmpr object with one parameter. QAQC columns are not provided.
## Not run: ## single parameter for a station, most recent single_param('hudscwq', 'do_mgl') ## End(Not run)
## Not run: ## single parameter for a station, most recent single_param('hudscwq', 'do_mgl') ## End(Not run)
Obtain a data.frame
of metadata for all SWMP stations.
site_codes()
site_codes()
This function retrieves data from the CDMO web services. The computer making the request must have a registered IP address. Visit the CDMO web services page for more information: https://cdmo.baruch.sc.edu/webservices.cfm. This is the CDMO equivalent of exportStationCodesXML
.
A data.frame
of SWMP metadata
## Not run: ## retrieve metadata for all sites site_codes() ## End(Not run)
## Not run: ## retrieve metadata for all sites site_codes() ## End(Not run)
Get metadata for all the stations at a single SWMP reserve
site_codes_ind(nerr_site_id)
site_codes_ind(nerr_site_id)
nerr_site_id |
chr string of site, three letters |
This function retrieves data from the CDMO web services. The computer making the request must have a registered IP address. Visit the CDMO web services page for more information: https://cdmo.baruch.sc.edu/webservices.cfm. This function is the CDMO equivalent of NERRFilterStationCodesXMLNew
.
An abbreviated data.frame
of the SWMP metadata for the requested site
## Not run: ## retrieve metadata for all stations at a site site_codes_ind('apa') ## End(Not run)
## Not run: ## retrieve metadata for all stations at a site site_codes_ind('apa') ## End(Not run)
Smooth swmpr data with a moving window average
smoother(x, ...) ## Default S3 method: smoother(x, window = 5, sides = 2, ...) ## S3 method for class 'swmpr' smoother(x, params = NULL, ...)
smoother(x, ...) ## Default S3 method: smoother(x, window = 5, sides = 2, ...) ## S3 method for class 'swmpr' smoother(x, params = NULL, ...)
x |
input object |
... |
arguments passed to or from other methods |
window |
numeric vector defining size of the smoothing window, passed to |
sides |
numeric vector defining method of averaging, passed to |
params |
is chr string of swmpr parameters to smooth, default all |
The smoother
function can be used to smooth parameters in a swmpr object using a specified window size. This method is a simple wrapper to filter
. The window argument specifies the number of observations included in the moving average. The sides argument specifies how the average is calculated for each observation (see the documentation for filter
). A value of 1 will filter observations within the window that are previous to the current observation, whereas a value of 2 will filter all observations within the window centered at zero lag from the current observation. The params argument specifies which parameters to smooth.
Returns a filtered swmpr object. QAQC columns are removed if included with input object.
## import data data(apadbwq) swmp1 <- apadbwq ## qaqc and subset imported data dat <- qaqc(swmp1) dat <- subset(dat, subset = c('2012-07-09 00:00', '2012-07-24 00:00')) ## filter test <- smoother(dat, window = 50, params = 'do_mgl') ## plot to see the difference plot(do_mgl ~ datetimestamp, data = dat, type = 'l') lines(test, select = 'do_mgl', col = 'red', lwd = 2)
## import data data(apadbwq) swmp1 <- apadbwq ## qaqc and subset imported data dat <- qaqc(swmp1) dat <- subset(dat, subset = c('2012-07-09 00:00', '2012-07-24 00:00')) ## filter test <- smoother(dat, window = 50, params = 'do_mgl') ## plot to see the difference plot(do_mgl ~ datetimestamp, data = dat, type = 'l') lines(test, select = 'do_mgl', col = 'red', lwd = 2)
Location of NERRS sites in decimal degrees and time offset from Greenwich mean time. Only active sites are included. Sites are identified by five letters indexing the reserve and site names. The dataset is used to plot locations with the map_reserve
function and to identify metabolic days with the ecometab
function. Created from sampling_locations.csv provided by CDMO.
stat_locs
stat_locs
A data.frame
object with 161 rows and 4 variables:
station_code
chr
latitude
numeric
longitude
numeric
gmt_off
int
Subset a swmpr object by a date range, parameters, or non-empty values
## S3 method for class 'swmpr' subset( x, subset = NULL, select = NULL, operator = NULL, rem_rows = FALSE, rem_cols = FALSE, ... )
## S3 method for class 'swmpr' subset( x, subset = NULL, select = NULL, operator = NULL, rem_rows = FALSE, rem_cols = FALSE, ... )
x |
input swmpr object |
subset |
chr string of form 'YYYY-mm-dd HH:MM' to subset a date range. Input can be one (requires |
select |
chr string of parameters to keep, |
operator |
chr string specifiying binary operator (e.g., |
rem_rows |
logical indicating if rows with no data are removed, default |
rem_cols |
is logical indicating if cols with no data are removed, default |
... |
arguments passed to other methods |
This function is used to subset swmpr data by date and/or a selected parameter. The date can be a single value or as two dates to select records within the range. The former case requires a binary operator input as a character string passed to the argument, such as >
or <
. The subset argument for the date(s) must also be a character string of the format YYYY-mm-dd HH:MM for each element (i.e., %Y-%m-%d %H:%M
in POSIX standards). Be aware that an error may be returned using this function if the subset argument is in the correct format but the calendar date does not exist, e.g. '2012-11-31 12:00'
. The function can also be used to remove rows and columns that do not contain data, which may be useful after processing with other functions.
Returns a swmpr object as a subset of the input. The original object will be returned if no arguments are specified.
## get data data(apaebmet) dat <- apaebmet ## subset records greater than or equal to a date subset(dat, subset = '2013-01-01 0:00', operator = '>=') ## subset records within a date range, select two parameters subset(dat, subset = c('2012-07-01 6:00', '2012-08-01 18:15'), select = c('atemp', 'totsorad'))
## get data data(apaebmet) dat <- apaebmet ## subset records greater than or equal to a date subset(dat, subset = '2013-01-01 0:00', operator = '>=') ## subset records within a date range, select two parameters subset(dat, subset = c('2012-07-01 6:00', '2012-08-01 18:15'), select = c('atemp', 'totsorad'))
Wrapper for creating a swmpr object
swmpr(stat_in, meta_in)
swmpr(stat_in, meta_in)
stat_in |
|
meta_in |
chr string for station code (7 or 8 characters), can be multiple stations if data are combined |
This function is a simple wrapper to structure
that is used internally within other functions to create a swmpr object. The function does not have to be used explicitly. Attributes of a swmpr object include names
, row.names
, class
, station
, parameters
, qaqc_cols
, cens_cols
, date_rng
, timezone
, stamp_class
, metabolism
(if present), and metab_units
(if present).
Returns a swmpr object to be used with S3 methods
Format the datetimestamp column of SWMP data
time_vec(chr_in = NULL, station_code, tz_only = FALSE)
time_vec(chr_in = NULL, station_code, tz_only = FALSE)
chr_in |
chr string of datetimestamp vector |
station_code |
is chr string for station (three or more characters) |
tz_only |
logical that returns only the timezone, default |
This function formats the datetimestamp column of SWMP data to the POSIXct
format and the correct timezone for a station. Note that SWMP data do not include daylight savings and the appropriate location based on GMT offsets is used for formatting. This function is used internally within data retrieval functions and does not need to be called explicitly.
Returns a POSIX vector if tz_only
is true, otherwise the timezone for a station is returned as a chr string