Port Jingjing's backcasting preprocessing utils #114

brookslogan · 2022-06-21T16:56:38Z

Resolves #88. Might involve #49, #90, #106, #109.

Progress so far:

Ported over some of Jingjing's preprocessing functions and tests.
Got the tests running in a package environment.

Some TODOs:

Get these working based on epiprocess classes (taking care that current functions work on a single geo/epigroup at a time)
Work either on implied lag version - time_value or an explicit lag/lag-like column.
Check that, or change to: either always, or optionally, add NAs getting/filling lag training data if it looks like we missed recording a version (it's okay if there is no update for an observation in a target version, but a problem if there are no updates for any observations in a target version) unless there are bugs from assuming that versions are evenly spaced, this is a separate convenience function/arg to think about independently
Change fill_rows and fill_missing_updates to use a mix of last-version-carried-forward and NA/0/customizable fill-in determined by archive's [check out $fill_through_version --- may or may not be useful]
Add examples
Use Abort, etc., rather than stop
Think about interaction with epix_slide. Straightforward is probably combining with Should slide() for epi_archive be given access to less than the most up-to-date snapshots? #49. But we might also think about turning the time&version-lag covariates dfs into a custom type of epi_archive (base epi_archive would be pretty inefficient though; lazy merge might be one way to improve). ------ Just do the straightforward way to begin with, then profile.
[Use alternative to zoo::rollmeanr. Maybe data.table::frollmean?]
[Check compatibility of "shift" terminology with tidymodels, which is used by epipredict. (Might be incompatible with some of our production COVID-19 hospitalization forecasters, but that might be fine.)]

Modified-by: Logan C. Brooks <[email protected]>

Get lag-completion/target-lag function tests working in package setting. Before this change, `ref_lag` was a global set in a not-yet-package file and also the test file, which, when converted into a package, did not pass tests, as the tests' `ref_lag` global doesn't overwrite the pre-existing value from the package.

`add_7davs_and_target` used to rename `value_raw` to `value_target` regardless of what the parameter `value_col` is set to. This causes an error when `value_raw` can't be found in a df with a differently-name value column.

…utils

jingjtang and others added 6 commits June 14, 2022 10:26

Copy over parts of Jingjing's backcaster preprocessing utilities

b0a5859

Modified-by: Logan C. Brooks <[email protected]>

Fix roxygen issues, testthat ed3 complaint, missing ref_lag global

f5847ba

Add missed definition of n_refds

00ea36c

Fix an error in the unit test

a949301

Improve names of backcasting preprocessing preprocessing files

74cceb5

brookslogan self-assigned this Jun 21, 2022

brookslogan changed the title ~~Lcb/port jingjing backcasting preprocessing utils~~ Port Jingjing's backcasting preprocessing utils Jun 21, 2022

brookslogan assigned nmdefries Aug 3, 2022

nmdefries added 13 commits August 3, 2022 16:27

swap Abort in for stop

462552c

.Rd re-documented

c142eae

clarify docs, package imports

ea4261c

import zoo

2c14536

dplyr-ize field renaming

7f06c03

allow any name for value field

b1026a1

`add_7davs_and_target` used to rename `value_raw` to `value_target` regardless of what the parameter `value_col` is set to. This causes an error when `value_raw` can't be found in a df with a differently-name value column.

Merge branch 'main' into lcb/port-jingjing-backcasting-preprocessing-…

20016b5

…utils

remove unneeded jsonlite load

958c6a9

support using implied lag col

6935ec8

allow user to specify issue date col name

66cb491

test implied lag/issue date behavior

41eb9ed

formatting

c3e3aa4

allow any name for value field

11e643c

dsweber2 added this to the Epiprocess Issue Triage milestone Dec 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Port Jingjing's backcasting preprocessing utils #114

Port Jingjing's backcasting preprocessing utils #114

brookslogan commented Jun 21, 2022 •

edited by nmdefries

Loading

Port Jingjing's backcasting preprocessing utils #114

Are you sure you want to change the base?

Port Jingjing's backcasting preprocessing utils #114

Conversation

brookslogan commented Jun 21, 2022 • edited by nmdefries Loading

brookslogan commented Jun 21, 2022 •

edited by nmdefries

Loading