Download raw ACS tables for the blockgroup ACS pipeline
Source: R/download_bg_acs_raw.R
Usage
download_bg_acs_raw(
  yr,
  blockgroup_tables = setdiff(as.vector(EJAM::tables_ejscreen_acs), tract_tables),
  tract_tables = c("B18101", "C16001", "B27010"),
  include_tract_data = TRUE,
  fiveorone = "5",
  pipeline_dir = NULL,
  save_stage = FALSE,
  stage_format = c("csv", "rds", "rda", "arrow"),
  raw_acs_storage = c("folder", "object"),
  raw_table_format = stage_format,
  overwrite = TRUE,
  validation_strict = TRUE,
  storage = c("auto", "local", "s3"),
  download_fun = ACSdownload::get_acs_new,
  download_timeout = 3600,
  download_retries = 2
)
Arguments
- yr
end year of the ACS 5-year survey to use.
- blockgroup_tables
ACS tables to download at blockgroup resolution.
- tract_tables
ACS tables to download at tract resolution for later blockgroup apportionment.
- include_tract_data
logical, whether to download tract_tables.
- fiveorone
ACS sample length, "5" by default.
- pipeline_dir
folder for saving the pipeline stage.
- save_stage
logical, whether to save the bg_acs_raw stage.
- stage_format
file format for saved object stages: "rds", "rda", "csv", or "arrow". Raw ACS folder checkpoints use raw_table_format for the per-table files.
- raw_acs_storage
raw ACS checkpoint storage pattern. "folder" saves one ACS table per file plus a manifest. "object" saves the historical single bg_acs_raw list object.
- raw_table_format
file format for per-table raw ACS files when raw_acs_storage = "folder".
- overwrite
logical, whether to overwrite an existing saved stage.
- validation_strict
logical passed to ejscreen_pipeline_save().
- storage
raw ACS checkpoint storage backend: "auto", "local", or "s3".
- download_fun
ACSdownload-compatible function used to obtain a single ACS table for a given yr, tables, fips, and fiveorone. Defaults to ACSdownload::get_acs_new(). Supply a wrapper if you need to pin a legacy ACS source implementation.
- download_timeout
timeout in seconds to use while downloading ACS table files. This is increased above R's usual 60-second default because some Census table-based summary files are hundreds of MB.
- download_retries
number of times to retry a failed ACS table download after the initial attempt.
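The download_timeout and download_retries arguments describe a familiar pattern: raise R's timeout option for the duration of the download, then retry a failed call a fixed number of times after the first attempt. The base-R sketch below illustrates that pattern only; it is not the code used inside download_bg_acs_raw(), and with_timeout_and_retries() and flaky_download() are hypothetical names introduced for this example.

```r
# Hypothetical helper: illustrates the timeout-and-retry pattern only;
# it is NOT the implementation used inside download_bg_acs_raw().
with_timeout_and_retries <- function(fun, ..., timeout = 3600, retries = 2) {
  old <- options(timeout = timeout)   # raise R's download timeout (seconds)
  on.exit(options(old), add = TRUE)   # restore the previous value on exit
  attempts <- retries + 1             # the initial attempt plus the retries
  last_error <- NULL
  for (i in seq_len(attempts)) {
    result <- tryCatch(fun(...), error = function(e) e)
    if (!inherits(result, "error")) {
      return(result)
    }
    last_error <- result
    message("Attempt ", i, " of ", attempts, " failed: ",
            conditionMessage(result))
  }
  stop("All ", attempts, " attempts failed: ", conditionMessage(last_error))
}

# Stand-in for a download function that fails twice and then succeeds.
calls <- 0
flaky_download <- function(table) {
  calls <<- calls + 1
  if (calls < 3) stop("simulated network failure")
  data.frame(table = table, ok = TRUE)
}

res <- with_timeout_and_retries(flaky_download, "B18101",
                                timeout = 3600, retries = 2)
```

With retries = 2 the stand-in function is called three times in total, which matches the documented meaning of download_retries as retries after the initial attempt.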
Details
This creates the raw ACS checkpoint for the annual EJSCREEN/EJAM
data update pipeline. It downloads the Census Bureau ACS table-based summary
file tables with download_fun, which defaults to ACSdownload::get_acs_new().
By default, the saved checkpoint uses a folder-plus-manifest layout: one file
per ACS table in bg_acs_raw/blockgroup/ and bg_acs_raw/tract/, plus manifest
files that describe the checkpoint. This layout is easier to inspect and
extend than one large list object, yet it still loads as the same
ejam_bg_acs_raw list object used by downstream functions.
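To make the folder-plus-manifest idea concrete, here is a minimal base-R sketch that writes one CSV per table under blockgroup/ and tract/ subfolders, records them in a manifest, and reloads everything into a nested list. The manifest file name (manifest.csv), its columns, the per-table file names, and the toy values are all assumptions made for illustration; the package's actual checkpoint schema may differ.

```r
# Illustrative only: file names, manifest columns, and data values are
# assumptions, not the package's actual checkpoint schema.
raw <- list(
  blockgroup = list(
    B01001 = data.frame(GEO_ID = "1500000US010010201001",
                        fips = "010010201001",
                        B01001_001 = 100,          # made-up value
                        stringsAsFactors = FALSE)
  ),
  tract = list(
    B18101 = data.frame(GEO_ID = "1400000US01001020100",
                        fips = "01001020100",
                        B18101_001 = 250,          # made-up value
                        stringsAsFactors = FALSE)
  )
)

root <- file.path(tempdir(), "bg_acs_raw")
rows <- list()
for (geo in names(raw)) {
  dir.create(file.path(root, geo), recursive = TRUE, showWarnings = FALSE)
  for (tbl in names(raw[[geo]])) {
    rel_path <- file.path(geo, paste0(tbl, ".csv"))
    # One file per ACS table
    write.csv(raw[[geo]][[tbl]], file.path(root, rel_path), row.names = FALSE)
    rows[[length(rows) + 1]] <- data.frame(
      resolution = geo, table = tbl, file = rel_path,
      n_rows = nrow(raw[[geo]][[tbl]]), stringsAsFactors = FALSE
    )
  }
}
manifest <- do.call(rbind, rows)
write.csv(manifest, file.path(root, "manifest.csv"), row.names = FALSE)

# Reload the per-table files into the same nested list shape.
manifest2 <- read.csv(file.path(root, "manifest.csv"), stringsAsFactors = FALSE)
reloaded <- list(blockgroup = list(), tract = list())
for (i in seq_len(nrow(manifest2))) {
  m <- manifest2[i, ]
  reloaded[[m$resolution]][[m$table]] <- read.csv(
    file.path(root, m$file),
    # Keep identifiers as text so leading zeros in FIPS codes survive
    colClasses = c(GEO_ID = "character", fips = "character")
  )
}
```

Because each table lives in its own file, a single table can be inspected or replaced without rewriting the rest, which is the advantage the folder layout has over one large list object.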
This stage deliberately runs before any EJAM formula calculations. The
downloaded tables are the parsed ACSdownload output, including Census table
columns, GEO_ID, fips, and SUMLEVEL.
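As a rough sketch of a custom download_fun, the wrapper below accepts the yr, tables, fips, and fiveorone arguments named in the documentation and checks that the result carries the GEO_ID, fips, and SUMLEVEL columns mentioned above. legacy_get_acs() is a hypothetical stand-in that returns toy data; the real return structure of ACSdownload::get_acs_new(), the meaning of its fips argument, and the column types are not confirmed here and are assumed for illustration.

```r
# Hypothetical stand-in for a pinned legacy downloader. A real wrapper
# would call the legacy implementation here instead of building toy data.
legacy_get_acs <- function(yr, tables, fips, fiveorone) {
  data.frame(
    GEO_ID = "1500000US010010201001",
    fips = "010010201001",
    SUMLEVEL = "150",        # Census summary level code for block groups
    B01001_001 = 100,        # made-up value
    stringsAsFactors = FALSE
  )
}

# Wrapper using the argument names the documentation lists for download_fun.
# The column check is an assumption about what downstream code expects.
pinned_download_fun <- function(yr, tables, fips, fiveorone = "5", ...) {
  out <- legacy_get_acs(yr = yr, tables = tables, fips = fips,
                        fiveorone = fiveorone)
  required <- c("GEO_ID", "fips", "SUMLEVEL")
  missing_cols <- setdiff(required, names(out))
  if (length(missing_cols) > 0) {
    stop("Downloaded table is missing columns: ",
         paste(missing_cols, collapse = ", "))
  }
  out
}

tbl <- pinned_download_fun(yr = 2022, tables = "B01001", fips = "01")

# Possible use with the documented argument (not run here):
# download_bg_acs_raw(yr = 2022, download_fun = pinned_download_fun)
```

Validating the identifier columns inside the wrapper surfaces a mismatched legacy source at download time instead of in a later pipeline stage.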