Built a COVID-19 Tracking Resource that looks at statewide data and data for each of the 254 counties with R

Matt_Worthington_Texas_2036 · May 19, 2020, 12:00am

Hi R Fam! I’m reposting this from a different channel, but primarily because our resource is mostly built in R. If you can help, that would be awesome. Big things I’m wondering are: how do I call on the list of available files using the aws.s3 package? Right now, I’ve been able execute the code that @Makada_Brookings shared and @Jessica_Williams-Holt reposted recently, but am not sure how to get a list of available CSVs in the bucket. Anyone have tips?
> Just getting started with looking at SafeGraph’s county-level data. Through our organization, I’ve built a COVID-19 Tracking Resource that looks at statewide data and data for each of the 254 counties. Can anyone direct me to resources that would explain what files to pull to get the latest data for each of those geographies, in each of the safegraph datasets available? Thanks!

Jessica_Williams-Holt · May 19, 2020, 5:07pm

The AWS.S3 package is not the best documented in the R universe! Let me poke around and see if I can come up with the answer

Matt_Worthington_Texas_2036 · May 19, 2020, 8:34pm

Agreed. Just trying to figure out how to get a list of the files.

Sean_Davis_NIH · May 20, 2020, 6:12pm

You’ll want to use get_bucket() and potentially specify a “prefix”. See help for get_bucket().

Jessica_Williams-Holt · May 20, 2020, 11:23pm

@Matt_Worthington_Texas_2036 i wonder if you were able to get get_bucket to work for you? i have not. could be the prefix argument

Jessica_Williams-Holt · May 20, 2020, 11:59pm

finally found the right parameter:
get_bucket_df("sg-c19-response", prefix = "monthly-patterns"). use prefix to specify the desired subdirectory

Matt_Worthington_Texas_2036 · May 25, 2020, 1:08am

@Jessica_Williams-Holt I just had a chance to dig into Safegraph this evening and your get_bucket_df() suggestion with the subdirectory worked like magic!

Jessica_Williams-Holt · May 25, 2020, 5:02pm

Glad to hear @Matt_Worthington_Texas_2036! Even with @Sean_Davis_NIH s helpful pointer, it took me longer than I’d care to admit to figure this out

Matt_Worthington_Texas_2036 · May 27, 2020, 2:19pm

I can totally relate to that last sentence, except also on a million other occasions in R.

Sean_Davis_NIH · June 1, 2020, 5:43pm

You know you are an R person if it takes longer to solve the simple problems than the HARD ones.

Jessica_Williams-Holt · June 1, 2020, 5:44pm

haha totally

Matt_Worthington_Texas_2036 · June 1, 2020, 7:06pm

It’s so true. AND if you convince yourself that it’ll be worth it in the long run.