Hi all,
I have been working with the Places Patterns Monthly Dataset, and I have some questions regarding the raw_visit_count metric.
I am aware that this metric is not scaled or normalized in any way. Nevertheless, many of the POI visit counts seem unreasonably low.
For example, during the years 2018 and 2019, I note that:
sg:00d096c86cb64771914acb41e72577f0, a Prada store on 301 Canal St. in New Orleans, has a mean of 1 visit per month,
sg:062d8f0d70604ad2808d680f70050c0b, a Smoothie King on 500 Port of New Orleans Pl #C, has a mean of 1 visit per month,
sg:29f32ae3874a427281bf4b7daa18d390, a New Balance retailer on 500 Port of New Orleans Pl, has a mean of 1 visit per month,
sg:350bbaa135374ea8ae6a67168f55a82f, a YMCA on 2220 Oretha Castle Haley Blvd in New Orleans, has a mean of 1 visit per month,
sg:26b8ca382fcc4184822167f754837e76, a T-Mobile on 2700 S Claiborne Ave #300 in New Orleans, has a mean of 1.5 visits per month.
There are many other examples.
It doesn’t appear that these locations were closed at the time of the sample.
I am also wondering what happens when an active POI gets no visits during a given month. In this case, it appears that the entire row is omitted from the data. Is this correct?
Many of the low visit count POIs do not have a full 24 months of observations. However, these POIs appear to drop in and out of the sample, which implies that they were not closed for the missing months. If these missing months are actually months in which SafeGraph observed no visits at all, the mean visit counts that I report above are overestimates of what SafeGraph observed, as I omit months in which no visits occurred. This makes the POI visit counts listed above seem even more unlikely.
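To make the overestimate concrete, here is a minimal sketch in plain Python of the two ways the mean could be computed. The function names and the sample counts are made up for illustration and are not real SafeGraph data. The zero-filled version assumes that a missing month means zero observed visits, which is exactly the point I am asking about.

```python
from datetime import date

def month_range(start, end):
    """Return the first day of every month from start to end, inclusive."""
    months = []
    year, month = start.year, start.month
    while (year, month) <= (end.year, end.month):
        months.append(date(year, month, 1))
        month += 1
        if month > 12:
            year, month = year + 1, 1
    return months

def compare_means(observed, start, end):
    """Compare the mean over present rows with a zero-filled mean.

    `observed` maps a month's first day to the visit count for that
    month, and only contains months that have a row in the data.
    """
    months = month_range(start, end)
    present = [observed[m] for m in months if m in observed]
    # Mean over the months that actually appear (what I report above).
    mean_observed = sum(present) / len(present) if present else 0.0
    # ASSUMPTION: a missing row means zero visits were observed that month.
    mean_zero_filled = sum(present) / len(months)
    n_missing = len(months) - len(present)
    return mean_observed, mean_zero_filled, n_missing

# Hypothetical POI that appears in only 4 of the 24 months.
observed = {
    date(2018, 1, 1): 1,
    date(2018, 4, 1): 2,
    date(2019, 7, 1): 1,
    date(2019, 12, 1): 2,
}
mean_obs, mean_filled, n_missing = compare_means(
    observed, date(2018, 1, 1), date(2019, 12, 1)
)
print(mean_obs, mean_filled, n_missing)
```

If missing rows really are zero-visit months, the second number is the one that reflects what was observed, and it can be far below the mean over the rows that are present.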
This topic was automatically generated from Slack. You can find the original thread here.