-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve the behavior of the person scrape #900
Comments
There is a 5-year-old issue with a rich history on how we might approach handling deleted data. @antidipyramid, can you have a look and leave a comment in that issue about the approach that makes most sense to you? |
Whoops, forgot to add the link: opencivicdata/pupa#295 |
A few weeks ago, I observed that the last_seen flag was not behaving as expected. It turns out our scrapers were pinned to an earlier version of @antidipyramid has also drafted a pupa command to remove data that has not been seen in a certain window: opencivicdata/pupa#344 We'll pilot this once we confirm the date stamps are behaving as expected. |
Date stamps look to be working! Here are the memberships and events we haven't seen in the past week:
Looks like all of these have been removed from Legistar. |
Hannah will show Monkruman how to cut releases of OCD and pupa next week, then we'll add the DAG to flush data we haven't seen in a week. |
Done! |
Specifically, changes in name and membership dates, etc., misbehave often, and it'd be great if they didn't.
The text was updated successfully, but these errors were encountered: