Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Automate data cleaning #117

Open
Tracked by #108
maxachis opened this issue Dec 18, 2024 · 1 comment
Open
Tracked by #108

Automate data cleaning #117

maxachis opened this issue Dec 18, 2024 · 1 comment

Comments

@maxachis
Copy link
Collaborator

maxachis commented Dec 18, 2024

Part of #108

  • monitor 404'd links, try to find updated locations
  • this is probably related to automatic-archives, in that it happens at the same time / without hitting their site too many times
@maxachis
Copy link
Collaborator Author

maxachis commented Jan 7, 2025

For the component for finding 404'd links (which I am prototypically calling 404Probe), we will run into an issue in some cases where the URL may return a 200 response but actually navigating to the browser reveals a "Page Not Found". For example:

https://data.tempe.gov/pages/tempegov::1-25-police-body-cameras

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant