-
Notifications
You must be signed in to change notification settings - Fork 101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Migrate catalog to cloud.gov #2788
Comments
People had expressed concerns about migrating large data dumps from FCS to cloud.gov. The simplest way to sidestep that would be to pipe directly from mysqldump -> gzip -> sws s3 cp, which people do all the time. The S3 credentials would be for a bucket that we provision in our |
Plan to do dashboard, then inventory, then catalog, with static site being "whenever ready". Subject to change. |
For reference, here's how the more general backup strategy will work. |
Confirmed that we have CAA for letsencrypt.org at data.gov, which will be inherited for all subdomains (unless overridden). |
Database Migration Commands:
|
Final DB Migration Script: https://gist.github.com/nickumia-reisys/8a5da2c3e33b9b7fb2ada263b9f9c52e Steps to replicate:
|
@nickumia-reisys since we needed some collaboration, I moved the scripts as "docs" or usage scripts for cf-backup-manager: GSA/cf-backup-manager#18 (I also made some changes). |
|
Database does look like it is functional, Accessing Staging api route: https://catalog-stage-datagov.app.cloud.gov/api/action/package_show?id=megapixel-mercury-cadmium-telluride-focal-plane-arrays-for-infrared-imaging-out-to-12-micr Staging api route: https://catalog-stage-datagov.app.cloud.gov/api/action/package_show?id=namma-lightning-zeus-data-v1 |
Courtesy of @jbrown-xentity, |
I'm proposing that we don't need to take a new database dump and just run all of the harvest jobs on catalog production since it has all of the data since December 2021. |
Harvesting activity is stopped. |
Database is restored and Solr registered 17k datasets to reindex. Solr reindex is currently running. |
After database restore, we need to run an ANALYZE command to collect new statistics. |
Catalog DB (prod) backup/restore times |
Putting catalog on cloud.gov into a safe mode before migrating to minimize the web traffic and performance issues:
After migration we will re-evaluate and gradually revert these changes to bring catalog back to normal. ============================
|
Pointed current production CDN to catalog-web app on cloud.gov. Catalog.data.gov is officially migrated to cloud.gov, Things are looking fine, UI speed is good, catalog-web instances are stable, ECS Solr memory is normal. Will watch for the performance in the next a few days, and gradually take catalog.data.gov out of 'safe mode' and turn on harvesting. |
Initial harvesting has been running for 4 days. Dataset count increased 57k. This is abnormal. Investigating data.json source duplicating issue now. |
ckanext-datajson duplicate issue identified and fixed. Refreshing catalog with last FCS DB backup and reindexing Solr. |
Change requests for |
The error we saw during pg_restore
|
User Story
In order to stop maintaining the FCS deployment, the data.gov team wants production service to be directed to our deployment on cloud.gov.
Acceptance Criteria
[ACs should be clearly demoable/verifiable whenever possible. Try specifying them using BDD.]
THEN we see the expected catalog output
AND we see requests in the catalog app's logs on cloud.gov
Background
[Any helpful contextual notes or links to artifacts/evidence, if needed]
Security Considerations (required)
[Any security concerns that might be implicated in the change. "None" is OK, just be explicit here!]
This change will migrate us away from our old environment, which is harder to maintain and for which there are more things that we have to look after. The new environment has already been pen-tested and ATOd so we think it's going to be net win on attack surface overall.
Launch plan
Pre-launch
In the days leading up to the launch, these tasks should be completed:
cf create-private-domain gsa-datagov catalog.data.gov
cf map-route catalog catalog.data.gov
Open a ticket with GSA DNS to coordinate the DNS switch over and update the DNScatalog.data.gov CNAME catalog.data.gov.external-domains-production.cloud.gov
Launch
Tasks to be completed at the time of launch.
Update harvest sources set to private to be public (use logged in catalog api call and search onharvest_source_title
to evaluate what harvest sources have data that should probably be public).Note_all of the private harvest sources are listed here, but some of these are clearly "test" harvests that shouldn't be set to publicDeal this in another ticket.Have GSA update the DNScatalog.data.gov CNAME catalog.data.gov.external-domains-production.cloud.gov
catalog-prod-datagov.apps.cloud.gov
In the event a rollback is necessary, apply these tasks.
The text was updated successfully, but these errors were encountered: