M3: Improving the analysis pipeline
No due date
50% complete
The BigQuery datasets are reorganized to support partitioning and clustering.
The summary datasets are generated from the HAR files in Dataflow.
Content pre-processing (Rework CSS parsing) is done by Dataflow and written to BigQuery.
2021 Web Almanac queries run monthly and results stored. (Cloud SQL + BQ?)
Web Almanac metrics are well-documented and easi…
The BigQuery datasets are reorganized to support partitioning and clustering.
The summary datasets are generated from the HAR files in Dataflow.
Content pre-processing (Rework CSS parsing) is done by Dataflow and written to BigQuery.
2021 Web Almanac queries run monthly and results stored. (Cloud SQL + BQ?)
Web Almanac metrics are well-documented and easily extensible.