Skip to content

Load TPCH dataset into BigQuery using Trino#14982

Merged
hashhar merged 3 commits intomasterfrom
hashhar/bigquery-test-setup
Nov 10, 2022
Merged

Load TPCH dataset into BigQuery using Trino#14982
hashhar merged 3 commits intomasterfrom
hashhar/bigquery-test-setup

Conversation

@hashhar
Copy link
Copy Markdown
Member

@hashhar hashhar commented Nov 10, 2022

Description

BigQuery connector now supports writes so we can directly load the TPCH dataset instead of using external scripts.

Release notes

(x) This is not user-visible or docs only and no release notes are required.

Fixes #14338

This uncovered #14981 because the copyTpchTables method failed when verifying count of inserted rows on newly created tables.

Since Trino now supports writes to BigQuery we can directly create
required TPCH tables instead of having to load them manully beforehand.
In previous commit 90f518f BigQuery
connector now uses Trino to create the TPCH dataset and hence the
external script and schemas are no longer needed.
@hashhar hashhar requested a review from ebyhr November 10, 2022 07:25
@cla-bot cla-bot bot added the cla-signed label Nov 10, 2022
@hashhar hashhar merged commit f88ae75 into master Nov 10, 2022
@hashhar hashhar deleted the hashhar/bigquery-test-setup branch November 10, 2022 08:57
@github-actions github-actions bot added this to the 403 milestone Nov 10, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Development

Successfully merging this pull request may close these issues.

Simplify or remove scripts to load TPCH tables in BigQuery

2 participants