feat: add dry run to the read_gbq function #979

antoineeripret · 2025-11-06T09:32:23Z

This change lets the user run a dry-run query through the read_gbq function. When a dry run is requested, read_gbq returns the amount of data that the query would process (in GB) instead of a pd.DataFrame.

shuoweil · 2025-11-07T18:20:36Z

@antoineeripret Could you please check the failed tests? Thanks a lot.

antoineeripret · 2025-11-10T08:19:33Z

@shuoweil, I've added a new commit with some changes to fix the tests. I ran nox -s unit-3.10 and got 0 failures. Thank you!

shuoweil · 2025-11-11T18:16:02Z

lint / lint (pull_request)

Hi @antoineeripret, could you please look at the failed check? It should be a quick fix. Thanks a lot.

antoineeripret · 2025-11-12T23:41:36Z

Hi @shuoweil, the last commit should fix it. Got the following on my local env:

python -m black --check docs pandas_gbq tests noxfile.py setup.py
All done! ✨ 🍰 ✨
45 files would be left unchanged.

sycai · 2025-11-13T18:57:08Z

pandas_gbq/gbq_connector.py

+                # we need to get it from the query result
+                # For query_and_wait_via_client_library, the RowIterator should have job set
+                raise ValueError("Cannot access QueryJob from RowIterator for dry_run")
+            return query_job.total_bytes_processed / 1024**3


Could we simply return query_job.total_bytes_processed without further processing?

Reasons:

total_bytes_processed is an integer, which is more precise than a float.

For small tables (1-10 MB), converting the size to GB makes the result less readable.

It aligns better with the BigQuery Python client, which returns the size in bytes.

Generally speaking, we want the caller of this function to perform unit conversions.

@sycai, good call! I'd thought about my own usage, but didn't think about the bigger picture here. I'll commit the change. :)

sycai

Thank you! I think we should be good to go once the doc and tests are updated.
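To illustrate the point raised in this thread about leaving unit conversion to the caller, here is a minimal sketch of a caller-side helper. The name `format_bytes` and the choice of binary units are assumptions made for illustration and are not part of this PR; the helper only expects a non-negative integer byte count such as the one returned by a dry run.

```python
def format_bytes(num_bytes: int) -> str:
    """Render an integer byte count using the largest binary unit that fits."""
    if num_bytes < 0:
        raise ValueError("num_bytes must be non-negative")

    units = ["B", "KiB", "MiB", "GiB", "TiB", "PiB"]
    value = float(num_bytes)
    for unit in units:
        # Stop at the first unit where the value drops below 1024,
        # or at the largest unit we know about.
        if value < 1024 or unit == units[-1]:
            if unit == "B":
                # Plain bytes stay exact: no float formatting needed.
                return f"{num_bytes} B"
            return f"{value:.2f} {unit}"
        value /= 1024


if __name__ == "__main__":
    # A small scan reads naturally in MiB, whereas the same value
    # divided by 1024**3 would be a hard-to-read fraction of a GB.
    print(format_bytes(5 * 1024**2))
    print(format_bytes(3 * 1024**3))
```

Because the raw value stays an integer, callers that need GB, TiB, or a cost estimate can each derive it without losing precision.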

pandas_gbq/gbq.py

tests/unit/test_gbq.py

antoineeripret · 2025-11-18T07:06:39Z

@sycai: updated :)

sycai · 2025-11-18T18:35:21Z

Looks like there's a lint error. Could you fix it? Thanks a lot!

sycai · 2025-11-18T18:33:51Z

pandas_gbq/gbq.py

    -------
-    df: DataFrame
-        DataFrame representing results of query.
+    df: DataFrame or float


doc nit: "DataFrame or int"

sycai · 2025-11-18T18:34:24Z

pandas_gbq/gbq.py

-        DataFrame representing results of query.
+    df: DataFrame or float
+        DataFrame representing results of query. If ``dry_run=True``, returns
+        a float representing the amount of data that would be processed (in bytes).


doc nit: "returns an int representing ..."
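As a rough sketch of how a caller might use the behavior this docstring describes (an int byte count when ``dry_run=True``), the wrapper below estimates a query first and only runs it if the estimate is within a budget. `read_within_budget`, `QueryTooLargeError`, and `max_bytes` are hypothetical names introduced here; `read_fn` stands in for a reader such as `read_gbq` and is assumed to accept a `dry_run` keyword as discussed in this PR.

```python
from typing import Any, Callable


class QueryTooLargeError(RuntimeError):
    """Raised when the dry-run estimate exceeds the caller's budget (hypothetical)."""


def read_within_budget(
    read_fn: Callable[..., Any],
    query: str,
    max_bytes: int,
    **kwargs: Any,
) -> Any:
    """Run ``query`` only if its dry-run estimate is at most ``max_bytes``.

    Assumes ``read_fn(query, dry_run=True)`` returns an int number of bytes
    and ``read_fn(query)`` returns the actual result.
    """
    estimated_bytes = read_fn(query, dry_run=True, **kwargs)
    if estimated_bytes > max_bytes:
        raise QueryTooLargeError(
            f"query would process {estimated_bytes} bytes, "
            f"which exceeds the budget of {max_bytes} bytes"
        )
    # The estimate is within budget, so run the query for real.
    return read_fn(query, **kwargs)
```

In practice a caller would pass the real reader as `read_fn` along with any project or credential keyword arguments it needs; those are forwarded unchanged to both calls.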

shuoweil · 2025-11-18T18:48:11Z

@antoineeripret I believe the lint check is still failing with the new commit. Could you please update it?

antoineeripret added 2 commits November 6, 2025 10:21

feat: add dry run to the read_gbq function

d268a62

return the cost (in GB) if dry run is set to True

13fbf92

antoineeripret requested review from a team as code owners November 6, 2025 09:32

antoineeripret requested review from Linchin and sycai November 6, 2025 09:32

blunderbuss-gcf bot assigned GarrettWu Nov 6, 2025

product-auto-label bot added the size: s and api: bigquery labels Nov 6, 2025

antoineeripret changed the title ~~Add dry run~~ feat: add dry run to the read_gbq function Nov 6, 2025

GarrettWu assigned shuoweil and unassigned GarrettWu Nov 6, 2025

shuoweil added the kokoro:force-run label Nov 7, 2025

yoshi-kokoro removed the kokoro:force-run label Nov 7, 2025

updates to fix test

adcfc7b

product-auto-label bot added the size: m label and removed the size: s label Nov 10, 2025

fix lint

e9f4c00

sycai reviewed Nov 13, 2025

Remove unit conversion

a171ff4

antoineeripret requested a review from sycai November 16, 2025 10:48

sycai reviewed Nov 17, 2025

pandas_gbq/gbq.py (outdated)

tests/unit/test_gbq.py (outdated)

fix docs

8207a47

antoineeripret requested a review from sycai November 18, 2025 07:06

Merge branch 'main' into add_dry_run

f016352

sycai approved these changes Nov 18, 2025

sycai requested a review from tswast November 18, 2025 18:36

5 participants
