Skip to content

Conversation

@raunaqmorarka
Copy link
Member

Description

Additional context and related issues

Release notes

(x) This is not user-visible or docs only and no release notes are required.
( ) Release notes are required, please propose a release note for me.
( ) Release notes are required, with the following suggested text:

# Section
* Fix some things. ({issue}`issuenumber`)

@cla-bot cla-bot bot added the cla-signed label May 2, 2023
@raunaqmorarka raunaqmorarka requested review from arhimondr and sopel39 May 2, 2023 15:00
@arhimondr
Copy link
Contributor

CC: @findepi

@arhimondr
Copy link
Contributor

This change fine with me, but I remember there was some counter argument from changing it by default though I don't remember what exactly it was

connector.name=tpch
tpch.splits-per-node=4
tpch.column-naming=STANDARD
tpch.double-type-mapping=DECIMAL
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Define "standard".

IMO this file should have only connector.name=tpch and everything beyond that is redundant

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Standard is defined at https://www.tpc.org/tpc_documents_current_versions/pdf/tpc-h_v3.0.1.pdf
The naming convention and usage of decimal type is specified by the TPCH standard. This is already what we benchmark, I'm just making the dev setup consistent with what we benchmark so that devs don't have to keep filling this in by hand when working with files locally.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Standard is defined at https://www.tpc.org/tpc_documents_current_versions/pdf/tpc-h_v3.0.1.pdf
The naming convention and usage of decimal type is specified by the TPCH standard.

Sure, let's fix the tpch connector to follow the standard

This is already what we benchmark, I'm just making the dev setup consistent with what we benchmark

that's convenient from benchmark perspective, but that's not convenient from integration tests perspective
the deve server should be consistent with both, so it serves developers well

I'm just making the dev setup consistent with what we benchmark so that devs don't have to keep filling this in by hand when working with files locally.

people may be firing queries taken from integration tests and these won't work anymore

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Valid point about integration tests. I care more about the using decimal types than matching the naming convention, so created #17344 to hopefully find a better middle ground.

@raunaqmorarka
Copy link
Member Author

This change fine with me, but I remember there was some counter argument from changing it by default though I don't remember what exactly it was

I haven't changed defaults, just the config for the dev server catalog

@raunaqmorarka raunaqmorarka merged commit 0c58983 into trinodb:master May 3, 2023
@raunaqmorarka raunaqmorarka deleted the tpch-config branch May 3, 2023 03:02
@github-actions github-actions bot added this to the 416 milestone May 3, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Development

Successfully merging this pull request may close these issues.

3 participants