fix: Edge case with metric not getting quoted in sort by when normalize_columns is enabled by Vitor-Avila · Pull Request #33337 · apache/superset

Vitor-Avila · 2025-05-01T16:59:40Z

SUMMARY

When using Snowflake, you can enable the normalize_columns setting at the dataset level so that all column names are lowercase. With this setting enabled, if a chart uses a metric whose key matches a column that exists in the dataset (but isn't used in the chart), the query fails with a SQL error: the metric is used for sorting by default, and its name isn't quoted in the ORDER BY clause. Superset would generate:

SELECT DATE_TRUNC('DAY', time) AS "time", COUNT(*) AS "count_lower" 
FROM public.new GROUP BY DATE_TRUNC('DAY', time) ORDER BY count_lower DESC
LIMIT 1000;

As opposed to:

SELECT DATE_TRUNC('DAY', time) AS "time", COUNT(*) AS "count_lower" 
FROM public.new GROUP BY DATE_TRUNC('DAY', time) ORDER BY "count_lower" DESC
LIMIT 1000;

This only happens with this setting enabled. If you disable it and have the columns in uppercase, the exact same chart configuration works.
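
One plausible reading of the error, sketched here under the assumption that Snowflake folds unquoted identifiers to uppercase while quoted identifiers keep their exact case: the unquoted count_lower in ORDER BY no longer refers to the quoted "count_lower" alias. The resolve_identifier helper is hypothetical and only models that folding rule; it is not part of Superset or Snowflake.

```python
def resolve_identifier(identifier: str) -> str:
    """Model how an identifier is resolved (assumed Snowflake-style folding).

    Quoted identifiers keep their case; unquoted ones fold to uppercase.
    """
    if len(identifier) >= 2 and identifier.startswith('"') and identifier.endswith('"'):
        # Strip the surrounding quotes and unescape doubled quotes.
        return identifier[1:-1].replace('""', '"')
    return identifier.upper()


# The SELECT list always aliases the metric with a quoted, lowercase name.
alias = resolve_identifier('"count_lower"')

# ORDER BY as generated before the fix (unquoted) and after it (quoted).
unquoted_order_by = resolve_identifier("count_lower")
quoted_order_by = resolve_identifier('"count_lower"')

print(unquoted_order_by == alias)  # the unquoted reference misses the alias
print(quoted_order_by == alias)    # the quoted reference matches it
```

Under this model the unquoted name resolves to the uppercase table column instead of the metric alias, which would explain why the query only breaks when a column with the same key exists.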

BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF

No UI changes.

TESTING INSTRUCTIONS

Create a table in Snowflake.
Create a dataset for it.
Enable normalize_columns on the dataset.
Sync columns so they revert to lowercase.
Create a metric in the dataset with a key named after a column from the table.
Create a chart using this metric, and another column.
Validate the metric name is properly quoted in the ORDER BY statement.
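
The last step can be checked by eye or with a small script. This is a minimal sketch, assuming the generated SQL is available as a string; order_by_quotes_metric is a hypothetical helper, not a Superset API, and the regex only handles simple queries like the ones in this PR.

```python
import re


def order_by_quotes_metric(sql: str, metric: str) -> bool:
    """Return True if the ORDER BY clause references the metric in double quotes."""
    match = re.search(
        r"ORDER\s+BY\s+(.*?)(?:\s+LIMIT\b|;|$)",
        sql,
        flags=re.IGNORECASE | re.DOTALL,
    )
    if match is None:
        return False  # no ORDER BY clause at all
    return f'"{metric}"' in match.group(1)


broken_sql = """
SELECT DATE_TRUNC('DAY', time) AS "time", COUNT(*) AS "count_lower"
FROM public.new GROUP BY DATE_TRUNC('DAY', time) ORDER BY count_lower DESC
LIMIT 1000;
"""

fixed_sql = """
SELECT DATE_TRUNC('DAY', time) AS "time", COUNT(*) AS "count_lower"
FROM public.new GROUP BY DATE_TRUNC('DAY', time) ORDER BY "count_lower" DESC
LIMIT 1000;
"""

print(order_by_quotes_metric(broken_sql, "count_lower"))
print(order_by_quotes_metric(fixed_sql, "count_lower"))
```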

ADDITIONAL INFORMATION

Has associated issue:
Required feature flags:
Changes UI
Includes DB Migration (follow approval process in SIP-59)
- Migration is atomic, supports rollback & is backwards-compatible
- Confirm DB migration upgrade and downgrade tested
- Runtime estimates and downtime expectations provided
Introduces new feature or API
Removes existing feature or API

korbit-ai

I've completed my review and didn't find any issues.

Files scanned

File Path	Reviewed
superset/models/helpers.py	✅

Vitor-Avila · 2025-05-01T17:52:12Z

This mostly changes the "order of evaluation" (to check whether the column is a metric first). No tests failed, but I'm not 100% sure whether this order of evaluation matters. @betodealmeida @eschutho @mistercrunch @michael-s-molina @villebro any thoughts?

Vitor-Avila · 2025-05-01T17:53:20Z

tagging @hughhhh as I believe you worked on this logic in the past

mistercrunch

LGTM, though overall this whole section of the code is rough. Had to do a fair amount of thinking/guessing to understand why the evaluation ordering of the elifs matters here...

Vitor-Avila · 2025-05-02T22:44:45Z

superset/models/helpers.py

+            elif col in columns_by_name:
+                col = self.convert_tbl_column_to_sqla_col(
+                    columns_by_name[col], template_processor=template_processor
+                )


I'm basically just changing the order to first evaluate the column against metrics, and then across columns.
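
To make the reordering concrete, here is a minimal sketch that uses plain dicts in place of Superset's model objects. Only columns_by_name comes from the diff; metrics_by_name, the string values, and both function names are hypothetical stand-ins, and the real branches in helpers.py do more work than a dictionary lookup.

```python
# Hypothetical stand-ins: "count_lower" is both a table column and a metric key.
columns_by_name = {"count_lower": "column:count_lower", "time": "column:time"}
metrics_by_name = {"count_lower": "metric:count_lower"}


def resolve_orderby_before(col: str) -> str:
    """Old ordering: the column lookup wins when a name is both column and metric."""
    if col in columns_by_name:
        return columns_by_name[col]
    elif col in metrics_by_name:
        return metrics_by_name[col]
    return col


def resolve_orderby_after(col: str) -> str:
    """New ordering: check metrics first, then fall back to columns."""
    if col in metrics_by_name:
        return metrics_by_name[col]
    elif col in columns_by_name:
        return columns_by_name[col]
    return col


print(resolve_orderby_before("count_lower"))  # resolved as a column
print(resolve_orderby_after("count_lower"))   # resolved as a metric
print(resolve_orderby_after("time"))          # plain columns are unaffected
```

Names that exist only as columns resolve the same way under both orderings, so the change should only affect names that collide with a metric.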

My understanding of the issue is that, with the current if/elif ordering, we first check col against columns_by_name, which is true (even though it's a metric). convert_tbl_column_to_sqla_col() then does col = sa.column(tbl_column.column_name, type_=type_), which seems to return a quoted name only if the column is uppercase (hence the issue does not happen with normalize_columns disabled):

import sqlalchemy as sa
print(sa.column("test", sa.String))
print(sa.column("TEST", sa.String))

test
"TEST"

With normalize_columns, we send a lowercase column label to sa.column() which then does not quote it.
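
The behavior above is consistent with a simple quoting rule: quote a name when it contains uppercase characters, is a reserved word, or has characters outside a legal set. The sketch below is a simplified model of that rule, not SQLAlchemy's code; needs_quotes, render, RESERVED, and LEGAL are hypothetical names, and the real logic lives in the dialect's identifier preparer and may differ per database.

```python
import re

RESERVED = {"select", "order", "group", "table"}  # tiny illustrative subset
LEGAL = re.compile(r"^[A-Z0-9_$]+$", re.IGNORECASE)


def needs_quotes(name: str) -> bool:
    """Decide whether a non-empty identifier must be quoted (simplified model)."""
    lowered = name.lower()
    return (
        lowered in RESERVED                      # reserved words
        or name[0].isdigit() or name[0] == "$"   # illegal first character
        or LEGAL.match(name) is None             # spaces, dashes, etc.
        or lowered != name                       # any uppercase character
    )


def render(name: str) -> str:
    """Render an identifier, adding double quotes only when required."""
    return f'"{name}"' if needs_quotes(name) else name


print(render("test"))         # lowercase: left unquoted
print(render("TEST"))         # uppercase: quoted
print(render("count_lower"))  # normalized metric key: left unquoted
```

Under this model a lowercased key such as count_lower is never quoted when it goes through the column path, which matches the unquoted ORDER BY in the summary.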

fix: Edge case with metric not getting quoted in sort by when normali…

1490768

…ze_columns is enabled

pull-request-size bot added the size/XS label May 1, 2025

dosubot bot added the change:backend (Requires changing the backend) and data:connect:snowflake (Related to Snowflake) labels May 1, 2025

korbit-ai bot reviewed May 1, 2025

Vitor-Avila requested review from betodealmeida, eschutho, hughhhh, michael-s-molina, mistercrunch and villebro May 1, 2025 17:52

eschutho approved these changes May 2, 2025

mistercrunch approved these changes May 2, 2025

Vitor-Avila commented May 2, 2025

mistercrunch merged commit 9f0ae77 into master May 3, 2025
58 of 59 checks passed

mistercrunch deleted the fix/quote-sorted-metric branch May 3, 2025 01:20

michael-s-molina added the v5.0 (Label added by the release manager to track PRs to be included in the 5.0 branch) label May 12, 2025

michael-s-molina pushed a commit that referenced this pull request May 12, 2025

fix: Edge case with metric not getting quoted in sort by when normali…

2f3658c

…ze_columns is enabled (#33337) (cherry picked from commit 9f0ae77)

LevisNgigi pushed a commit to LevisNgigi/superset that referenced this pull request Jun 18, 2025

fix: Edge case with metric not getting quoted in sort by when normali…

6d4f1e0

…ze_columns is enabled (apache#33337)

alexandrusoare pushed a commit to alexandrusoare/superset that referenced this pull request Jun 19, 2025

fix: Edge case with metric not getting quoted in sort by when normali…

588550a

…ze_columns is enabled (apache#33337) (cherry picked from commit 9f0ae77)

mistercrunch added the 🍒 5.0.0 (Cherry-picked to 5.0.0) and 🏷️ bot (A label used by `supersetbot` to keep track of which PRs were auto-tagged with release labels) labels Jul 29, 2025

silveira-js mentioned this pull request Jan 7, 2026

Add MapLibre support to charts PortalTelemedicina/superset#4

Merged

9 tasks

This was referenced Jan 15, 2026

Point explicit service account to client PortalTelemedicina/superset#9

Merged

Fix margin parameters PortalTelemedicina/superset#10

Merged

silveira-js mentioned this pull request Jan 26, 2026

Adjust Styling Options PortalTelemedicina/superset#11

Closed
