[FEATURE]: Migrate `spark.table("db.table")` to `spark.table("catalog.db.table")` #1082

nfx · 2024-03-21T18:53:24Z

Is there an existing issue for this?

I have searched the existing issues

Problem statement

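Every table in Unity Catalog (UC) needs a catalog, so two-level `db.table` references in PySpark code must become three-level `catalog.db.table` references.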

Proposed Solution

Transform the AST/CST with the migrated table index, using the fixer framework declared in #1067, for calls such as:

spark.table(...)
spark.read.table(...)
...write.saveAsTable(...)
...
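A minimal sketch of the idea, assuming only string-literal arguments are handled. It uses the standard-library `ast` module rather than the UCX fixer framework; `MIGRATED`, `TableNameFixer` and `fix` are hypothetical names invented for illustration, and the index contents are made up.

```python
import ast

# Hypothetical migration index: legacy "db.table" -> "catalog.db.table".
MIGRATED = {"old_db.orders": "main.sales.orders"}

# Method names whose first positional argument is a table name.
TABLE_METHODS = {"table", "saveAsTable"}


class TableNameFixer(ast.NodeTransformer):
    """Rewrites string-literal table names that appear in the index."""

    def visit_Call(self, node: ast.Call) -> ast.Call:
        self.generic_visit(node)  # visit nested calls in chained expressions
        func = node.func
        if not (isinstance(func, ast.Attribute) and func.attr in TABLE_METHODS):
            return node
        if not node.args:
            return node
        arg = node.args[0]
        # Only string constants are rewritten; computed values are left alone.
        if isinstance(arg, ast.Constant) and isinstance(arg.value, str):
            target = MIGRATED.get(arg.value)
            if target is not None:
                node.args[0] = ast.copy_location(ast.Constant(value=target), arg)
        return node


def fix(source: str) -> str:
    """Returns the source with migrated table names (requires Python 3.9+)."""
    tree = TableNameFixer().visit(ast.parse(source))
    return ast.unparse(ast.fix_missing_locations(tree))


print(fix('df = spark.table("old_db.orders")'))  # df = spark.table('main.sales.orders')
```

Note that `ast.unparse` normalizes quoting and drops comments and formatting, which is one reason a CST-based rewrite may be preferable for real notebooks.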

Add another Linter/Fixer to https://github.com/databrickslabs/ucx/blob/main/src/databricks/labs/ucx/code/pyspark.py

Additional Context

No response


ericvergnaud · 2024-04-01T11:31:23Z

As the list of candidate function calls eligible for migration grows, our currently minimalistic approach (checking just the name and arguments of the function being called) might increase the risk of unwanted migrations.
Also, if the argument is not a string literal, the migration will be skipped.
Is that something we want to cater for, and if it is, should it be done as part of this ticket?
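Both concerns can be illustrated with a small scanner. This is a sketch under stated assumptions, not the UCX linter: `Finding`, `scan` and `SAMPLE` are hypothetical names, and the check deliberately looks only at the method name and the first argument, as described above.

```python
import ast
from dataclasses import dataclass


@dataclass
class Finding:
    line: int
    kind: str    # "candidate" (would be migrated) or "skipped" (not a literal)
    detail: str


def scan(source: str) -> list[Finding]:
    """Flags every `<anything>.table(...)` call, judged by name and first argument only."""
    findings = []
    for node in ast.walk(ast.parse(source)):
        if not (isinstance(node, ast.Call) and isinstance(node.func, ast.Attribute)):
            continue
        if node.func.attr != "table" or not node.args:
            continue
        arg = node.args[0]
        if isinstance(arg, ast.Constant) and isinstance(arg.value, str):
            findings.append(Finding(node.lineno, "candidate", arg.value))
        else:
            findings.append(Finding(node.lineno, "skipped", ast.unparse(arg)))
    return sorted(findings, key=lambda f: f.line)


SAMPLE = '''
df = spark.table("db.orders")
doc = html.table("db.orders")
name = "db." + suffix
df2 = spark.table(name)
'''

for finding in scan(SAMPLE):
    print(finding)
```

The `html.table(...)` call on line 3 is reported as a candidate even though its receiver is not a Spark session, which is the unwanted-migration risk; the call on line 5 is reported as skipped because its argument is a variable.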

nfx · 2024-04-01T11:42:11Z

> As the list of candidate function calls eligible for migration grows, our currently minimalistic approach (checking just the name and arguments of the function being called) might increase the risk of unwanted migrations.

That's correct.

> Also, if the argument is not a string literal, the migration will be skipped.

I created a separate issue to track this:

[FEATURE]: If a code computes a value dynamically, do value inference, at least at the state of linting #1205
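For context, value inference at lint time could look roughly like the following. This is only a sketch of the general idea, not what #1205 implements: `infer_str` and `inferred_table_names` are hypothetical names, and only top-level assignments of string constants and `+` concatenations are followed.

```python
import ast
from typing import Optional


def infer_str(node: ast.expr, env: dict[str, str]) -> Optional[str]:
    """Returns the string value of `node` if it can be determined statically."""
    if isinstance(node, ast.Constant) and isinstance(node.value, str):
        return node.value
    if isinstance(node, ast.Name):
        return env.get(node.id)
    if isinstance(node, ast.BinOp) and isinstance(node.op, ast.Add):
        left = infer_str(node.left, env)
        right = infer_str(node.right, env)
        if left is not None and right is not None:
            return left + right
    return None


def inferred_table_names(source: str) -> list[Optional[str]]:
    """Lists the inferred first argument of each `.table(...)` call, or None."""
    env: dict[str, str] = {}
    names = []
    for stmt in ast.parse(source).body:
        # Resolve calls first, using values known before this statement runs.
        for node in ast.walk(stmt):
            if (isinstance(node, ast.Call) and isinstance(node.func, ast.Attribute)
                    and node.func.attr == "table" and node.args):
                names.append(infer_str(node.args[0], env))
        # Then record simple assignments such as `name = db + ".orders"`.
        if (isinstance(stmt, ast.Assign) and len(stmt.targets) == 1
                and isinstance(stmt.targets[0], ast.Name)):
            target = stmt.targets[0].id
            value = infer_str(stmt.value, env)
            if value is not None:
                env[target] = value
            else:
                env.pop(target, None)  # value is no longer known
    return names
```

With this sketch, `spark.table(name)` resolves when `name` was built from known strings, while an argument such as `get_name()` still yields `None` and would be skipped.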

nfx · 2024-04-12T13:19:08Z

@jimidle ^

ericvergnaud · 2024-04-12T14:07:11Z

@jimidle you could leverage what I did for sys.path in PythonLinter once it's merged to main - fyi @nfx

jimidle · 2024-04-14T03:04:35Z

> @jimidle you could leverage what I did for sys.path in PythonLinter once it's merged to main - fyi @nfx

OK - I will take a look once you merge it. @nfx Do you have a preference for using the fixer framework over Eric's suggestion?

ericvergnaud · 2024-04-14T06:11:22Z

My suggestion is for improved linting, i.e. #1205, not for fixing. It's now merged.

nfx · 2024-04-14T15:09:58Z

@jimidle @ericvergnaud sys.path manipulation and this feature are orthogonal. We already have the fixer framework, and the existing example is with SQL queries.

The implementation that only looks at string constants is trivial and should not take more than a few hours to implement and test.

nfx · 2024-04-16T16:22:51Z

fixed in #1210

nfx added the enhancement (New feature or request) and migrate/jobs (Step 5 - Upgrading Jobs for External Tables) labels Mar 21, 2024

nfx added this to UCX Mar 21, 2024

github-project-automation bot moved this to Triage in UCX Mar 21, 2024

nfx added the migrate/code (Abstract Syntax Trees and other dark magic) label Mar 21, 2024

ericvergnaud mentioned this issue Apr 1, 2024 in: Migrate views sequentially #1177 (Merged, 11 tasks)

nfx mentioned this issue Apr 1, 2024 in: [EPIC] Migrate Python notebooks that belong to a single job #1204 (Open, 14 tasks)

nfx mentioned this issue Apr 1, 2024 in: [FEATURE]: If a code computes a value dynamically, do value inference, at least at the state of linting #1205 (Closed, 1 task)

nfx moved this from Triage to Month Backlog in UCX Apr 10, 2024

nfx added the backlog: jim label Apr 12, 2024

nfx added backlog: constantin and removed backlog: jim labels Apr 16, 2024

nfx closed this as completed Apr 16, 2024

github-project-automation bot moved this from Month Backlog to Archive in UCX Apr 16, 2024
