Skip to content

Conversation

@cloud-fan
Copy link
Contributor

What changes were proposed in this pull request?

This reverts #36625 and its followup #38321 .

Why are the changes needed?

External table location can be arbitrary and has no connection with the database location. It can be wrong to qualify the external table location based on the database location.

If a table written by old Spark versions does not have a qualified location, there is no way to restore it as the information is already lost. People can manually fix the table locations assuming they are under the same HDFS cluster with the database location, by themselves.

Does this PR introduce any user-facing change?

no

How was this patch tested?

N/A

@cloud-fan
Copy link
Contributor Author

cc @wangyum

@github-actions github-actions bot added the SQL label Apr 20, 2023
@HyukjinKwon
Copy link
Member

Merged to master and branch-3.4.

HyukjinKwon pushed a commit that referenced this pull request Apr 21, 2023
…d on database URI

### What changes were proposed in this pull request?

This reverts #36625 and its followup #38321 .

### Why are the changes needed?

External table location can be arbitrary and has no connection with the database location. It can be wrong to qualify the external table location based on the database location.

If a table written by old Spark versions does not have a qualified location, there is no way to restore it as the information is already lost. People can manually fix the table locations assuming they are under the same HDFS cluster with the database location, by themselves.

### Does this PR introduce _any_ user-facing change?

no

### How was this patch tested?

N/A

Closes #40871 from cloud-fan/minor.

Authored-by: Wenchen Fan <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
(cherry picked from commit afd9e2c)
Signed-off-by: Hyukjin Kwon <[email protected]>
@HyukjinKwon
Copy link
Member

Actually ... we need a JIRA since this change is in 3.4.1 but the original change remains in 3.4.0 .. so difficult to set the fixed JRIA version.

@dongjoon-hyun
Copy link
Member

dongjoon-hyun commented May 2, 2023

Too bad.

To @cloud-fan , as Hyukjin mentioned, we should not revert like this after the official release.

cc @sunchao , too.

@cloud-fan cloud-fan changed the title Revert [SPARK-39203][SQL] Rewrite table location to absolute URI based on database URI [SPARK-43373][SQL] Revert [SPARK-39203][SQL] Rewrite table location to absolute URI based on database URI May 4, 2023
@cloud-fan
Copy link
Contributor Author

I've created a new JIRA ticket

snmvaughan pushed a commit to snmvaughan/spark that referenced this pull request Jun 20, 2023
…d on database URI

### What changes were proposed in this pull request?

This reverts apache#36625 and its followup apache#38321 .

### Why are the changes needed?

External table location can be arbitrary and has no connection with the database location. It can be wrong to qualify the external table location based on the database location.

If a table written by old Spark versions does not have a qualified location, there is no way to restore it as the information is already lost. People can manually fix the table locations assuming they are under the same HDFS cluster with the database location, by themselves.

### Does this PR introduce _any_ user-facing change?

no

### How was this patch tested?

N/A

Closes apache#40871 from cloud-fan/minor.

Authored-by: Wenchen Fan <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
(cherry picked from commit afd9e2c)
Signed-off-by: Hyukjin Kwon <[email protected]>
GladwinLee pushed a commit to lyft/spark that referenced this pull request Oct 10, 2023
…d on database URI

### What changes were proposed in this pull request?

This reverts apache#36625 and its followup apache#38321 .

### Why are the changes needed?

External table location can be arbitrary and has no connection with the database location. It can be wrong to qualify the external table location based on the database location.

If a table written by old Spark versions does not have a qualified location, there is no way to restore it as the information is already lost. People can manually fix the table locations assuming they are under the same HDFS cluster with the database location, by themselves.

### Does this PR introduce _any_ user-facing change?

no

### How was this patch tested?

N/A

Closes apache#40871 from cloud-fan/minor.

Authored-by: Wenchen Fan <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
(cherry picked from commit afd9e2c)
Signed-off-by: Hyukjin Kwon <[email protected]>
catalinii pushed a commit to lyft/spark that referenced this pull request Oct 10, 2023
…d on database URI

### What changes were proposed in this pull request?

This reverts apache#36625 and its followup apache#38321 .

### Why are the changes needed?

External table location can be arbitrary and has no connection with the database location. It can be wrong to qualify the external table location based on the database location.

If a table written by old Spark versions does not have a qualified location, there is no way to restore it as the information is already lost. People can manually fix the table locations assuming they are under the same HDFS cluster with the database location, by themselves.

### Does this PR introduce _any_ user-facing change?

no

### How was this patch tested?

N/A

Closes apache#40871 from cloud-fan/minor.

Authored-by: Wenchen Fan <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
(cherry picked from commit afd9e2c)
Signed-off-by: Hyukjin Kwon <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants