-
Notifications
You must be signed in to change notification settings - Fork 2.5k
[HUDI-4237] should not sync partition parameters when create non-partition table in spark #6525
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@XuQianJin-Stars @alexeykudinkin could you check if this is needed? Functionality-wise, is the fix necessary? |
|
@dujl can you please update the PR description w/ the crux of the issue? The one in Jira is very detailed (thanks for providing it!), but it's important to make sure PRs also have detailed descriptions as well. |
done |
|
@alexeykudinkin please help to review and approve |
|
Approved already. @nsivabalan can you please help landing this one? |
|
Hi @XuQianJin-Stars , Can you land this bugfix? |
|
hi @dujl the ci is failed. |
|
ok,i will fix it<br/><br/><br/><html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body>
<div class="ntes-mailmaster-quote" style="padding-top: 1px; padding-bottom: 1px" >
<div style=" margin-top: 20px; margin-bottom: 12px; font-size: 14px; line-height: 1.25; color: #89898c; " >---- Replied Message ----</div>
<div style=" margin-bottom: 12px; font-size: 13px; line-height: 1.25; color: #2c2c2e; padding: 2px; border-radius: 8px; background-color: #f0f0f0; " >
<table width="100%" cellpadding="0" cellspacing="10" border="0">
<tr>
<td valign="top" style=" width: 4em; font-size: 13px; line-height: 1.25; color: #89898c; white-space: nowrap; " >From</td>
<td valign="top" style=" font-size: 13px; line-height: 1.25; color: #2c2c2e; word-break: break-all; " ><a class="mail-from" style="color: #1c83eb; text-decoration: none" ***@***.***" ***@***.***></a></td>
</tr>
<tr>
<td valign="top" style=" width: 4em; font-size: 13px; line-height: 1.25; color: #89898c; white-space: nowrap; " >Date</td>
<td class="mail-date" valign="top" style=" font-size: 13px; line-height: 1.25; color: #2c2c2e; word-break: break-all; " >09/17/2022 10:56</td>
</tr>
<tr style="">
<td valign="top" style=" width: 4em; font-size: 13px; line-height: 1.25; color: #89898c; " >To</td>
<td valign="top" style=" font-size: 13px; line-height: 1.25; color: #2c2c2e; word-break: break-all; " ><a class="mail-to" style="color: #1c83eb; text-decoration: none" ***@***.***" ***@***.***></a></td>
</tr>
<tr style="">
<td valign="top" style=" width: 4em; font-size: 13px; line-height: 1.25; color: #89898c; " >Cc</td>
<td valign="top" style=" font-size: 13px; line-height: 1.25; color: #2c2c2e; word-break: break-all; " ><a class="mail-cc" style="color: #1c83eb; text-decoration: none" ***@***.***" ***@***.***></a>、<a class="mail-cc" style="color: #1c83eb; text-decoration: none" ***@***.***" ***@***.***></a></td>
</tr>
<tr>
<td valign="top" style=" width: 4em; font-size: 13px; line-height: 1.25; color: #89898c; " >Subject</td>
<td class="mail-subject" valign="top" style=" font-size: 13px; line-height: 1.25; color: #2c2c2e; word-break: break-all; " >Re: [apache/hudi] [HUDI-4237] should not sync partition parameters when create non-partition table in spark (PR #6525)</td>
</tr>
</table>
</div>
<div><p></p>
<p dir="auto">hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/dujl/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" ***@***.***</a> the ci is failed.</p>
<p style="font-size:small;-webkit-text-size-adjust:none;color:#666;">—<br />Reply to this email directly, <a href="#6525 (comment)">view it on GitHub</a>, or <a href="https://github.com/notifications/unsubscribe-auth/ABBN7FKORZJWFC4K3J4P22DV6UXNVANCNFSM5744G2VQ">unsubscribe</a>.<br />You are receiving this because you were mentioned.<img src="https://github.com/notifications/beacon/ABBN7FJXCNOSCRLICEE6NYDV6UXNVA5CNFSM5744G2V2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOJKAUANQ.gif" height="1" width="1" alt="" /><span style="color: transparent; font-size: 0; display: none; visibility: hidden; overflow: hidden; opacity: 0; width: 0; height: 0; max-width: 0; max-height: 0; mso-hide: all">Message ID: <span><apache/hudi/pull/6525/c1249984566</span><span>@</span><span>github</span><span>.</span><span>com></span></span></p>
<script type="application/ld+json">[
{
***@***.***": "http://schema.org",
***@***.***": "EmailMessage",
"potentialAction": {
***@***.***": "ViewAction",
"target": "#6525 (comment)",
"url": "#6525 (comment)",
"name": "View Pull Request"
},
"description": "View this Pull Request on GitHub",
"publisher": {
***@***.***": "Organization",
"name": "GitHub",
"url": "https://github.com"
}
}
]</script></div>
</div>
</body>
</html>
|
|
@dujl It's likely due to CI flakiness. Could you rebase this PR on the latest master? |
…rtition hudi table in Spark
5d16a4e to
7506862
Compare
|
@hudi-bot run azure |
|
@dujl The failed tests in GH action are reproducible. Could you look into those? |
ok, i will check it |
|
close in favor or #6821 |
issue description
Create a non-partition hudi table in Spark,it will store spark.sql.sources.schema.partCol.0 with an empty value in hiveMetastore. This is unexpected behavior, it should not store spark.sql.sources.schema.partCol.0 in HiveMetastore when it is a non-partition table.
Steps to reproduce the behavior:
it include partition.fields key with an empty value
it has a value "".
Change Logs
When init a non-partition hoodie table, should set PartitionFields as null instead of empty string "".
Then after sync table meta to hiveMetaStore, it will not store spark.sql.sources.schema.partCol.
Impact
fix the bug when create non-partition table in spark
more detail see jira https://issues.apache.org/jira/browse/HUDI-4237
Risk level: none | low | medium | high
low
Contributor's checklist