@nastra nastra commented Dec 12, 2022

Previously, an unpartitioned table would produce a "partitions." entry in the snapshot summary when the partition summary limit was configured.
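To illustrate the behavior described above, here is a minimal sketch, not Iceberg's actual implementation. It assumes the bare "partitions." key comes from appending an empty partition path to the prefix, which is my reading of the key's shape rather than something confirmed here. `PartitionSummarySketch` and `partitionEntries` are hypothetical names, and the constant only mirrors the prefix seen in the summary keys.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class PartitionSummarySketch {
  // Hypothetical constant mirroring the "partitions." prefix of the summary keys.
  static final String CHANGED_PARTITION_PREFIX = "partitions.";

  // Builds per-partition summary entries keyed by prefix + partition path.
  // Assumption: an unpartitioned table yields an empty partition path, so it is
  // skipped here to avoid emitting a bare "partitions." key.
  static Map<String, String> partitionEntries(Map<String, String> metricsByPartitionPath) {
    Map<String, String> summary = new LinkedHashMap<>();
    for (Map.Entry<String, String> entry : metricsByPartitionPath.entrySet()) {
      String path = entry.getKey();
      if (path == null || path.isEmpty()) {
        continue; // no partition path, so no per-partition entry
      }
      summary.put(CHANGED_PARTITION_PREFIX + path, entry.getValue());
    }
    return summary;
  }

  public static void main(String[] args) {
    Map<String, String> metrics = new LinkedHashMap<>();
    metrics.put("", "added-delete-files=1"); // unpartitioned case
    metrics.put("data=xyz", "added-data-files=1,added-records=1,added-files-size=10");
    System.out.println(partitionEntries(metrics));
  }
}
```

With this guard, only real partition paths such as `data=xyz` end up in the summary, so a test should look up a concrete key like "partitions.data=xyz" rather than the bare prefix.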


Assert.assertTrue(
"Partition metrics must be correct",
summary.get(CHANGED_PARTITION_PREFIX).contains(ADDED_DELETE_FILES_PROP + "=1"));

This assertion was in fact checking the following entry, which we no longer produce:

 "partitions." -> "added-position-delete-files=1,added-delete-files=1,added-files-size=10,added-position-deletes=1"

The summaries map also had the following entries for the 3 partitions:

 "partitions.data_bucket=0" -> "added-position-delete-files=1,added-delete-files=1,added-files-size=10,added-position-deletes=1"
 "partitions.data=abc" -> "added-position-delete-files=1,added-delete-files=1,added-files-size=10,added-position-deletes=1"
 "partitions.data=xyz" -> "added-data-files=1,added-records=1,added-files-size=10"

@nastra nastra force-pushed the partition-summary branch 2 times, most recently from 7647339 to b18fac9 Compare December 12, 2022 14:13
@github-actions github-actions bot added the spark label Dec 12, 2022
@nastra nastra requested a review from rdblue December 19, 2022 08:29
@rdblue rdblue merged commit b582246 into apache:master Dec 19, 2022
rdblue commented Dec 19, 2022

Merged! Thanks, @nastra.

@nastra nastra deleted the partition-summary branch December 20, 2022 06:54