Skip to content

Conversation

@nastra
Copy link
Contributor

@nastra nastra commented Feb 10, 2023

No description provided.

@nastra nastra requested review from aokolnychyi and rdblue February 10, 2023 06:40
@nastra nastra force-pushed the add-rewritten-bytes-compaction-results branch from 315b1e4 to 8e7d6c8 Compare February 10, 2023 07:33
Copy link
Contributor

@Fokko Fokko left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

private final FileGroupInfo info;

/**
* @deprecated Will be removed in 1.3.0; use {@link
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we create an issue for this, and add it to the 1.3.0 milestone?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'll usually clean things like this up right after a release, so I'll open an issue for that shortly

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Awesome, just to make sure that's in the collective memory of the community :)

Copy link
Member

@RussellSpitzer RussellSpitzer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm on board for this adjustment. Only issue is the test suite become a little brittle with the change. I left a note on how I think we can keep it from requiring a lot of patching in the future.

@nastra nastra force-pushed the add-rewritten-bytes-compaction-results branch from 8e7d6c8 to 43465c7 Compare February 10, 2023 15:45
@nastra nastra force-pushed the add-rewritten-bytes-compaction-results branch from 43465c7 to f7aa973 Compare February 14, 2023 08:46
insertData(tableName(QUOTED_SPECIAL_CHARS_TABLE_NAME), 10);
insertData(tblName, 10);
// TODO: metadata table access currently fails with special chars in the table name
// long dataSizeBefore = testDataSize(tblName);
Copy link
Contributor Author

@nastra nastra Feb 14, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this seems like a bug to me where the table with special characters can't be found when running SELECT sum(file_size_in_bytes) from %s.files. It fails with

Caused by: java.io.FileNotFoundException: File file:/tmp/warehouse2890706410427132468.tmp/default/table:with.special:chars/metadata/2ae7cad2-3cff-4c92-9536-9bf9652f119d-m1.avro does not exist
	at org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:779)
	at org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:1100)
	at org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:769)
	at org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:462)
	at org.apache.hadoop.fs.ChecksumFileSystem$ChecksumFSInputChecker.<init>(ChecksumFileSystem.java:160)
	at org.apache.hadoop.fs.ChecksumFileSystem.open(ChecksumFileSystem.java:372)
	at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:976)
	at org.apache.iceberg.hadoop.HadoopInputFile.newStream(HadoopInputFile.java:183)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

turns out this happened because of caching in the Hadoop catalog. Adding cache-enabled=false fixes this issue

@nastra nastra force-pushed the add-rewritten-bytes-compaction-results branch from e78ef43 to a017a34 Compare February 14, 2023 10:36
@nastra nastra force-pushed the add-rewritten-bytes-compaction-results branch from a017a34 to a00a9bd Compare February 15, 2023 08:56
@RussellSpitzer RussellSpitzer merged commit 49d833a into apache:master Feb 15, 2023
@RussellSpitzer
Copy link
Member

Thanks for the PR @nastra and for help with review @Fokko !

snazy pushed a commit to projectnessie/nessie that referenced this pull request Feb 16, 2023
…#6117)

Yesterday this PR got merged
(apache/iceberg#6801) which introduces one more
output value. Hence, the strict check fails.

This PR is to unblock `query-engine-integration-tests` 

Part of fixing #6114
krvikash pushed a commit to krvikash/iceberg that referenced this pull request Mar 16, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants