Merged
1009 commits
87242c0
Parquet: Support row group bloom filters (#4938)
huaxingao Jun 13, 2022
06bd31e
Docs: Fix copy paste: s/parquet/avro/ (#5020)
Fokko Jun 13, 2022
c7d758c
Python: Add identity transform (#4908)
jun-he Jun 13, 2022
38145f5
Core: Enforce close for in-memory files (#5019)
findepi Jun 14, 2022
547152c
Build: Avoid running engine tests on LICENSE and NOTICE update (#5022)
singhpk234 Jun 14, 2022
98dc9fe
Core: Update ExpireSnapshots impl for branching and tagging (#4578)
amogh-jahagirdar Jun 14, 2022
8d8b606
Nessie: Properly close all resources in NessieCatalog (#5047)
nastra Jun 15, 2022
a7f7c1a
Core: Add reader and writer for Puffin index and stats format (#4537)
findepi Jun 15, 2022
5653a0d
AWS: add skip name validation config for glue namespace and table (#5…
xingfanx Jun 15, 2022
a38e663
Python: Don't use a metaclass for the Singleton (#5055)
Fokko Jun 15, 2022
13db47e
Core: In HadoopTableOperations, replace Util.getFs call with getFileS…
chakradhar2 Jun 16, 2022
6c510d6
Python: Add Flake8 for simple checks (#5073)
Fokko Jun 17, 2022
c9442f1
Flink: FLIP-27 source enumerator (#4986)
stevenzwu Jun 17, 2022
bf2f33a
Puffin: Inline PuffinWriter.writeFully (#5088)
findepi Jun 20, 2022
5f74b33
Open-api: RenameTableRequest props are required (#5093)
Fokko Jun 20, 2022
c713475
Python: Add py.typed file for annotations (#5091)
Fokko Jun 20, 2022
282e976
Include actual class name in class mismatch exception (#5089)
findepi Jun 20, 2022
a2c0e13
Fix type of fields list in Puffin (#5087)
findepi Jun 20, 2022
da63c84
Python: Add Avro read path (#4920)
Fokko Jun 20, 2022
a64417c
Flink: Port #4986, FLIP-27 source enumerator to 1.14 module (#5078)
stevenzwu Jun 20, 2022
3bd6107
Core: Add Javadoc for LoadTableResponse, fix error prone warnings (#5…
kbendick Jun 20, 2022
a3c44ee
Core: Add serialization tests for CreateTableRequest (#5052)
kbendick Jun 20, 2022
ce11427
Python: Bump pre-commit versions (#5074)
Fokko Jun 20, 2022
120cbe9
AWS: add skip name validation to isValidIdentifier (#5081)
xingfanx Jun 20, 2022
58a9c6d
Python: Replace vars with cached_property decorator (#5068)
Fokko Jun 21, 2022
4959e54
Core: Send format version for create table in REST catalog (#5084)
bryanck Jun 21, 2022
2f550cd
Python: Renable pylint (#5066)
Fokko Jun 21, 2022
e335086
Core: Check for valid identifiers in REST catalog (#5107)
bryanck Jun 21, 2022
08c8764
Hive: Print db and table name while acquiring hive meta-store lock (#…
krisdas Jun 22, 2022
a5efb53
Core: Metadata table queries fail if a partition column was reused in…
szlta Jun 22, 2022
ac8733d
Core: Support FileIO prefix operations (#5096)
danielcweeks Jun 22, 2022
88a4d9e
Core: Remove Usage of Sets.Union in Loop (#5114)
RussellSpitzer Jun 22, 2022
ae19482
Parquet: Fix VectorizedParquetDefinitionLevelReader with direct ByteB…
huaxingao Jun 23, 2022
7d3ed17
Nessie: Fix drop/rename table with TableReference in identifier (#5033)
ajantha-bhat Jun 23, 2022
51fbc56
Spec: Typo an to a (#5123)
Fokko Jun 23, 2022
a741a30
Python: Add flake8 bugbear (#5117)
Fokko Jun 23, 2022
720d8d9
Core: Reuse REST client for catalog and table operations (#5125)
danielcweeks Jun 23, 2022
6ef656b
Spark 3.2: RewriteDataFiles - Escape special characters in table iden…
bijanhoule Jun 23, 2022
51827cf
Docs: Change docs layout for new iceberg-theme (#5115)
samredai Jun 23, 2022
95dfb2b
Flink: Add option to limit the number of snapshots in a planning oper…
chenjunjiedada Jun 24, 2022
781f1bc
ORC: Upgrade to 1.7.5 (#5070)
williamhyun Jun 24, 2022
0eeec8b
Python: Add flake8-comprehensions (#5130)
Fokko Jun 24, 2022
1fc5058
Docs: Use more specific hidden partitioning anchor link (#5128)
a49a Jun 24, 2022
35b8558
Flink: FLIP-27 Iceberg source and builder (#5109)
stevenzwu Jun 24, 2022
0e2f690
checkStyle errors
Jun 25, 2022
313f497
Spark: Add compute stats to scan builder also (#5136)
bryanck Jun 27, 2022
443fee4
Parquet: Release the compressor when closing ParquetWriter (#5126)
hmit Jun 27, 2022
4956ec6
Checkstyle: Add rule for %d in Preconditions.checkArgument (#5057)
xrl1 Jun 27, 2022
eff6556
Arrow: Avoid extra dictionary buffer copy (#5137)
bryanck Jun 27, 2022
e0ff3c6
Core: Fix REST field name case strategy (#5133)
rdblue Jun 27, 2022
bd94084
Python: Update dependencies (#5142)
Fokko Jun 27, 2022
dadeef2
Docs: A few small fixes to docs markdown files (#5127)
samredai Jun 27, 2022
98dc1e6
Python: Use Pydantic for serialization and deserialization (#5011)
Fokko Jun 27, 2022
4ce3900
Core: Fix partition clustering to produce table sort order (#5131)
rdblue Jun 27, 2022
5590d1e
Core: Fix tag ancestor snapshot handling (#5034)
rdblue Jun 28, 2022
2262f67
Build: Print used JDK version (#5138)
nastra Jun 28, 2022
c27efc0
Spark: Validate HMS uri in SparkSessionCatalog (#5134)
flyrain Jun 28, 2022
7d6bbc4
Spark: Add __metadata_col for metadata columns when converting types …
singhpk234 Jun 28, 2022
612fa2d
Revert "Spark: Use original option keys in SparkCatalog (#4903)"
rdblue Jun 28, 2022
7a783b6
MR: Clone ANYWHERE location array IcebergSplit (#4984)
findepi Jun 28, 2022
23b97c1
API: Refactor ScanTask hierarchy (#5077)
aokolnychyi Jun 28, 2022
0775af1
Core: Add source snapshot info to Puffin Blob metadata (#5129)
findepi Jun 28, 2022
c1dee75
API: Support composite types in Accessors (#5067)
dimas-b Jun 28, 2022
2c6f097
Docs: Add reference to Apache Impala documentation (#5146)
samredai Jun 28, 2022
71cbe8b
Core: Fix CreateTableRequest to use field names from the REST spec (#…
kbendick Jun 28, 2022
a5eb4e4
Core: Update CreateTableRequest Javadoc (#5144)
kbendick Jun 28, 2022
f7add9f
move spark/v3.2 to spark/v3.3
Jun 28, 2022
3e2ab64
make spark 3.3 work
Jun 20, 2022
e28b288
copy 3.2 files from 3.3
Jun 28, 2022
b9dfdc2
enable ci for 3.2
Jun 20, 2022
2229881
Docs: Update README.md project description.
rdblue Jun 28, 2022
bcfb7a8
Merge branch 'master' of https://github.com/apache/iceberg
Jun 28, 2022
a2619b8
Spec: Fix typo in REST catalog spec (#5152)
kbendick Jun 28, 2022
b814bf2
Python: Make the VoidTransform a singleton (#5149)
Fokko Jun 28, 2022
31e1ef4
Spark: Fix regression from Scan refactor (#5143)
bryanck Jun 28, 2022
878022b
Core: Test serialization of LoadTableResponse (#5118)
kbendick Jun 29, 2022
3f3a987
Spark 3.3: Support AS OF syntax in Spark SQL for time travel (#5156)
singhpk234 Jun 29, 2022
6b40ec7
Python: Resolve write/read schemas (#5116)
Fokko Jun 29, 2022
faa4c55
Core: Make StreamingDelete public (#5148)
gustavoatt Jun 29, 2022
e9b47f7
Spark: Port performance fix for Spark 3.2 to 3.3 (#5155)
bryanck Jun 29, 2022
7f152b1
Spark 3.x: Use original option keys in SparkCatalog (#5044)
rajarshisarkar Jun 29, 2022
3147901
Docs: Add Flink SQL client -i docs (#5003)
ymZhao1001 Jun 29, 2022
2dab406
Spec: Fix language in spec ORC type notes (#4976)
szehon-ho Jun 29, 2022
b94716f
Nessie: Update to 0.30.0 (#4780)
snazy Jun 29, 2022
b4ed29b
Core: Avoid snapshot ID collisions (#4747)
ajantha-bhat Jun 29, 2022
e1b67dc
Flink: Remove unnecessary switch statement (#4737)
Jun 29, 2022
336197f
AWS: Support overriding endpoint in DynamoDB (#4726)
ebyhr Jun 29, 2022
3663497
API: Fix version exception when registering metadata (#4946)
ajantha-bhat Jun 29, 2022
7e30bec
Core: Add reference_snapshot_id filter column to all_manifests table …
szehon-ho Jun 29, 2022
dd5b350
Docs: Update Hive doc page with the 4.0.0-alpha-1 features (#5161)
samredai Jun 30, 2022
3fca5e5
API: Fix typo Canout to Cannot (#5164)
bianqi520 Jun 30, 2022
f72442f
Python: Add truncate transform (#5030)
jun-he Jun 30, 2022
342ec3c
API: Access deleted and added delete files in Snapshot (#5105)
aokolnychyi Jun 30, 2022
da37818
Python: Rename python top-level module pyiceberg, remove src (#5169)
samredai Jun 30, 2022
bd495ec
Parquet: Add bloom filter options to the write path (#5035)
huaxingao Jun 30, 2022
dec5679
Parquet: Fix bloom filter check for corrupt files (#5172)
huaxingao Jun 30, 2022
63acd31
Python: Move Transforms to Pydantic (#5170)
Fokko Jul 1, 2022
509bdc1
Spec: Sort-order order-id is mandatory (#5177)
Fokko Jul 1, 2022
7e1ade8
Arrow: Pad decimal bytes before passing to decimal vector (#5168)
bryanck Jul 1, 2022
0420cde
API: Add generic FileIO JSON serialization (#5178)
rdblue Jul 3, 2022
6ec04dc
Python: Pin bugbear in Python legacy (#5184)
Fokko Jul 3, 2022
c69a3dd
API: Add a scan for changes (#4870)
aokolnychyi Jul 3, 2022
ff6cf54
Flink: Support write options in FlinkSink builder (#3998)
hililiwei Jul 3, 2022
36d0b91
Spark 3.3: Re-enable 2-level Parquet list test (#5179)
singhpk234 Jul 3, 2022
9742ea0
API: Add java type to composite TypeID enums (#5154)
dimas-b Jul 3, 2022
81dec35
Docs: Add Flink and Iceberg type compatibility tables (#4865)
wuwenchi Jul 3, 2022
da7dd94
Flink: Fix typo in FlinkSink string (#5176)
a49a Jul 5, 2022
e3e8a15
Docs: Fix typo in docs (#5202)
samredai Jul 5, 2022
d563d6d
Core: Fix ErrorProne Warnings (#5200)
nastra Jul 5, 2022
c8b97c9
Arrow: Fix for dictionary encoded fixed length binary decimals (#5198)
bryanck Jul 5, 2022
8d80888
Merge branch 'master' of https://github.com/apache/iceberg
Jul 6, 2022
a265687
Core: Defer reading Avro metadata until ManifestFile is read (#5206)
rdblue Jul 6, 2022
5c85b2f
Core: Add length arg to FileIO.newInputFile (#5207)
rdblue Jul 6, 2022
d397ae3
Build: Update Parquet and Avro dependencies (#5188)
rdblue Jul 6, 2022
56c1993
Spark: Add procedure to publish WAP changes using wap.id (#4715)
edgarRd Jul 6, 2022
3959e2f
Hive: Return null if Hive inspects a null struct record (#4283)
tprelle Jul 6, 2022
d8d212f
Spark 3.3: Skip pushdown of non-evaluable filters (#5204)
huaxingao Jul 6, 2022
96975d7
Arrow/AWS/Core/Hive: Fix ErrorProne warnings (#5212)
nastra Jul 7, 2022
761eadc
Core: Use table partitioning with manual sort order (#5187)
rdblue Jul 7, 2022
d131c35
Core: Fix REST catalog when reverting DDL changes. (#5226)
rdblue Jul 8, 2022
c0ccb00
Spark-3.2: Support Zorder option for rewrite_data_files stored proced…
ajantha-bhat Jul 8, 2022
8e9e333
Spark: Provide size estimate for table broadcast (#5225)
bryanck Jul 8, 2022
a253817
Core: Update MetricsConfig to use a default for first 32 columns (#5215)
rdblue Jul 8, 2022
b0937a4
Spark 3.2: Skip pushdown of non-evaluable filters (#5227)
huaxingao Jul 8, 2022
637b2e8
adding setters and getters for targetBranch
Jul 9, 2022
43faba0
Flink: Read RowData by default in FLIP-27 source (#5220)
stevenzwu Jul 10, 2022
0f13b2a
Flink 1.14: FLIP-27 Iceberg source and builder, port #5109 (#5191)
zoucao Jul 10, 2022
f281ec0
Core: Document prefix in REST catalog spec (#5233)
Fokko Jul 10, 2022
f8d1249
Build: Add task to generate a git properties file (#5228)
kbendick Jul 10, 2022
791e7e6
Build: Add iceberg-build.properties to Jars and release process (#5236)
rdblue Jul 10, 2022
d1d7087
Python: Reenable mypy (#5171)
Fokko Jul 10, 2022
4d055f8
Core: Improve in and not_in predicate eval for all_manifests.referenc…
szehon-ho Jul 10, 2022
117625e
Python: Remove operation enum, add In expression (#4816)
CircArgs Jul 10, 2022
643ecc2
REST: Create commit catalog handler fix (#5235)
bryanck Jul 10, 2022
82ef2dc
Spark-3.3: Support Zorder option for rewrite_data_files stored proced…
ajantha-bhat Jul 11, 2022
62739b8
API: Add IcebergBuild info (#5237)
rdblue Jul 11, 2022
3754cbd
Core: Add build info to REST requests in HTTP headers (#5238)
kbendick Jul 11, 2022
ecd1b69
Docs: Add time travel examples for Spark SQL (#5180)
singhpk234 Jul 11, 2022
7338103
API: Fix API Javadoc (#5190)
kbendick Jul 11, 2022
1629c53
Spark 3.3: Add procedure to publish WAP changes using wap.id (#5223)
edgarRd Jul 11, 2022
60fb199
Flink 1.14: Read RowData by default in FLIP-27 source, port of #5220 …
stevenzwu Jul 11, 2022
7279859
Python: Add Makefile argument to pass args to pytest (#5246)
samredai Jul 11, 2022
fa07d9f
Core: Fix wrapper performance for delete filters (#5249)
rdblue Jul 11, 2022
6603a81
Python: Add sort order fields (#5124)
Fokko Jul 11, 2022
0f5b7eb
Python: Add BindVisitor to bind a BooleanExpression to a Schema (#5239)
samredai Jul 11, 2022
6cc4a19
Spark: Add Spark catalog for loading tables from cache (#5247)
aokolnychyi Jul 12, 2022
b493bde
Spark 3.3: Rename output columns in register_table procedure (#5253)
aokolnychyi Jul 12, 2022
71aa529
Core: Load equality deletes only initialize once per DeleteFilter (#5…
shidayang Jul 12, 2022
5f5d699
Core: Fix REST catalog metadata table loading. (#5255)
rdblue Jul 12, 2022
1533eb1
Python: Move Snapshot to Pydantic (#5201)
Fokko Jul 12, 2022
d3953cf
Core, AWS: Remove throw in finally block (#5222)
nastra Jul 12, 2022
600b293
Spark 3.2: Rename output columns in register_table procedure (#5260)
aokolnychyi Jul 12, 2022
883f867
Build: Fix gradle warning for optimizations disabled for API tests (#…
kbendick Jul 12, 2022
eff879d
Core: Add copy(withStats) to ContentFile (#5213)
nastra Jul 12, 2022
1afce60
Python: Bump pre-commit versions (#5203)
Fokko Jul 12, 2022
eeb0af1
Flink: Port #3998 to Flink 1.13 & 1.14 (#5194)
hililiwei Jul 12, 2022
774b2f7
API: Fix exception thrown in runSafely, suppress error prone warning …
kbendick Jul 12, 2022
2ad3c1a
Spark 3.3: Expose action classes in SparkActions (#5257)
aokolnychyi Jul 12, 2022
6d2edd6
Spark 3.2: Expose action classes in SparkActions (#5261)
aokolnychyi Jul 12, 2022
e58ff70
Spark: Log warning on filter pushdown failure (#5254)
huaxingao Jul 13, 2022
52aa642
Build: Upgrade test dependencies (#5210)
XN137 Jul 13, 2022
51a9702
Update License Header (#5265)
nastra Jul 13, 2022
8495141
Python: Move PartitionSpec to Pydantic (#5192)
Fokko Jul 13, 2022
6a06cb3
Flink 1.13, 1.14: Port maxPlanningSnapshotCount configuration from #4…
chenjunjiedada Jul 13, 2022
90fe0ed
Build: Unify github action versions (#5211)
XN137 Jul 14, 2022
95f9adb
Build: Exclude unnecessary git properties from iceberg-build.properti…
singhpk234 Jul 15, 2022
64ed2a7
Spark: Remove stack trace from log for filter pushdown failure (#5274)
huaxingao Jul 15, 2022
8155e5e
Spark: Update Spark 2.4 JMH instructions for renamed module (#5189)
kbendick Jul 15, 2022
4e7ad7e
Dell: Fix bugs during documenting (#5059)
wang-x-xia Jul 16, 2022
36d6910
Hive: Fix typo depracated to deprecated (#5285)
bianqi520 Jul 16, 2022
9a3349b
Build: Fix Scala 2.13 builds in stage-binaries.sh. (#5270)
rdblue Jul 17, 2022
344534d
Build: Add iceberg-build.properties to RAT excludes. (#5262)
rdblue Jul 17, 2022
88b1812
AWS: DynamoCatalog: Pass CommitFailedException up the stack without w…
waifairer Jul 18, 2022
8300b21
Build: Use Google Java Format for spotless (#5266)
nastra Jul 19, 2022
f655c56
AWS: avoid retry for AWS service user errors in GlueTableOperations (…
xingfanx Jul 19, 2022
20c4750
AWS: Fix setup of S3 batch deletion tests (#5289)
amogh-jahagirdar Jul 19, 2022
21b504c
Prevent usage of @Test(expected = ...) and change existing tests (#5221)
nastra Jul 19, 2022
9f6f687
API: Avoid hasNext in CloseableIterable.concat (#5306)
nastra Jul 20, 2022
2df9484
API: Introduce DefaultMetricsContext and Timer interface (#5286)
nastra Jul 20, 2022
2946be9
Docs: Fix Flink Connector docs with custom catalog (#5045)
nastra Jul 20, 2022
e364716
Spark: Correct SparkCatalog javadoc for supplying custom catalog (#5288)
Jul 20, 2022
87f7ab0
StreamingDelete constructor can be called by subclasses (#5271)
gustavoatt Jul 20, 2022
906ba1b
Core: Avoid extra manifest read (#5309)
palaniappa Jul 20, 2022
debad8e
Python: Add __version__ to the package (#5315)
Fokko Jul 21, 2022
0fadeca
Python: Remove .python-version file (#5326)
Fokko Jul 21, 2022
2eefe5e
Build: Upgrade slf4j to 1.7.36 (#5320)
nastra Jul 21, 2022
4a7f2fb
Core: Avoid NPE in SchemaParser if a type is missing (#5291)
Fokko Jul 21, 2022
a0fca88
Build: Upgrade Guava to 31.1-jre (#5322)
nastra Jul 21, 2022
ae96bdf
Core: Add MetadataLogs metadata table (#5063)
singhpk234 Jul 22, 2022
3c0fd8f
Core: Implement BaseMetastoreCatalog.registerTable() (#5037)
Mehul2500 Jul 22, 2022
f8321ca
Python: Bump dependencies to the latest version (#5325)
Fokko Jul 22, 2022
c07aee8
Python: Bump pre-commit plugins to the latest version (#5324)
Fokko Jul 22, 2022
7ea2f24
Python: Remove unused function (#5296)
Fokko Jul 22, 2022
760643b
Spec: Add sequence-number and parent-snapshot-id (#5196)
Fokko Jul 22, 2022
fd14dce
Build: Let revapi compare API compatibility against apache-iceberg-0.…
nastra Jul 22, 2022
f0a2b58
Build: Use apache-iceberg- tag prefix to set SNAPSHOT version (#5341)
nastra Jul 22, 2022
4a8654b
Python: Add dependabot for keeping the dependencies up to date (#5340)
Fokko Jul 22, 2022
be9c491
Spark: Support partition transforms with using any case (#5335)
southernriver Jul 22, 2022
65b803f
Python: Map Manifest onto Pydantic class (#5298)
Fokko Jul 22, 2022
742b3ec
AWS: Fix PUT retry failures by opening new data file streams (#5282)
jfz Jul 22, 2022
eaad5d3
Bump zstandard from 0.17.0 to 0.18.0 in /python (#5342)
dependabot[bot] Jul 22, 2022
650e52d
Core: Print date/time strings with +00:00 zone offset (#5337)
nastra Jul 22, 2022
5951ece
API: Deprecate Counter#count(), add Counter#value() (#5328)
nastra Jul 22, 2022
040a891
AWS: Add LakeFormation Integration tests (#4423)
xiaoxuandev Jul 25, 2022
5e25f2b
Python: Add more expression classes (#5258)
CircArgs Jul 25, 2022
2a6d17f
Spark: Support _deleted metadata column in vectorized reads (#4888)
flyrain Jul 25, 2022
fa6403b
AWS: Make glue endpoint configurable #5095 (#5330)
naushadh Jul 25, 2022
39878c6
Spark 3.3: Fix typo (#5354)
20100507 Jul 26, 2022
4687798
Spec: Add table statistics tracking (#4945)
findepi Jul 26, 2022
e41758d
Core: Support creating custom tasks in ManifestGroup (#5301)
aokolnychyi Jul 26, 2022
cfc2b3f
Spark: Use constants for path, versionAsOf, and timestampAsOf (#5347)
zhaomin1423 Jul 26, 2022
b8dc8c4
Parquet: Use consistent random for bloom filter tests (#5329)
huaxingao Jul 26, 2022
d7b1a87
Spark 3.3: Support different task types in readers (#5248)
flyrain Jul 26, 2022
d5c0aa4
AWS: Verify Mock Invocation in OutputStream Tests (#5317)
abmo-x Jul 26, 2022
7c88e99
AWS: Fix malformed Javadoc (#5359)
kbendick Jul 26, 2022
679c3d4
Spark 3.2: Add prefix mismatch mode for deleting orphan files (#4652)
karuppayya Jul 27, 2022
7a92bf5
Spark 3.2: Support different task types in readers (#5363)
flyrain Jul 27, 2022
1556c13
Build: Memoize getProjectVersion (#5051)
snazy Jul 27, 2022
d393f17
Nessie: Do not delete default branch in tests (#5193)
snazy Jul 27, 2022
2ea5b9e
Core: Add base implementations for changelog tasks (#5300)
aokolnychyi Jul 27, 2022
6e7a5df
Remove conflicting checkstyle checks due to Spotless
nastra Jul 8, 2022
99b41eb
Build: Use google-java-format 1.7 for JDK8 compatibility
nastra Jul 20, 2022
c07f2aa
./gradlew spotlessApply
nastra Jul 27, 2022
dbb8a40
Docs: Mention how to fix code style
nastra Jul 27, 2022
9a58a84
Format: Fix broken link to Parquet LogicalTypes in spec.md (#5352)
skadyan Jul 27, 2022
23c9345
Flink: Support FLIP-27 source through SQL (#5318)
stevenzwu Jul 28, 2022
41570a1
Flink 1.14: Support FLIP-27 source through SQL (#5344)
stevenzwu Jul 28, 2022
3d00780
Parquet: Add option to set page row count limit (#5345)
bryanck Jul 28, 2022
6514d73
AWS: S3FileIOIntegration fix prefix list test (#5383)
amogh-jahagirdar Jul 29, 2022
b67cec5
Spark 3.3: Add prefix mismatch mode for deleting orphan files (#5385)
karuppayya Jul 29, 2022
0b84579
Python: Refactor unary and set expressions (#5362)
rdblue Jul 29, 2022
15c6817
Core: Update TestEnvironmentUtil for environments with no USER (#5353)
stevenzwu Jul 29, 2022
5360cfc
API: Track name and unit in Counter, Timer (#5386)
nastra Jul 30, 2022
9927572
Python: Add REST catalog implementation (#5287)
Fokko Jul 30, 2022
36b974b
Flink: Produce Flink metrics directly (#5393)
stevenzwu Jul 30, 2022
b88f011
Python: Bump fastavro from 1.5.3 to 1.5.4 in /python (#5396)
dependabot[bot] Jul 31, 2022
8d2d27b
Github: Add issue form (#4867)
Fokko Jul 31, 2022
14f4bc1
Build: Add an action to handle stale Github issues (#4949)
kbendick Jul 31, 2022
00e0f7b
Flink: Support write options in the in-line SQL comments (#5050)
hililiwei Aug 1, 2022
aae6155
AWS: Call abortUpload only once when any of the completable future fa…
singhpk234 Aug 1, 2022
aaa67d0
Spark: Implement FunctionCatalog (#5377)
kbendick Aug 1, 2022
5f2ce6e
Python: Refactor expression hierarchy (#5389)
rdblue Aug 1, 2022
e05f2bb
API: Fix ID assignment in schema merging (#5395)
karuppayya Aug 1, 2022
8fb88a7
Spark 3.2: Backport FunctionCatalog to Spark 3.2 (#5411)
kbendick Aug 1, 2022
3d47d46
Nessie: Bump to 0.40.3 (#5406)
snazy Aug 1, 2022
2945f76
Python: Minor REST catalog updates (#5402)
Fokko Aug 1, 2022
d44565b
AWS: S3OutputStream - failure to close should persist on subsequent c…
abmo-x Aug 2, 2022
6d67088
Core: Change table name to metadata_log_entries (#5367)
singhpk234 Aug 2, 2022
ebf8e18
AWS: Cleanup prefix in S3FileIO integration tests (#5413)
amogh-jahagirdar Aug 2, 2022
c87df9a
conflicts
Aug 2, 2022
The diff you're trying to view is too large. We only load the first 3000 changed files.
4 changes: 0 additions & 4 deletions .asf.yaml
@@ -38,7 +38,6 @@ github:
- marton-bod
- nastra
- samarthjain
- szehon-ho
- findepi
- SreeramGarlapati
- samredai
@@ -48,6 +47,3 @@ notifications:
issues: issues@iceberg.apache.org
pullrequests: issues@iceberg.apache.org
jira_options: link label link label

publish:
whoami: asf-site
98 changes: 38 additions & 60 deletions .baseline/checkstyle/checkstyle.xml
@@ -8,10 +8,6 @@
<property name="severity" value="error"/>

<module name="FileTabCharacter"/> <!-- Java Style Guide: Whitespace characters -->
<module name="LineLength"> <!-- Java Style Guide: No line-wrapping -->
<property name="max" value="120"/>
<property name="ignorePattern" value="^package.*|^import.*|a href|href|http://|https://|ftp://"/>
</module>
<module name="NewlineAtEndOfFile"> <!-- Java Style Guide: Line ending: LF -->
<property name="lineSeparator" value="lf"/>
</module>
@@ -51,6 +47,32 @@
<property name="format" value="new JavaSparkContext\(.*\)"/>
<property name="message" value="Prefer using JavaSparkContext.fromSparkContext() instead of calling a constructor directly."/>
</module>
<module name="RegexpSingleline">
<property name="format" value="new HashMap&lt;&gt;\(.*\)"/>
<property name="message" value="Prefer using Maps.newHashMap instead."/>
</module>
<module name="RegexpSingleline">
<property name="format" value="new ConcurrentHashMap&lt;&gt;\(.*\)"/>
<property name="message" value="Prefer using Maps.newConcurrentMap() instead."/>
</module>
<module name="RegexpSingleline">
<property name="format" value="new ArrayList&lt;&gt;\(.*\)"/>
<property name="message" value="Prefer using Lists.newArrayList() instead."/>
</module>
<module name="RegexpSingleline">
<property name="format" value="new HashSet&lt;&gt;\(.*\)"/>
<property name="message" value="Prefer using Sets.newHashSet() instead."/>
</module>
<module name="RegexpSingleline">
<property name="format" value="new LinkedHashSet&lt;&gt;\(.*\)"/>
<property name="message" value="Prefer using Sets.newLinkedHashSet() instead."/>
</module>
<module name="RegexpMultiline">
<property name="fileExtensions" value="java"/>
<property name="matchAcrossLines" value="true"/>
<property name="format" value="Preconditions\.checkArgument\([^;]+%d[^;]+\);"/>
<property name="message" value="Preconditions.checkArgument does not support %d. use %s instead"/>
</module>
<module name="SuppressionFilter"> <!-- baseline-gradle: README.md -->
<property name="file" value="${config_loc}/checkstyle-suppressions.xml"/>
</module>
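The `RegexpMultiline` rule added in this hunk flags `Preconditions.checkArgument` calls whose message template contains `%d`, including calls wrapped over several lines. A minimal sketch of how the pattern behaves, using plain `java.util.regex` rather than Checkstyle itself; the class and method names are hypothetical, and only the regular expression is taken from the rule:

```java
import java.util.regex.Pattern;

public class CheckArgumentFormatRule {
  // Regular expression copied from the RegexpMultiline rule in checkstyle.xml.
  private static final Pattern PERCENT_D =
      Pattern.compile("Preconditions\\.checkArgument\\([^;]+%d[^;]+\\);");

  // Returns true when the source text contains a call the pattern would match.
  public static boolean violates(String source) {
    return PERCENT_D.matcher(source).find();
  }

  public static void main(String[] args) {
    // A call wrapped over several lines, with a %d placeholder in the message.
    String wrapped =
        "Preconditions.checkArgument(\n"
            + "    count > 0,\n"
            + "    \"Invalid count: %d\",\n"
            + "    count);";
    // The same call written with %s, as the rule's message recommends.
    String fixed = "Preconditions.checkArgument(count > 0, \"Invalid count: %s\", count);";

    System.out.println(violates(wrapped));
    System.out.println(violates(fixed));
  }
}
```

The character class `[^;]` also matches line breaks, which is why the wrapped call is caught; the match stops at the first semicolon, so a semicolon inside the message string would hide a `%d` from this pattern.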
@@ -87,13 +109,13 @@
<module name="AvoidStarImport"/> <!-- Java Style Guide: No wildcard imports -->
<module name="AvoidStaticImport"> <!-- Java Style Guide: No static imports -->
<property name="excludes" value="
com.google.common.base.Preconditions.*,
com.palantir.logsafe.Preconditions.*,
java.util.Collections.*,
java.util.stream.Collectors.*,
org.apache.commons.lang3.Validate.*,
org.apache.iceberg.expressions.Expressions.*,
org.apache.iceberg.expressions.Expression.Operation.*,
org.apache.iceberg.DistributionMode.*,
org.apache.iceberg.IsolationLevel.*,
org.apache.iceberg.NullOrder.*,
org.apache.iceberg.MetadataTableType.*,
@@ -105,6 +127,8 @@
org.apache.parquet.schema.OriginalType.*,
org.apache.parquet.schema.PrimitiveType.PrimitiveTypeName.*,
org.apache.spark.sql.functions.*,
org.apache.spark.sql.connector.iceberg.write.RowLevelOperation.Command.*,
org.apache.spark.sql.connector.write.RowLevelOperation.Command.*,
org.junit.Assert.*"/>
</module>
<module name="ClassTypeParameterName"> <!-- Java Style Guide: Type variable names -->
@@ -120,10 +144,6 @@
<property name="exceptionVariableName" value="expected"/>
</module>
<module name="EmptyForInitializerPad"/> <!-- Java Style Guide: Horizontal whitespace -->
<module name="EmptyLineSeparator"> <!-- Java Style Guide: Source file structure -->
<property name="tokens" value="PACKAGE_DEF, IMPORT, CLASS_DEF, ENUM_DEF, INTERFACE_DEF, CTOR_DEF, STATIC_INIT, INSTANCE_INIT, VARIABLE_DEF"/>
<property name="allowNoEmptyLineBetweenFields" value="true"/>
</module>
<module name="EmptyStatement"/> <!-- Java Style Guide: One statement per line -->
<module name="EqualsHashCode"/>
<module name="FallThrough"/> <!-- Java Style Guide: Fall-through: commented -->
@@ -227,20 +247,7 @@
<property name="illegalClassNames" value="com.google.common.cache.CacheBuilder, com.google.common.cache.Cache, com.google.common.cache.LoadingCache"/>
<message key="illegal.type" value="Do not use Guava caches, they are outperformed by and harder to use than Caffeine caches"/>
</module>
<module name="ImportOrder"> <!-- Java Style Guide: Ordering and spacing -->
<property name="groups" value="/.*/"/>
<property name="option" value="bottom"/>
<property name="separated" value="true"/>
<property name="sortStaticImportsAlphabetically" value="true"/>
</module>
<module name="Indentation"> <!-- Java Style Guide: Block indentation: +4 spaces -->
<property name="basicOffset" value="2"/>
<property name="arrayInitIndent" value="4"/>
<property name="lineWrappingIndentation" value="4"/>
<property name="caseIndent" value="2"/>
</module>
<module name="InnerAssignment"/> <!-- Java Coding Guidelines: Inner assignments: Not used -->
<module name="LeftCurly"/> <!-- Java Style Guide: Nonempty blocks: K & R style -->
<module name="MemberName"> <!-- Java Style Guide: Non-constant field names -->
<property name="format" value="^[a-z][a-zA-Z0-9]+$"/>
<message key="name.invalidPattern" value="Member name ''{0}'' must match pattern ''{1}''."/>
@@ -259,19 +266,8 @@
<module name="NoClone"/> <!-- Java Coding Guidelines: Never override Object#finalize or Object#clone -->
<module name="NoFinalizer"/> <!-- Java Coding Guidelines: Never override Object#finalize -->
<module name="NoLineWrap"/> <!-- Java Style Guide: No line-wrapping -->
<module name="NoWhitespaceAfter"> <!-- Java Style Guide: Horizontal whitespace -->
<property name="allowLineBreaks" value="false"/>
<property name="tokens" value="BNOT,DEC,DOT,INC,LNOT,UNARY_MINUS,UNARY_PLUS"/>
</module>
<module name="NoWhitespaceBefore"> <!-- Java Style Guide: Horizontal whitespace -->
<property name="allowLineBreaks" value="true"/>
</module>
<module name="OneStatementPerLine"/> <!-- Java Style Guide: One statement per line -->
<module name="OneTopLevelClass"/> <!-- Java Style Guide: Exactly one top-level class declaration -->
<module name="OperatorWrap"> <!-- Java Style Guide: Where to break -->
<property name="option" value="EOL"/>
<property name="tokens" value="BAND, BOR, BSR, BXOR, DIV, EQUAL, GE, GT, LAND, LE, LITERAL_INSTANCEOF, LOR, LT, MINUS, MOD, NOT_EQUAL, PLUS, QUESTION, SL, SR, STAR "/>
</module>
<module name="OuterTypeFilename"/> <!-- Java Style Guide: File name -->
<module name="PackageAnnotation"/> <!-- Java Style Guide: Package statement -->
<module name="PackageDeclaration"/> <!-- Java Style Guide: Package statement -->
@@ -316,6 +312,10 @@
<property name="format" value="Preconditions\.checkNotNull\((?!.*,)([^()]*(\(([^()]*|\(([^()]*|\([^()]*\))*\))*\)[^()]*)*)\)"/>
<property name="message" value="Use Preconditions.checkNotNull(Object, String)."/>
</module>
<module name="RegexpSinglelineJava">
<property name="format" value="Objects\.(require|)[Nn]onNull.*\(.*\)"/>
<property name="message" value="Prefer using Preconditions.checkNotNull(Object, String)."/>
</module>
<module name="RegexpSinglelineJava"> <!-- Java Coding Guidelines: Check parameters for validity -->
<property name="format" value="Validate\.notNull\((?!.*,)([^()]*(\(([^()]*|\(([^()]*|\([^()]*\))*\))*\)[^()]*)*)\)"/>
<property name="message" value="Use Validate.notNull(Object, String)."/>
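The `Objects\.(require|)[Nn]onNull.*\(.*\)` rule added in this hunk is a single-line pattern, so it is applied to one source line at a time. A small sketch of which lines it matches, again with `java.util.regex` only; the class name, method name, and sample lines are hypothetical:

```java
import java.util.regex.Pattern;

public class NonNullRule {
  // Regular expression copied from the RegexpSinglelineJava rule in checkstyle.xml.
  private static final Pattern OBJECTS_NON_NULL =
      Pattern.compile("Objects\\.(require|)[Nn]onNull.*\\(.*\\)");

  // Returns true when a single source line contains a match for the pattern.
  public static boolean flagged(String line) {
    return OBJECTS_NON_NULL.matcher(line).find();
  }

  public static void main(String[] args) {
    // Direct calls on Objects are matched, with or without a message argument.
    System.out.println(flagged("this.table = Objects.requireNonNull(table);"));
    System.out.println(flagged("if (Objects.nonNull(table)) {"));
    // The preferred form and a method reference are not matched.
    System.out.println(flagged("Preconditions.checkNotNull(table, \"Invalid table: null\");"));
    System.out.println(flagged("ids.stream().filter(Objects::nonNull);"));
  }
}
```

The pattern requires a literal `.` after `Objects`, so method references written with `::` fall outside it.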
@@ -351,7 +351,7 @@
<module name="RegexpSinglelineJava">
<property name="id" value="BanSystemErr"/>
<property name="format" value="System\.err\."/>
<property name="message" value="Logging with System.err is not allowed because it has no metadata and can't be configured at runtime. Please use an SLF4J logger instead, e.g. log.info(&quot;Message&quot;)."/>
<property name="message" value="Logging with System.err is not allowed because it has no metadata and can't be configured at runtime. Please use an SLF4J logger instead, e.g. log.error(&quot;Message&quot;)."/>
<property name="ignoreComments" value="true"/>
</module>
<module name="RegexpSinglelineJava">
@@ -378,21 +378,10 @@
<property name="format" value="(void setUp\(\))|(void setup\(\))|(void setupStatic\(\))|(void setUpStatic\(\))|(void beforeTest\(\))|(void teardown\(\))|(void tearDown\(\))|(void beforeStatic\(\))|(void afterStatic\(\))"/>
<property name="message" value="Test setup/teardown methods are called before(), beforeClass(), after(), afterClass(), but not setUp, teardown, etc."/>
</module>
<module name="RightCurly"> <!-- Java Style Guide: Nonempty blocks: K & R style -->
<property name="option" value="same"/>
<property name="tokens" value="LITERAL_TRY, LITERAL_CATCH, LITERAL_FINALLY, LITERAL_IF, LITERAL_ELSE, LITERAL_DO"/>
</module>
<module name="RightCurly"> <!-- Java Style Guide: Nonempty blocks: K & R style -->
<property name="option" value="alone"/>
<property name="tokens" value="CLASS_DEF, METHOD_DEF, CTOR_DEF, LITERAL_FOR, LITERAL_WHILE, STATIC_INIT, INSTANCE_INIT"/>
</module>
<module name="SeparatorWrap"> <!-- Java Style Guide: Where to break -->
<property name="tokens" value="DOT"/>
<property name="option" value="nl"/>
</module>
<module name="SeparatorWrap"> <!-- Java Style Guide: Where to break -->
<property name="tokens" value="COMMA"/>
<property name="option" value="EOL"/>
<module name="RegexpSinglelineJava">
<property name="ignoreComments" value="true"/>
<property name="format" value="@Test\(.*expected.*\)"/>
<property name="message" value="Prefer using Assertions.assertThatThrownBy(...).isInstanceOf(...) instead."/>
</module>
<module name="SimplifyBooleanExpression"/> <!-- Java Coding Guidelines: Keep Boolean expressions simple -->
<module name="SimplifyBooleanReturn"/> <!-- Java Coding Guidelines: Keep Boolean expressions simple -->
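The new `@Test\(.*expected.*\)` rule steers tests away from JUnit 4's `expected` attribute and toward `Assertions.assertThatThrownBy(...).isInstanceOf(...)`. With `expected`, a test passes if any statement in the method throws the exception, whereas a scoped assertion ties the exception to one call. The sketch below illustrates that scoping with the JDK only; `throwsInstanceOf` is a hypothetical stand-in for the AssertJ assertion, and `parsePositive` is a hypothetical method under test:

```java
public class ScopedExceptionCheck {
  // Hypothetical stand-in for AssertJ's assertThatThrownBy(...).isInstanceOf(...).
  // Returns true only if running the action throws an instance of `expected`.
  public static boolean throwsInstanceOf(Class<? extends Throwable> expected, Runnable action) {
    try {
      action.run();
    } catch (Throwable t) {
      return expected.isInstance(t);
    }
    return false;
  }

  // Hypothetical method under test: rejects zero and negative values.
  public static int parsePositive(String value) {
    int parsed = Integer.parseInt(value);
    if (parsed <= 0) {
      throw new IllegalArgumentException("Invalid value (not positive): " + value);
    }
    return parsed;
  }

  public static void main(String[] args) {
    // The check wraps exactly one call, so an exception raised by setup code
    // elsewhere in the test cannot satisfy it by accident.
    System.out.println(throwsInstanceOf(IllegalArgumentException.class, () -> parsePositive("-1")));
    System.out.println(throwsInstanceOf(IllegalArgumentException.class, () -> parsePositive("7")));
  }
}
```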
@@ -416,16 +405,6 @@
</module>
<module name="UpperEll"/> <!-- Java Style Guide: Numeric Literals -->
<module name="VisibilityModifier"/> <!-- Java Coding Guidelines: Minimize mutability -->
-<module name="WhitespaceAfter"/> <!-- Java Style Guide: Horizontal whitespace -->
-<module name="WhitespaceAround"> <!-- Java Style Guide: Horizontal whitespace -->
-<property name="allowEmptyConstructors" value="true"/>
-<property name="allowEmptyMethods" value="true"/>
-<property name="allowEmptyTypes" value="true"/>
-<property name="allowEmptyLoops" value="true"/>
-<property name="ignoreEnhancedForColon" value="false"/>
-<message key="ws.notFollowed" value="WhitespaceAround: ''{0}'' is not followed by whitespace. Empty blocks may only be represented as '{}' when not part of a multi-block statement (4.1.3)"/>
-<message key="ws.notPreceded" value="WhitespaceAround: ''{0}'' is not preceded with whitespace."/>
-</module>

<!-- Stricter checks begin: delete some or all of the following for faster prototyping, but please restore before pushing to production. -->

@@ -473,7 +452,6 @@
<message key="name.invalidPattern" value="Parameter name ''{0}'' must match pattern ''{1}''."/>
<property name="ignoreOverridden" value="true"/>
</module>
-<module name="SingleLineJavadoc"/> <!-- Java Style Guide: General form -->

<!-- Stricter checks end -->
</module>
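
The two RegexpSinglelineJava rules in the checkstyle configuration above flag a source line whenever their `format` pattern is found anywhere in it. The following is a minimal Python sketch of that behavior, not Checkstyle itself: the two patterns are copied from the diff, while `RULES`, `find_violations`, the rule names, and the sample lines are hypothetical and only for illustration. It does not model `ignoreComments`, and it assumes the patterns behave the same under Python's `re` as under Java's regex engine, which holds for the constructs used here.

```python
import re

# Patterns copied verbatim from the "format" properties in the diff above.
EXPECTED_EXCEPTION = re.compile(r"@Test\(.*expected.*\)")
LEGACY_FIXTURE_NAME = re.compile(
    r"(void setUp\(\))|(void setup\(\))|(void setupStatic\(\))|(void setUpStatic\(\))"
    r"|(void beforeTest\(\))|(void teardown\(\))|(void tearDown\(\))"
    r"|(void beforeStatic\(\))|(void afterStatic\(\))"
)

# Hypothetical rule names; the real rules report the "message" property instead.
RULES = [
    ("expected-exception", EXPECTED_EXCEPTION),
    ("fixture-name", LEGACY_FIXTURE_NAME),
]


def find_violations(lines):
    """Return (line_number, rule_name) pairs for every line a rule matches."""
    hits = []
    for number, line in enumerate(lines, start=1):
        for name, pattern in RULES:
            # search() looks for the pattern anywhere in the line.
            if pattern.search(line):
                hits.append((number, name))
    return hits


# Hypothetical Java source lines used only to exercise the patterns.
SAMPLE = [
    "@Test(expected = IllegalArgumentException.class)",
    "public void testInvalidInput() {",
    "@Before",
    "public void setUp() {",
    "public void before() {",
    "@Test",
]

if __name__ == "__main__":
    for number, name in find_violations(SAMPLE):
        print(f"line {number}: {name}")
```

A plain `@Test` annotation and a method named `before()` pass, while `@Test(expected = ...)` and `setUp()` are reported, which matches the intent stated in the two rule messages.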
23 changes: 14 additions & 9 deletions .baseline/copyright/001_apache-2.0.txt
@@ -1,11 +1,16 @@
-Licensed under the Apache License, Version 2.0 (the "License");
-you may not use this file except in compliance with the License.
-You may obtain a copy of the License at
+Licensed to the Apache Software Foundation (ASF) under one
+or more contributor license agreements. See the NOTICE file
+distributed with this work for additional information
+regarding copyright ownership. The ASF licenses this file
+to you under the Apache License, Version 2.0 (the
+"License"); you may not use this file except in compliance
+with the License. You may obtain a copy of the License at

-http://www.apache.org/licenses/LICENSE-2.0
+http://www.apache.org/licenses/LICENSE-2.0

-Unless required by applicable law or agreed to in writing, software
-distributed under the License is distributed on an "AS IS" BASIS,
-WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-See the License for the specific language governing permissions and
-limitations under the License.
+Unless required by applicable law or agreed to in writing,
+software distributed under the License is distributed on an
+"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+KIND, either express or implied. See the License for the
+specific language governing permissions and limitations
+under the License.
4 changes: 2 additions & 2 deletions .gitattributes
@@ -30,6 +30,6 @@
jitpack.yml export-ignore
/python export-ignore
/python/** export-ignore
-/site export-ignore
-/site/** export-ignore
+/docs export-ignore
+/docs/** export-ignore

52 changes: 52 additions & 0 deletions .github/ISSUE_TEMPLATE/iceberg_bug_report.yml
@@ -0,0 +1,52 @@
---
name: Iceberg Bug report 🐞
description: Problems, bugs and issues with Apache Iceberg
labels: ["kind:bug"]
body:
  - type: dropdown
    attributes:
      label: Apache Iceberg version
      description: What Apache Iceberg version are you using?
      multiple: false
      options:
        - "0.14.0 (latest release)"
        - "0.13.1"
        - "0.13.0"
        - "0.12.1"
        - "0.12.0"
        - "0.11.1"
        - "0.11.0"
        - "main (development)"
    validations:
      required: false
  - type: dropdown
    attributes:
      label: Query engine
      description: Which query engine?
      multiple: false
      options:
        - Spark
        - Trino
        - Starburst
        - Snowflake
        - Dremio
        - Starrocks
        - Doris
        - EMR
        - Athena
        - PrestoDB
        - Flink
        - Impala
        - Hive
        - Other
    validations:
      required: false
  - type: textarea
    attributes:
      label: Please describe the bug 🐞
      description: >
        Please describe the problem, what to expect, and how to reproduce.
        Feel free to include stacktraces and the Iceberg catalog configuration.
        You can include files by dragging and dropping them here.
    validations:
      required: true
33 changes: 33 additions & 0 deletions .github/ISSUE_TEMPLATE/iceberg_improvement.yml
@@ -0,0 +1,33 @@
---
name: Iceberg Improvement / Feature Request
description: New features with Apache Iceberg
labels: ["kind:feature request"]
body:
  - type: textarea
    attributes:
      label: Feature Request / Improvement
      description: Please describe the feature and elaborate on the use case and motivation behind it
    validations:
      required: true
  - type: dropdown
    attributes:
      label: Query engine
      description: Which query engine?
      multiple: false
      options:
        - Spark
        - Trino
        - Starburst
        - Snowflake
        - Dremio
        - Starrocks
        - Doris
        - EMR
        - Athena
        - PrestoDB
        - Flink
        - Impala
        - Hive
        - Other
    validations:
      required: false
18 changes: 18 additions & 0 deletions .github/ISSUE_TEMPLATE/iceberg_question.yml
@@ -0,0 +1,18 @@
---
name: Iceberg Question
description: Questions around Apache Iceberg
labels: ["kind:question"]
body:
  - type: markdown
    attributes:
      value: "Feel free to ask your question on [Slack](https://join.slack.com/t/apache-iceberg/shared_invite/zt-tlv0zjz6-jGJEkHfb1~heMCJA3Uycrg) as well."
  - type: textarea
    attributes:
      label: Query engine
      description: Which query engine (Flink, Spark, EMR, etc) are you using?
  - type: textarea
    attributes:
      label: Question
      description: What is your question?
    validations:
      required: true
27 changes: 27 additions & 0 deletions .github/dependabot.yml
@@ -0,0 +1,27 @@
#
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.
#

version: 2
updates:
  - package-ecosystem: "pip"
    directory: "/python/"
    schedule:
      interval: "weekly"
      day: "sunday"
    open-pull-requests-limit: 5
7 changes: 6 additions & 1 deletion .github/labeler.yml
@@ -32,9 +32,10 @@ BUILD:
   - "**/*gradle*"
   - versions.props
 DOCS:
-  - site/**/*
+  - docs/**/*
   - "**/*CHANGELOG.md"
   - "**/*README.md"
+  - "**/*CONTRIBUTING.md"
 EXAMPLES:
   - examples/**/*
 COMMON:
@@ -78,3 +79,7 @@ NESSIE:
   - nessie/**/*
 ALIYUN:
   - aliyun/**/*
+GCP:
+  - gcp/**/*
+DELL:
+  - dell/**/*
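
The labeler configuration above maps glob patterns to pull-request labels. As a rough illustration of how changed paths would be labeled after this change, here is a minimal Python sketch. It is not the matcher the labeler action actually uses: `glob_to_regex` and `labels_for` are hypothetical helpers, the glob semantics are an assumption (`**/` matches zero or more directories, `*` matches within one path segment), and `LABEL_GLOBS` contains only the entries fully visible in this diff. The example paths are made up.

```python
import re

# Label -> glob patterns, copied from the entries visible in the diff above.
LABEL_GLOBS = {
    "DOCS": ["docs/**/*", "**/*CHANGELOG.md", "**/*README.md", "**/*CONTRIBUTING.md"],
    "NESSIE": ["nessie/**/*"],
    "ALIYUN": ["aliyun/**/*"],
    "GCP": ["gcp/**/*"],
    "DELL": ["dell/**/*"],
}


def glob_to_regex(pattern):
    """Translate a small glob subset into a compiled regex (assumed semantics)."""
    parts = []
    i = 0
    while i < len(pattern):
        if pattern.startswith("**/", i):
            parts.append("(?:.*/)?")  # zero or more leading directories
            i += 3
        elif pattern[i] == "*":
            parts.append("[^/]*")  # any characters within one path segment
            i += 1
        else:
            parts.append(re.escape(pattern[i]))
            i += 1
    return re.compile("".join(parts))


def labels_for(path):
    """Return the sorted labels whose patterns match the whole path."""
    return sorted(
        label
        for label, globs in LABEL_GLOBS.items()
        if any(glob_to_regex(g).fullmatch(path) for g in globs)
    )


if __name__ == "__main__":
    # Hypothetical changed paths.
    for changed in ["docs/spark/configuration.md", "gcp/README.md", "core/src/Foo.java"]:
        print(changed, labels_for(changed))
```

Under these assumptions a file can receive several labels at once: a README inside `gcp/` matches both the `GCP` directory pattern and the `DOCS` README pattern.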