Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
283 commits
Select commit Hold shift + click to select a range
ab0da55
Fix: fetch is lost in replace_order_preserving_variants method during…
xudong963 Apr 23, 2025
230f31b
Speed up `optimize_projection` (#15787)
xudong963 Apr 23, 2025
e41c02c
Support WITHIN GROUP syntax to standardize certain existing aggregate…
Garamda Apr 23, 2025
323c939
docs: add ArkFlow (#15826)
chenquan Apr 23, 2025
03fa3b9
chore(deps): bump env_logger from 0.11.7 to 0.11.8 (#15823)
dependabot[bot] Apr 23, 2025
07a310f
Support unparsing `UNION` for distinct results (#15814)
phillipleblanc Apr 23, 2025
5d98d2c
Add `MemoryPool::memory_limit` to expose setting memory usage limit …
Rachelint Apr 24, 2025
4fd295d
Preserve projection for inline scan (#15825)
jayzhan211 Apr 24, 2025
377483f
cleanup after emit (#15834)
jayzhan211 Apr 24, 2025
84513a4
chore(deps): bump pyo3 from 0.24.1 to 0.24.2 (#15838)
dependabot[bot] Apr 24, 2025
5eb0968
Fix: fetch is missing in `EnforceSorting` optimizer (two places) (#15…
xudong963 Apr 24, 2025
f85535b
Minor: fix potential flaky test in aggregate.slt (#15829)
bikbov Apr 24, 2025
e0fd892
Fix `ILIKE` expression support in SQL unparser (#15820)
ewgenius Apr 24, 2025
8f5158a
Make `Diagnostic` easy/convinient to attach by using macro and avoidi…
logan-keede Apr 24, 2025
11088b9
Feature/benchmark config from env (#15782)
ctsk Apr 24, 2025
1fe856b
predicate pruning: support cast and try_cast for more types (#15764)
adriangb Apr 24, 2025
b8b6214
Fix: fetch is missing in plan_with_order_breaking_variants method (#1…
xudong963 Apr 25, 2025
4ac9b55
Fix `CoalescePartitionsExec` proto serialization (#15824)
lewiszlw Apr 25, 2025
8b91f9a
Fix build (#15849)
lewiszlw Apr 25, 2025
00617a0
Fix scalar list comparison when the compared lists have different len…
gabotechs Apr 25, 2025
6bf2326
chore: More details to `No UDF registered` error (#15843)
comphead Apr 25, 2025
1af8566
chore(deps): bump clap from 4.5.36 to 4.5.37 (#15853)
dependabot[bot] Apr 25, 2025
611cebf
Remove usage of `dbg!` (#15858)
phillipleblanc Apr 25, 2025
40ed90e
Fix `from_unixtime` function documentation (#15844)
Viicos Apr 25, 2025
96a2086
Minor: Interval singleton (#15859)
jayzhan211 Apr 26, 2025
74dc419
Make aggr fuzzer query builder more configurable (#15851)
Rachelint Apr 27, 2025
d2a2a8b
chore(deps): bump aws-config from 1.6.1 to 1.6.2 (#15874)
dependabot[bot] Apr 28, 2025
cc65b72
Add slt tests for `datafusion.execution.parquet.coerce_int96` setting…
alamb Apr 28, 2025
81b4c07
Improve `ListingTable` / `ListingTableOptions` docs (#15767)
alamb Apr 28, 2025
b782cff
Upgrade-guide: Downgrade "FileScanConfig –> FileScanConfigBuilder" he…
simonvandel Apr 28, 2025
ff404cd
Migrate Optimizer tests to insta, part2 (#15884)
qstommyshu Apr 29, 2025
f52d9b9
fix: Avoid mistaken ILike to string equality optimization (#15836)
srh Apr 29, 2025
b2c210a
Improve documentation for `FileSource`, `DataSource` and `DataSourceE…
alamb Apr 29, 2025
5e1214c
allow min max dictionary (#15827)
XiangpengHao Apr 29, 2025
54302ac
Map file-level column statistics to the table-level (#15865)
xudong963 Apr 29, 2025
01f4424
chore(deps): bump blake3 from 1.8.1 to 1.8.2 (#15890)
dependabot[bot] Apr 29, 2025
5bb0a98
Respect ignore_nulls in array_agg (#15544)
joroKr21 Apr 29, 2025
e1ad767
Set HashJoin seed (#15783)
ctsk Apr 29, 2025
2d80194
Add Extension Type / Metadata support for Scalar UDFs (#15646)
timsaucer Apr 29, 2025
2d27ce4
chore: fix clippy::large_enum_variant for DataFusionError (#15861)
rroelke Apr 29, 2025
e44ae0a
Saner handling of nulls inside arrays (#15149)
joroKr21 Apr 29, 2025
324be53
Feat: introduce `ExecutionPlan::partition_statistics` API (#15852)
xudong963 Apr 29, 2025
1e756b3
Keeping pull request in sync with the base branch (#15894)
xudong963 Apr 30, 2025
a50a525
fix: cast inner fsl to list in flatten (#15898)
gstvg Apr 30, 2025
7b370e2
support OR operator in binary `evaluate_bounds` (#15716)
davidhewitt Apr 30, 2025
bc41cee
feat: Add option to adjust writer buffer size for query output (#15747)
m09526 Apr 30, 2025
af99b54
infer placeholder datatype for IN lists (#15864)
kczimm Apr 30, 2025
2f454ae
Update known users (#15895)
comphead Apr 30, 2025
5fe8fa4
fix(avro): Respect projection order in Avro reader (#15840)
nantunes Apr 30, 2025
e1cc80c
Fix allow_update_branch (#15904)
xudong963 May 1, 2025
026a2b1
fix: correctly specify the nullability of `map_values` return type (#…
rluvaton May 1, 2025
a726e5a
Add `union_tag` scalar function (#14687)
gstvg May 1, 2025
e230941
chore(deps): bump tokio from 1.44.1 to 1.44.2 (#15900)
dependabot[bot] May 1, 2025
061ee09
chore(deps): bump assert_cmd from 2.0.16 to 2.0.17 (#15909)
dependabot[bot] May 1, 2025
185a02d
Factor out Substrait consumers into separate files (#15794)
gabotechs May 1, 2025
a655ec7
Unparse `UNNEST` projection with the table column alias (#15879)
goldmedal May 1, 2025
6bda479
feat: Add `datafusion-spark` crate (#15168)
shehabgamin May 1, 2025
eeea69d
Fix typo in introduction.md (#15910)
tom-mont May 2, 2025
f99f8da
Fix CI in main (#15917)
blaginin May 2, 2025
b90e2a0
Migrate Optimizer tests to insta, part3 (#15893)
qstommyshu May 2, 2025
f40e0db
Add `FormatOptions` to Config (#15793)
blaginin May 2, 2025
47518aa
Minor: cleanup datafusion-spark scalar functions (#15921)
alamb May 2, 2025
e467644
Fix ClickBench extended queries after update to APPROX_PERCENTILE_CON…
alamb May 3, 2025
c1a4957
fix: SqlLogicTest on Windows (#15932)
nuno-faria May 3, 2025
b4b77e9
docs: Label `bloom_filter_on_read` as a reading config (#15933)
nuno-faria May 3, 2025
6cc514b
Add extended query for checking improvement for blocked groups optimi…
Rachelint May 3, 2025
e3e5d19
Character length (#15931)
Dandandan May 3, 2025
8c42a65
chore(deps): bump tokio-util from 0.7.14 to 0.7.15 (#15918)
dependabot[bot] May 3, 2025
6cc4953
Migrate Optimizer tests to insta, part4 (#15937)
qstommyshu May 5, 2025
9c1395f
fix query results for predicates referencing partition columns and da…
adriangb May 5, 2025
0c92091
chore(deps): bump substrait from 0.55.0 to 0.55.1 (#15941)
dependabot[bot] May 5, 2025
655d63f
feat: create helpers to set the max_temp_directory_size (#15919)
jdrouet May 5, 2025
6ba4152
Fix main CI (#15942)
xudong963 May 5, 2025
90fbb98
Improve sqllogictest error reporting (#15905)
gabotechs May 5, 2025
a39c07a
refactor filter pushdown apis (#15801)
adriangb May 5, 2025
555fc2e
fix: fold cast null to substrait typed null (#15854)
discord9 May 5, 2025
95df9ee
Add additional tests for filter pushdown apis (#15955)
adriangb May 6, 2025
48331e6
Improve filter pushdown optimizer rule performance (#15959)
adriangb May 6, 2025
2d1551d
feat: ORDER BY ALL (#15772)
PokIsemaine May 6, 2025
55ba4ca
Implement Parquet filter pushdown via new filter pushdown APIs (#15769)
adriangb May 6, 2025
6ce109b
Reduce rehashing cost for primitive grouping by also reusing hash val…
Rachelint May 6, 2025
09a7a2a
chore(deps): bump chrono from 0.4.40 to 0.4.41 (#15956)
dependabot[bot] May 6, 2025
e60b260
feat: support min/max for struct (#15667)
chenkovsky May 6, 2025
0ebe473
refactor: replace `unwrap_or` with `unwrap_or_else` for improved lazy…
NevroHelios May 6, 2025
7e89862
add benchmark code for `Reuse rows in row cursor stream` (#15913)
acking-you May 6, 2025
f12ba60
Update-docs_pr.yaml (#15966)
Adez017 May 7, 2025
5bdaeaf
Segfault in ByteGroupValueBuilder (#15968)
thinkharderdev May 7, 2025
c52237d
make can_expr_be_pushed_down_with_schemas public again (#15971)
adriangb May 7, 2025
a942263
re-export can_expr_be_pushed_down_with_schemas to be public (#15974)
adriangb May 7, 2025
8d79bdd
Migrate Optimizer tests to insta, part5 (#15945)
qstommyshu May 7, 2025
e1e4c7a
Show LogicalType name for `INFORMATION_SCHEMA` (#15965)
goldmedal May 7, 2025
97add6d
chore(deps): bump sha2 from 0.10.8 to 0.10.9 (#15970)
dependabot[bot] May 7, 2025
f5454c6
refactor: remove deprecated `ParquetExec` (#15973)
miroim May 8, 2025
7b920a7
chore(deps): bump insta from 1.42.2 to 1.43.1 (#15988)
dependabot[bot] May 8, 2025
d01082e
[datafusion-spark] Add Spark-compatible hex function (#15947)
andygrove May 8, 2025
d85e04f
refactor: remove deprecated AvroExec (#15987)
miroim May 8, 2025
70109bb
Substrait: Handle inner map fields in schema renaming (#15869)
cht42 May 8, 2025
4e26949
refactor: remove deprecated CsvExec (#15991)
miroim May 8, 2025
30bb548
Migrate Optimizer tests to insta, part6 (#15984)
qstommyshu May 9, 2025
a2e3c92
chore(deps): bump nix from 0.29.0 to 0.30.1 (#16002)
dependabot[bot] May 9, 2025
e917167
Implement RightSemi join for SortMergeJoin (#15972)
irenjj May 9, 2025
58f61c8
Enable repartitioning on MemTable. (#15409)
wiedld May 10, 2025
967384e
Migrate Optimizer tests to insta, part7 (#16010)
qstommyshu May 10, 2025
808e673
chore(deps): bump sysinfo from 0.34.2 to 0.35.1 (#16027)
dependabot[bot] May 12, 2025
f7c7d92
Fix: `build_predicate_expression` method doesn't process `false` expr…
xudong963 May 12, 2025
396e0d5
refactor: move should_enable_page_index from mod.rs to opener.rs (#1…
miroim May 12, 2025
3910073
fix: add an "expr_planners" method to SessionState (#15119)
niebayes May 12, 2025
547c4a7
Updated extending operators documentation (#15612)
the0ninjas May 12, 2025
7af93e2
feat(proto): udf decoding fallback (#15997)
leoyvens May 12, 2025
56c1a09
chore: Replace MSRV link on main page with Github badge (#16020)
comphead May 12, 2025
4098060
Add note to upgrade guide for removal of `ParquetExec`, `AvroExec`, `…
alamb May 13, 2025
f6ded55
refactor: remove deprecated ArrowExec (#16006)
miroim May 13, 2025
01ce04b
refactor: remove deprecated MemoryExec (#16007)
miroim May 13, 2025
4dee60b
refactor: remove deprecated JsonExec (#16005)
miroim May 13, 2025
7a3d020
chore(deps): bump sqllogictest from 0.28.1 to 0.28.2 (#16037)
dependabot[bot] May 13, 2025
9d06baf
chores: Add lint rule to enforce string formatting style (#16024)
Lordworms May 13, 2025
aa8dfd9
Use human-readable byte sizes in EXPLAIN (#16043)
tlm365 May 14, 2025
828ee5a
Docs: Add example of creating a field in `return_field_from_args` (#1…
alamb May 14, 2025
e2a5c1e
Support `MIN` and `MAX` for `DataType::List` (#16025)
gabotechs May 14, 2025
d6fe1de
fix: overcounting of memory in first/last. (#15924)
ashdnazg May 14, 2025
4efbeee
Improve docs for Exprs and scalar functions (#16036)
alamb May 14, 2025
8f898a7
Add h2o window benchmark (#16003)
2010YOUY01 May 14, 2025
d5ecaca
fix: track coalescer's consumption (#16048)
waynexia May 14, 2025
51452d6
Fix Infer prepare statement type tests (#15743)
brayanjuls May 15, 2025
2f67a7e
fix: Clarify that it is only the name of the field that is ignored (#…
alamb May 15, 2025
c74faee
style: simplify some strings for readability (#15999)
hamirmahal May 15, 2025
07fe23f
support simple/cross lateral joins (#16015)
jayzhan211 May 16, 2025
69dfe6c
Make error msg for oom human readable (#16050)
ding-young May 16, 2025
f9326f0
chore(deps): bump the arrow-parquet group with 7 updates (#16047)
dependabot[bot] May 16, 2025
64fb51f
chore(deps): bump petgraph from 0.7.1 to 0.8.1 (#15669)
dependabot[bot] May 16, 2025
3e30f77
[datafusion-spark] Add Spark-compatible `char` expression (#15994)
andygrove May 16, 2025
e74ae7d
[Docs]: Added SQL example for all window functions (#16074)
Adez017 May 19, 2025
fb8905f
chore(deps): bump substrait from 0.55.1 to 0.56.0 (#16091)
dependabot[bot] May 19, 2025
4c719af
Add test for collect_statistics (#16098)
alamb May 19, 2025
e3e7d50
Add window function examples in code (#16102)
alamb May 19, 2025
777a318
Refactor substrait producer into multiple files (#16089)
gabotechs May 19, 2025
8be2ea5
Fix temp dir leak in tests (#16094)
findepi May 19, 2025
bbf0f3d
Label Spark functions PRs with spark label (#16095)
findepi May 19, 2025
62fa67c
feat: add slt tests for imdb data (#16067)
kumarlokesh May 19, 2025
2eaac22
fix: stack overflow for substrait functions with large argument lists…
fmonjalet May 19, 2025
8de90b1
chore: Remove SMJ experimental status (#16072)
comphead May 19, 2025
2ea1e95
chore(CI) Update workspace / CI to Rust 1.87 (#16068)
kadai0308 May 19, 2025
ca46932
minor: Add benchmark query and corresponding documentation for Averag…
logan-keede May 19, 2025
577c424
feat: metadata handling for aggregates and window functions (#15911)
timsaucer May 19, 2025
4417d5c
doc: fix indent format explain (#16085)
chenkovsky May 20, 2025
efd9587
fix: coerce int96 resolution inside of list, struct, and map types (#…
mbutrovich May 20, 2025
8d9c0f6
Update documentation for `datafusion.execution.collect_statistics` (#…
alamb May 20, 2025
3fa111e
fix: Add coercion rules for Float16 types (#15816)
etseidl May 20, 2025
8c2264c
Use qualified names on DELETE selections (#16033)
nuno-faria May 20, 2025
e6d4905
chore(deps): bump testcontainers from 0.23.3 to 0.24.0 (#15989)
dependabot[bot] May 20, 2025
963b649
feat: make error handling in indent explain consistent with that in t…
chenkovsky May 20, 2025
9ec679b
Clean up ExternalSorter and use upstream converter (#16109)
alamb May 20, 2025
52f340b
Support `GroupsAccumulator` for Avg duration (#15748)
shruti2522 May 21, 2025
febc77e
Test Duration in `fuzz` tests (#16111)
alamb May 21, 2025
46d3f52
Move PruningStatistics into datafusion::common (#16069)
adriangb May 21, 2025
ca55f1c
Revert use file schema in parquet pruning (#16086)
adriangb May 21, 2025
6cf74d6
Make `SessionContext::register_parquet` obey `collect_statistics` con…
adriangb May 21, 2025
0589dbb
fix: describe escaped quoted identifiers (#16082)
jfahne May 21, 2025
4597d3b
Minor: Add `ScalarFunctionArgs::return_type` method (#16113)
alamb May 21, 2025
ce3e387
feat: coerce from fixed size binary to binary view (#16110)
chenkovsky May 21, 2025
dc45f60
Improve the DML / DDL Documentation (#16115)
alamb May 21, 2025
40fca47
Fix `contains` function expression (#16046)
liamzwbao May 21, 2025
37c266a
Optimize performance of `string::ascii` function (#16087)
tlm365 May 21, 2025
0b6678b
chore: Use materialized data for filter pushdown tests (#16123)
comphead May 21, 2025
5669500
chore: Upgrade rand crate and some other minor crates (#16062)
comphead May 21, 2025
d22e293
Include data types in logical plans of inferred prepare statements (#…
brayanjuls May 21, 2025
39063f6
docs: Fix typos and minor grammatical issues in Architecture docs (#1…
patrickcsullivan May 22, 2025
cb45f1f
add top-memory-consumers option in cli (#16081)
ding-young May 22, 2025
67a2173
fix ci extended test (#16144)
2010YOUY01 May 22, 2025
e5f596b
Fix: handle column name collisions when combining UNION logical input…
LiaCastaneda May 22, 2025
5293b70
adding support for Min/Max over LargeList and FixedSizeList (#16071)
logan-keede May 22, 2025
dc8161e
Move prepare/parameter handling tests into `params.rs` (#16141)
liamzwbao May 22, 2025
ce835da
Add `StateFieldsArgs::return_field` (#16112)
alamb May 22, 2025
17fe504
Support filtering specific sqllogictests identified by line number (#…
gabotechs May 23, 2025
e305353
Enrich GroupedHashAggregateStream name to ease debugging Resources ex…
ahmed-mez May 24, 2025
a256eb4
chore(deps): bump uuid from 1.16.0 to 1.17.0 (#16162)
dependabot[bot] May 24, 2025
2afa3aa
Minor: Fix links in substrait readme (#16156)
alamb May 24, 2025
2199e50
Remove Filter::having field (#16154)
findepi May 24, 2025
9031d6b
Clarify docs and names in parquet predicate pushdown tests (#16155)
alamb May 24, 2025
d4218fd
Minor: Fix name() for FilterPushdown physical optimizer rule (#16175)
adriangb May 24, 2025
2add813
migrate tests in `pool.rs` to use insta (#16145)
lifan-ake May 24, 2025
af67caa
refactor(optimizer): add `.with_schema` for defining test tables (#16…
atahanyorganci May 24, 2025
dacdda2
[Minor] Speedup TPC-H benchmark run with memtable option (#16159)
Dandandan May 24, 2025
34f250a
Fast path for joins with distinct values in build side (#16153)
Dandandan May 24, 2025
3b551e9
chore: Reduce repetition in the parameter type inference tests (#16079)
jsai28 May 25, 2025
605ccbd
feat: array_length for fixed size list (#16167)
chenkovsky May 25, 2025
c5df6ee
fix: remove trailing whitespace in `Display` for `LogicalPlan::Projec…
atahanyorganci May 26, 2025
f3aed4a
chore(deps): bump tokio from 1.45.0 to 1.45.1 (#16190)
dependabot[bot] May 26, 2025
16c7939
Improve `unproject_sort_expr` to handle arbitrary expressions (#16127)
phillipleblanc May 27, 2025
2d12bf6
chore(deps): bump rustyline from 15.0.0 to 16.0.0 (#16194)
dependabot[bot] May 27, 2025
260a28a
feat: ADD sha2 spark function (#16168)
getChan May 27, 2025
db0ab74
Add macro for creating DataFrame (#16090) (#16104)
cj-zhukov May 27, 2025
68e26f1
migrate `logical_plan` tests to insta (#16184)
lifan-ake May 27, 2025
aaae4d7
doc: Move `dataframe!` example into dedicated example (#16197)
comphead May 27, 2025
a92c5e9
chore(deps): bump clap from 4.5.38 to 4.5.39 (#16204)
dependabot[bot] May 28, 2025
7d3c7d8
implement `AggregateExec.partition_statistics` (#15954)
UBarney May 28, 2025
081e95c
doc: add diagram to describe how DataSource, FileSource, and DataSour…
onlyjackfrost May 28, 2025
33a2531
Clarify documentation about gathering statistics for parquet files (#…
alamb May 28, 2025
56a2af7
Propagate .execute() calls immediately in `RepartitionExec` (#16093)
gabotechs May 28, 2025
00132da
Shift from Field to FieldRef for all user defined functions (#16122)
timsaucer May 28, 2025
7002a00
Set aggregation hash seed (#16165)
ctsk May 28, 2025
5944e8b
feat: create builder for disk manager (#16191)
jdrouet May 29, 2025
72f4eab
Fix ScalarStructBuilder::build() for an empty struct (#16205)
Blizzara May 29, 2025
2c2f225
Return an error on overflow in `do_append_val_inner` (#16201)
liamzwbao May 29, 2025
21248fb
Change default SQL mapping for `VARCAHR` from `Utf8` to `Utf8View` (…
zhuqi-lucas May 30, 2025
795988d
chore(deps): bump testcontainers-modules from 0.12.0 to 0.12.1 (#16212)
dependabot[bot] May 30, 2025
9a5c61d
Substrait: handle identical grouping expressions (#16189)
cht42 May 30, 2025
c6e5c91
Add new stats pruning helpers to allow combining partition values in …
adriangb May 30, 2025
900279c
Implement schema adapter support for FileSource and add integration t…
kosiew May 30, 2025
376eb03
Minor: update documentation for PrunableStatistics (#16213)
alamb May 30, 2025
6844e56
Minor: Remove dead code (#16215)
alamb May 31, 2025
5a86142
Add change to VARCHAR in the upgrade guide (#16216)
alamb Jun 1, 2025
7248259
Reduce size of `Expr` struct (#16207)
hendrikmakait Jun 2, 2025
6f1ca29
fix: metadata of join schema (#16221)
chenkovsky Jun 2, 2025
8b9b2fc
fix: add missing row count limits to TPC-H queries (#16230)
0ax1 Jun 3, 2025
5b08b84
Remove use of deprecated dict_ordered in datafusion-proto (#16218) (#…
cj-zhukov Jun 3, 2025
deeff88
debug cargo command in bench script (#16236)
2010YOUY01 Jun 3, 2025
d68111d
Add iceberg-rust to user list (#16246)
jonathanc-n Jun 4, 2025
3236cc0
Simplify FileSource / SchemaAdapterFactory API (#16214)
alamb Jun 4, 2025
992d156
Prepare for 48.0.0 release: Version and Changelog (#16238)
xudong963 Jun 4, 2025
f513e2c
Add dicts to aggregation fuzz testing (#16232)
blaginin Jun 4, 2025
ffbc3a7
chore(deps): bump sysinfo from 0.35.1 to 0.35.2 (#16247)
dependabot[bot] Jun 4, 2025
0c30374
Improve performance of constant aggregate window expression (#16234)
suibianwanwank Jun 4, 2025
abbf73d
Support compound identifier when parsing tuples (#16225)
hozan23 Jun 4, 2025
bf7859e
Schema adapter helper (#16108)
kosiew Jun 4, 2025
9ae41b1
Update tpch, clickbench, sort_tpch to mark failed queries (#16182)
ding-young Jun 5, 2025
448c985
Adjust slttest to pass without RUST_BACKTRACE enabled (#16251)
alamb Jun 5, 2025
ab8cd8c
fix: NaN semantics in GROUP BY (#16256)
chenkovsky Jun 5, 2025
5e307b3
Handle dicts for distinct count (#15871)
blaginin Jun 5, 2025
eeee6b0
Add `--substrait-round-trip` option in sqllogictests (#16183)
gabotechs Jun 5, 2025
25727d4
Minor: fix upgrade papercut where structure was moved (#16264)
alamb Jun 5, 2025
5d3ed9c
feat: Add Aggregate UDF to FFI crate (#14775)
timsaucer Jun 5, 2025
2c8241a
feat(small): Add `BaselineMetrics` to `generate_series()` table funct…
2010YOUY01 Jun 5, 2025
2a7f64a
feat: Add Window UDFs to FFI Crate (#16261)
timsaucer Jun 5, 2025
85f6621
Chore: update DF48 changelog (#16269)
xudong963 Jun 6, 2025
c76c1f0
fix: [branch-48] Revert "Improve performance of constant aggregate wi…
andygrove Jun 6, 2025
b5dfdbe
feat: add metadata to literal expressions (#16170) (#16315)
andygrove Jun 7, 2025
33a32d4
[branch-48] Update CHANGELOG for latest 48.0.0 release (#16314)
alamb Jun 7, 2025
7b31676
Fix parquet filter_pushdown: respect parquet filter pushdown config i…
alamb Jul 3, 2025
d43ddf1
[branch-48] Set the default value of datafusion.execution.collect_sta…
blaginin Jul 4, 2025
bcb8dc5
fix: column indices in FFI partition evaluator (#16480) (#16657)
alamb Jul 4, 2025
1dbf5c5
[branch-48] Prepare 48.0.1 ad CHANGELOG (#16679)
alamb Jul 4, 2025
f03eda5
Merge branch 'spiceai-48' into upstream-48.0.1
mach-kernel Jul 29, 2025
1fe6e3d
[reconcile upstream] ecp -> pcp facade changes
mach-kernel Jul 29, 2025
5eff4c2
[reconcile upstream] deprecation notice for filescanconfig with_metad…
mach-kernel Jul 29, 2025
648e3d1
[reconcile upstream] spi changes for pruning/physical plan
mach-kernel Jul 29, 2025
604463b
[reconcile changes] spi physical/pruning -> examples
mach-kernel Jul 29, 2025
d514acb
fix assertion: it seems that this depended on some sort behavior that…
mach-kernel Jul 29, 2025
a34bcf3
update snapshot: this appears to be the result of common_subexpr_elim…
mach-kernel Jul 29, 2025
62687cc
lint
mach-kernel Jul 29, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
3 changes: 3 additions & 0 deletions .asf.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -50,6 +50,9 @@ github:
main:
required_pull_request_reviews:
required_approving_review_count: 1
pull_requests:
# enable updating head branches of pull requests
allow_update_branch: true

# publishes the content of the `asf-site` branch to
# https://datafusion.apache.org/
Expand Down
21 changes: 1 addition & 20 deletions .github/workflows/docs_pr.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -34,26 +34,7 @@ on:
workflow_dispatch:

jobs:
# Run doc tests
linux-test-doc:
name: cargo doctest (amd64)
runs-on: ubuntu-latest
container:
image: amd64/rust
steps:
- uses: actions/checkout@v4
with:
submodules: true
fetch-depth: 1
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
rust-version: stable
- name: Run doctests (embedded rust examples)
run: cargo test --doc --features avro,json
- name: Verify Working Directory Clean
run: git diff --exit-code


# Test doc build
linux-test-doc-build:
name: Test doc build
Expand Down
12 changes: 11 additions & 1 deletion .github/workflows/extended.yml
Original file line number Diff line number Diff line change
Expand Up @@ -101,7 +101,17 @@ jobs:
- name: Run tests (excluding doctests)
env:
RUST_BACKTRACE: 1
run: cargo test --profile ci --exclude datafusion-examples --exclude datafusion-benchmarks --workspace --lib --tests --bins --features avro,json,backtrace,extended_tests,recursive_protection
run: |
cargo test \
--profile ci \
--exclude datafusion-examples \
--exclude datafusion-benchmarks \
--exclude datafusion-cli \
--workspace \
--lib \
--tests \
--bins \
--features avro,json,backtrace,extended_tests,recursive_protection
- name: Verify Working Directory Clean
run: git diff --exit-code
- name: Cleanup
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -49,7 +49,7 @@ jobs:
uses: actions/[email protected]
with:
repo-token: ${{ secrets.GITHUB_TOKEN }}
configuration-path: .github/workflows/dev_pr/labeler.yml
configuration-path: .github/workflows/labeler/labeler-config.yml
sync-labels: true

# TODO: Enable this when eps1lon/actions-label-merge-conflict is available.
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -41,7 +41,7 @@ physical-expr:

physical-plan:
- changed-files:
- any-glob-to-any-file: [datafusion/physical-plan/**/*']
- any-glob-to-any-file: ['datafusion/physical-plan/**/*']


catalog:
Expand Down Expand Up @@ -77,6 +77,10 @@ proto:
- changed-files:
- any-glob-to-any-file: ['datafusion/proto/**/*', 'datafusion/proto-common/**/*']

spark:
- changed-files:
- any-glob-to-any-file: ['datafusion/spark/**/*']

substrait:
- changed-files:
- any-glob-to-any-file: ['datafusion/substrait/**/*']
Expand Down
31 changes: 29 additions & 2 deletions .github/workflows/rust.yml
Original file line number Diff line number Diff line change
Expand Up @@ -401,8 +401,8 @@ jobs:
- name: Run tests with headless mode
working-directory: ./datafusion/wasmtest
run: |
wasm-pack test --headless --firefox
wasm-pack test --headless --chrome --chromedriver $CHROMEWEBDRIVER/chromedriver
RUSTFLAGS='--cfg getrandom_backend="wasm_js"' wasm-pack test --headless --firefox
RUSTFLAGS='--cfg getrandom_backend="wasm_js"' wasm-pack test --headless --chrome --chromedriver $CHROMEWEBDRIVER/chromedriver

# verify that the benchmark queries return the correct results
verify-benchmark-results:
Expand Down Expand Up @@ -476,6 +476,28 @@ jobs:
POSTGRES_HOST: postgres
POSTGRES_PORT: ${{ job.services.postgres.ports[5432] }}

sqllogictest-substrait:
name: "Run sqllogictest in Substrait round-trip mode"
needs: linux-build-lib
runs-on: ubuntu-latest
container:
image: amd64/rust
steps:
- uses: actions/checkout@v4
with:
submodules: true
fetch-depth: 1
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
rust-version: stable
- name: Run sqllogictest
# TODO: Right now several tests are failing in Substrait round-trip mode, so this
# command cannot be run for all the .slt files. Run it for just one that works (limit.slt)
# until most of the tickets in https://github.com/apache/datafusion/issues/16248 are addressed
# and this command can be run without filters.
run: cargo test --test sqllogictests -- --substrait-round-trip limit.slt

# Temporarily commenting out the Windows flow, the reason is enormously slow running build
# Waiting for new Windows 2025 github runner
# Details: https://github.com/apache/datafusion/issues/13726
Expand Down Expand Up @@ -693,6 +715,11 @@ jobs:
# If you encounter an error, run './dev/update_function_docs.sh' and commit
./dev/update_function_docs.sh
git diff --exit-code
- name: Check if runtime_configs.md has been modified
run: |
# If you encounter an error, run './dev/update_runtime_config_docs.sh' and commit
./dev/update_runtime_config_docs.sh
git diff --exit-code

# Verify MSRV for the crates which are directly used by other projects:
# - datafusion
Expand Down
Loading