Releases: Eventual-Inc/Daft

v0.6.1

10 Sep 17:37
778eba6

What's Changed 🚀

✨ Features

🐛 Bug Fixes

🚀 Performance

  • perf: Use count_rows on fragment to reduce lance scans with limit pushdowns only @colin-ho (#5120)

📖 Documentation

  • docs: improve text readability on examples page @ykdojo (#5182)
  • docs: add TrendShift badge to README @ykdojo (#5181)
  • docs: improve explode method documentation with null/empty list examples @ykdojo (#5164)
  • docs: fix broken tutorial links and remove redundant file @ykdojo (#5154)

👷 CI

🔧 Maintenance

  • chore: add PR conventions note to AGENTS.md @ykdojo (#5166)

Full Changelog: v0.6.0...v0.6.1

v0.6.0

04 Sep 04:26
ef36735

What's Changed 🚀

v0.6.0 marks the official release of Flotilla, our new Ray-based distributed engine! If you already use the Ray runner, you do not need to change anything: setting the DAFT_RUNNER=ray environment variable, or calling daft.context.set_runner_ray() within your Python program, now uses Flotilla by default.

All operations except cross join, sort merge join, and pivot are currently supported, and we will be adding support for those soon! If you need the legacy Ray runner, call daft.set_execution_config(use_legacy_ray_runner=True).
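The two ways of selecting the runner, plus the legacy opt-out, can be sketched roughly as follows. This is a minimal illustration rather than official guidance: the helper names select_ray_runner_via_env and select_ray_runner_in_code are hypothetical, and applying the execution config before selecting the runner is an assumption about ordering, not something these notes specify. The Daft calls themselves are the ones named above.

```python
import os


def select_ray_runner_via_env() -> None:
    """Option 1 (hypothetical helper): choose the Ray runner via the environment.

    Intended to run before Daft executes anything, so the variable is
    already set when the runner is chosen.
    """
    os.environ["DAFT_RUNNER"] = "ray"


def select_ray_runner_in_code(use_legacy: bool = False) -> None:
    """Option 2 (hypothetical helper): choose the Ray runner from Python."""
    import daft  # lazy import; needs Daft installed with Ray support

    if use_legacy:
        # Opt back into the pre-Flotilla Ray runner, per the release notes.
        # Assumption: this is applied before the runner is selected.
        daft.set_execution_config(use_legacy_ray_runner=True)
    daft.context.set_runner_ray()


# Only the environment-variable route is exercised here, since it needs no cluster.
select_ray_runner_via_env()
print(os.environ["DAFT_RUNNER"])
```

Use one option or the other rather than both; the in-code route is left uncalled because it needs a Ray environment to be available.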

💥 Breaking Changes

SQLCatalog was deprecated in v0.5 and is now removed. Pass table bindings to daft.sql as keyword arguments instead.

Before:

catalog = SQLCatalog({"test_data": df})
result = daft.sql("SELECT * FROM test_data", catalog=catalog)

After:

bindings = {"test_data": df}
result = daft.sql("SELECT * FROM test_data", **bindings)

✨ Features

🐛 Bug Fixes

🚀 Performance

♻️ Refactor

  • refactor: proper organization for providers and protocols in daft.ai @rchowell (#5125)

📖 Documentation

👷 CI

🔧 Maintenance

⏪ Reverts

Full Changelog: v0.5.22...v0.6.0

v0.5.22

28 Aug 15:00
248b7e8

What's Changed 🚀

💥 Breaking Changes

✨ Features

🐛 Bug Fixes

♻️ Refactor

📖 Documentation

👷 CI

🔧 Maintenance

  • chore: Remove deprecated APIs for 0.6 @colin-ho (#5050)
  • chore: disable hugging face library progress bars @kevinzwang (#5040)
  • chore: relax assertion in flaky sharding distribution test @Jay-ju (#5053)
  • chore(dev): use pyproject.toml to manage the dev dependencies @xy-xin (#4849)
  • chore: random the counter during creating DistributedActorPoolProject… @stayrascal (#5039)

⏪ Reverts

Full Changelog: v0.5.21...v0.5.22

v0.5.21

21 Aug 21:57
9017c8f

What's Changed 🚀

✨ Features

🐛 Bug Fixes

📖 Documentation

🔧 Maintenance

Full Changelog: v0.5.20...v0.5.21

v0.5.20

20 Aug 19:34
0bcaf35

What's Changed 🚀

💥 Breaking Changes

✨ Features

🐛 Bug Fixes

♻️ Refactor

  • refactor: make DaftExtension class definition static @Jay-ju (#4968)

📖 Documentation

  • docs: fix grammar in CONTRIBUTING.md setup instructions @ykdojo (#4986)

👷 CI

🔧 Maintenance

Full Changelog: v0.5.19...v0.5.20

v0.5.19

14 Aug 23:03
23549cd

What's Changed 🚀

We have a pretty crazy release this time around. Some especially notable features include:

  • Interactive DataFrames in Jupyter Notebooks, with special support for some multimodal types
  • An async API for LLM text generation, particularly with OpenAI
  • A new .into_batches DataFrame API, the modern alternative to .into_partitions
  • Support for the .offset / OFFSET operator across the engine. Thanks @plotor for the great work!
  • Various Flotilla performance and reliability improvements
  • Various casting improvements

✨ Features

🐛 Bug Fixes

  • fix: Always just use actor for flotilla scheduler @colin-ho (#4978)
  • fix: Add handle for swordfish runtime stats manager @colin-ho (#4970)
  • fix: Dedup lance read required columns @xloya (#4967)
  • fix: Don't use wildcard for logical plan match in pushdown rules @colin-ho (#4945)
  • fix: Coerce arrow schema for parquet decoding @colin-ho (#4948)
  • fix: use associate type for swordfish into_batches operator state @ohbh (#4956)
  • fix: raise error on invalid cross join parameters @rchowell (#4952)
  • fix: interactive html fixes @colin-ho (#4943)

♻️ Refactor

  • refactor: Use associate type for swordfish operator state @colin-ho (#4921)

📖 Documentation

👷 CI

  • ci: add 3.11 to matrix and always include typing_extensions @rchowell (#4950)

🔧 Maintenance

  • chore: Remove Err Case for Series and RecordBatch size_bytes @srilman (#4954)

👋 New Contributors

Full Changelog: v0.5.18...v0.5.19

v0.5.18

08 Aug 08:39
bda006a

What's Changed 🚀

✨ Features

🐛 Bug Fixes

  • fix: Batch RuntimeSubscriber updates for all nodes @srilman (#4932)
  • fix: Column Ordering in UDF & Project Optimizations @srilman (#4923)
  • fix: Refactor Progress Bar to be a RuntimeStatSubscriber @srilman (#4837)

🚀 Performance

♻️ Refactor

📖 Documentation

🔧 Maintenance

  • chore: config isort known_third_party to fix import formatting errors @Jay-ju (#4840)
  • chore: add warning on repartition in native runner @kevinzwang (#4910)

Full Changelog: v0.5.17...v0.5.18

v0.5.17

05 Aug 07:11
2558eca

What's Changed 🚀

📖 Documentation

🔧 Maintenance

Full Changelog: v0.5.16...v0.5.17

v0.5.16

02 Aug 01:09
105580b

What's Changed 🚀

✨ Features

  • feat: Interactive jupyter display @colin-ho (#4835)
  • feat: supports passing a projection kwargs in select @rchowell (#4884)
  • feat: Add offset operator support to DataFrame for Native Runner @plotor (#4582)

🐛 Bug Fixes

🚀 Performance

📖 Documentation

Full Changelog: v0.5.15...v0.5.16

v0.5.15

01 Aug 03:35
7987455

What's Changed 🚀

✨ Features

  • feat: add openai provider in llm_generate function @huleilei (#4809)
  • feat: Use shuffle_aggregation_default_partitions in flotilla aggregate @colin-ho (#4869)
  • feat: abstract the interface of scan pushdown @Jay-ju (#4772)
  • feat: Add get_or_infer_runner_type to support getting runner type from context @plotor (#4810)
  • feat: support glob multiple path @stayrascal (#4811)

🐛 Bug Fixes

📖 Documentation

🔧 Maintenance

Full Changelog: v0.5.14...v0.5.15