-
Notifications
You must be signed in to change notification settings - Fork 747
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Release proposal: Nightly v1.0 #9604
Comments
Is there an expected time to release v1.0? |
The preliminary plan is to release in March, mainly focusing on |
Hope simplify the way to insert data, it will help get more user. |
@BohuTANG Are there any plans for higher-performance client reads, like maybe streaming Arrow/Parquet/some other high-perf format? I'm not familiar with other read protocols like for example ClickHouse's, I've just been using the mysql connector. But it would be neat to be able to have databend in the middle while paying little overhead vs reading the raw parquet files from S3. |
Databend supports the suffix an For example:
With
|
@BohuTANG That is neat and confirms my suspicion that MySql protocol is a bottleneck in some usecases. Parquet read speeds are in the GB/s, but even by telling the mysql client not to handle the result, we get only MB/s. This confirms the results in the paper I linked, see "Postgres++" in the final table of results vs "Postgres". If one wanted to use databend as a simple intermediary between dataframes and s3 (more lake-house style), databend is providing a lot of value still in interactive query handling, file size and metadata mgmt, far simpler interface, etc. But it presents a bottleneck when it comes to raw-read-speed. If I wanted to do this for example: |
I believe the modern open source protocol most similar to what that paper describes is "Apache Arrow Flight" |
Yes, we have plan to do this in #9832. If the query result is small, MySQL client could work as normal since OLTP data result will commonly be small so it's ok. Otherwise, we should use other formats or protocols to handle large output (MySQL client is really bad in this case) You can use:
This paper did not cover |
Summary
Release name: v1.0-nightly, get on the train now ✋
Let's make the Databend more Lakehouse!
v1.0 (Prepare for release on March 5th)
Archive releases
Reference
What are Databend release channels?
Nightly v1.0 is part of our Roadmap 2023
Community website: https://databend.rs
The text was updated successfully, but these errors were encountered: