Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Docs: Update roadmap to point at EPIC's, clarify project goals #6639

Merged
merged 5 commits into from
Jun 15, 2023

Conversation

alamb
Copy link
Contributor

@alamb alamb commented Jun 12, 2023

Which issue does this PR close?

Closes #3935

Related to #6441

Rationale for this change

Our roadmap is somewhat out of date as it refers to several projects that seem to have been completed (the relevant ticket have been closed)

Also, I am working on #5812 and wanted to have an up to date roadmap to discuss

Also, we recently had a discussion #6441 about the vision of DataFusion, which should be reflected in the user facing documentation

What changes are included in this PR?

  1. Update the roadmap section of the docs to point at github epics
  2. Incorporate feedback from Discussion: "Goals of DataFusion" / "Vision" for DataFusion #6441 (turns out what was on the site was already pretty close)

Are these changes tested?

N/A

Are there any user-facing changes?

@github-actions github-actions bot added the core Core DataFusion crate label Jun 12, 2023
@@ -132,7 +132,13 @@
//!
//! ## Customization and Extension
//!
//! DataFusion supports extension at many points:
//! DataFusion is designed to be a "disaggregated" query engine. This
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is trying to address @boazberman 's comments in #6441 (comment)

[Rust](https://www.rust-lang.org/) and [Apache Arrow](https://arrow.apache.org/)
to provide:

1. Best-in-class single node query performance
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These goals are largely redundant with the introduction, so I figured it would be better to leave a link and direct people back there rather than partially replicate the content


## Planning `EPIC`s

DataFusion uses [GitHub
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I began this PR by trying to summarize the outstanding work and to do so I looked at the EPICs -- pretty soon I found that I was just replicating https://github.com/apache/arrow-datafusion/issues?q=is%3Aissue+is%3Aopen+epic in a markdown document that would end up out of date

While a more free form version of the roadmap in text (rather than a github issue list) is probably easier to consume, unless we have a volunteer to commit to doing, keeping our efforts focused on keeping github updated seemed better.

building new systems. Here are some examples of systems built using DataFusion:
building new systems.

While most current usecases are "analytic" or (throughput) some
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is trying to channel @avantgardnerio 's suggestion on #6441 (comment) though I am not sure how faithfully I have done so

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not sure I could say it any better.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think you need the or after "analytic" though?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
While most current usecases are "analytic" or (throughput) some
While most current usecases are "analytic" (throughput) some

Nice catch -- 🦅 👁️

@alamb alamb self-assigned this Jun 12, 2023
Copy link
Member

@andygrove andygrove left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@alamb
Copy link
Contributor Author

alamb commented Jun 13, 2023

I plan to leave this PR open for a few more days to make sure anyone who is interested gets a chance to reply / comment. I'll try and merge it in towards the end of the week

@alamb alamb merged commit 9dfaf42 into apache:main Jun 15, 2023
@alamb alamb deleted the alamb/update_roadmap branch June 15, 2023 19:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
core Core DataFusion crate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Update the Roadmap.md
6 participants