Looking for help maintaining (reviewing PRs) in arrow/parquet/object_store #6418

Open
alamb opened this issue Sep 18, 2024 · 12 comments
Labels
enhancement Any new improvement worthy of a entry in the changelog

Comments

@alamb
Contributor

alamb commented Sep 18, 2024

Is your feature request related to a problem or challenge? Please describe what you are trying to do.

PRs in this repository are now taking days/weeks to review, which results in

  1. Slower velocity (how fast we can change this library)
  2. A bad community experience (if you get no feedback for a while, it is likely you will go elsewhere to help)

This repo isn't something "we" are providing or gatekeeping. This project is a community effort in all aspects.

Describe the solution you'd like
I would like more capacity to review PRs in this repo so that we can merge them faster and accelerate progress in this repository.

Reviewing PRs means:

  1. Ensure all PRs are looked at in a timely manner (ideally within 24 hours) (see list here)
  2. Ensure the PR follows the contribution guidelines.
  3. Ensure the PR is clearly documented, and the code follows existing patterns
  4. Ensure the new functionality is covered with tests
  5. When it looks good, @ mention one of the committers and we'll give it a review and merge

Describe alternatives you've considered
I would like everyone who contributes code to this repository to feel both welcome and empowered to review PRs. It takes time and effort, no doubt, but you will very likely learn something and sustained help reviewing PRs is a great way to be considered for committer yourself.

If you are waiting on a review, please consider helping review other PRs -- it will reduce the time before your PR is reviewed.

People may have the idea that PRs can only be reviewed by "committers" in arrow. However, this is not the case -- committers need to do a final approve + merge, but anyone can (and should!) be reviewing other PRs.

As an example, @iffyio's work in sqlparser-rs is a textbook example of how to review PRs: https://github.com/sqlparser-rs/sqlparser-rs/pulls?q=is%3Apr+commenter%3Aiffyio

Additional context
Previously @tustvold spent a significant amount of his effort on reviewing PRs in this crate and was paid to do so by InfluxData. He has now moved on to other endeavors, and while he is still helping and InfluxData is working on backfilling his review capacity, this project should not be seen as something we can afford to maintain by ourselves.

@alamb alamb added the enhancement Any new improvement worthy of a entry in the changelog label Sep 18, 2024
@alamb alamb pinned this issue Sep 18, 2024
@alamb alamb changed the title Looking for help maintaining this crate Looking for help maintaining (reviewing PRs) in arrow/parquet/object_store Sep 18, 2024
@alamb
Contributor Author

alamb commented Sep 18, 2024

BTW total kudos to @Xuanwo for their help so far -- it has been most appreciated

@ByteBaker
Contributor

@alamb I'd be happy to chip in, especially with the very old ones, because I feel some of them have stalled to the point where they're effectively obsolete but still occupy an entry in the open issues list.

@Xuanwo
Member

Xuanwo commented Sep 19, 2024

BTW total kudos to @Xuanwo for their help so far -- it has been most appreciated

Please feel free to ping me if there are object_store or storage-related PRs (such as the parquet async reader) that need my review, as I'm confident in handling them.

For other parts of this project, I'm still learning, so I can only handle the simpler ones.

@alamb
Contributor Author

alamb commented Sep 19, 2024

@alamb I'd be happy to chip in, especially with the very old ones, because I feel some of them have stalled to the point where they're effectively obsolete but still occupy an entry in the open issues list.

That would be super helpful @ByteBaker -- some of the older ones are indeed stalled. Any help cleaning up would be appreciated (even if it is just a ping to me or another committer saying they should probably be closed).

@alamb
Contributor Author

alamb commented Sep 19, 2024

@dsgibbons asked on discord:

Is there a list of relevant committers for arrow-rs specifically? (as opposed to C++, Go etc.)

TLDR: there is no such list that I know of, nor any official distinction between arrow committers. However, clearly there are those of us who focus more on the Rust implementation.

I think the easiest thing to do is look at who has merged recent PRs in this repo (examples: myself @alamb, @tustvold, @viirya, @Dandandan).

@bkietz
Member

bkietz commented Sep 19, 2024

It might be worthwhile to include google/flatbuffers#8150 on this list ( PRs google/flatbuffers#8398 #6426 )

@alamb
Contributor Author

alamb commented Sep 23, 2024

It might be worthwhile to include google/flatbuffers#8150 on this list ( PRs google/flatbuffers#8398 #6426 )

@bkietz what list are you referring to?

@bkietz
Member

bkietz commented Sep 23, 2024

I meant the list of PRs linked to this issue above

@alamb
Contributor Author

alamb commented Sep 30, 2024

BTW there has been a marked uptick in review activity, thank you to @etseidl @wiedld @Xuanwo @westonpace @ByteBaker and others who I probably forgot for all their help 🙏

@samuelcolvin
Contributor

@alamb we don't have that much spare time to devote to reviews, but we are contributing a fair amount and using arrow-rs and datafusion a lot.

Therefore, we (Pydantic Services Inc.) would be happy to sponsor individuals or organizations significantly (up to $1000/month) for helping to maintain and develop arrow-rs.

If that can help someone (or multiple people) find more time to work on arrow-rs (or datafusion), it would make me very happy. If not, feel free to ignore my comment — I know most people don't work on open source for the money, so I don't want to pollute the platonic ideals of open source with grubby 💸 if it's not wanted.

@ByteBaker
Contributor

I'd urge others to also please go through the list above. If an item they see doesn't have a recent comment from me, please add your inputs.
