Description
Have you read the RFC docs?
- Yes, I have read and understood the RFC docs
Summary
As generated code becomes easier and easier to create, what is our policy in nf-core?
Let's put Cursor rules, CLAUDE.md files, etc., in our repos. I don't think we should have agents running constantly, or that nf-core should be paying for that. But I don't want people to have to keep their own siloed rules files, regenerate those rules every time they want to contribute, or generate rules that are inaccurate and misguide their AI-generated code, further wasting the community's time and resources.
Example PR: nf-core/tools#3678
I'm hoping to keep this away from ethics or personal views on generated code, and to focus on how we are going to cope with it as an org, from the standpoint of code quality and the quality of nf-core as an organization.
Champion
Background & Motivation
Generated PRs are going to come in, so we need a policy on accepting them.
I personally want more agent-generated code, as long as I get to review it first. In lots of repos (tools, website, ops) we lack humans who understand the codebase, so we can leverage generated code to tackle long-standing issues.
We're seeing lots of generated code coming in and are utilizing it. The principle is to keep PRs small and concise, so that humans can review the code and aren't overwhelmed by the extreme scope.
Again, the goal is not for nf-core to start running entirely on agents. I would rather have documentation and rules for people who are using AI to contribute to nf-core, to improve their experience, and have us collaborate and iterate on those rules. The alternative is cutting that off and refusing to add markdown files that don't change any of the code base itself, purely on the principle that we are being inundated with slop.
Unless we want to block PRs for being AI-generated, which I think would just discourage more people from contributing to nf-core, we should improve their experience; maybe then we will get less slop and better code contributed.
Against Argument
I don't see how generated code can fix the problem with lack of people having insight into the tools repo.
I totally agree with the latter point, but we can only get more people with (some) insight by having them write code. Generated PRs are also a lot trickier to review than human-written code, because they hide the experience level of the person opening the PR, which makes them difficult to triage and makes it harder to guide people through the PR.
I already see them popping up here and on the website, and they cost more effort on my side than they give back in features or contributors.
Goals
- Add markdown files to guide AI-generated code so it better adheres to nf-core standards
- Prevent wasted time from off-the-mark AI-generated code PRs coming in
- Close long-standing issues that we're not able to find the resources to complete in nf-core.
Non-Goals
- Move away from the goal of being "A global community effort to collect a curated set of open‑source analysis pipelines built using Nextflow."
- Run AI models to generate code
- Allow AI-generated code to get merged without being reviewed by humans
References