Improved preprompts for improve flow #1045

ATheorell · 2024-03-04T14:10:09Z

I have noticed that the current preprompts for the improve flow sometimes leads to lazy behavior, in the sense that no or little code is written. In this PR, I have aligned the preprompts as much as possible with the generate workflow, which we know is not lazy. I hope this will lead to tangible improvements and hopefully we can soon test the actual improvement using the apps benchmarks which is about to be supported (#1025).

codecov · 2024-03-04T14:13:02Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 82.96%. Comparing base (b16eef1) to head (fc12cdc).

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1045      +/-   ##
==========================================
- Coverage   83.41%   82.96%   -0.45%     
==========================================
  Files          25       25              
  Lines        1272     1274       +2     
==========================================
- Hits         1061     1057       -4     
- Misses        211      217       +6

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

ATheorell · 2024-03-04T14:21:03Z

@ErikBjare is there a quick explanation why the codecov workflow fails?

ErikBjare · 2024-03-04T15:04:37Z

@ATheorell codecov.io has these configurable thresholds for what constitutes an acceptable code coverage.

Since we only use it for informational purposes, and not as a blocker for merging PRs, we don't want the checks to fail.

I was supposed to configure this, but I cannot since I don't have org-level maintainer access.

Here are the steps:

Go here: https://app.codecov.io/gh/gpt-engineer-org
Go to settings -> Global YAML
Screenshot
Paste this:

coverage:
  status:
    patch:
      default:
        informational: true
    project:
      default:
        informational: true

ATheorell · 2024-03-04T18:26:18Z

Thanks for instructions. I have now updated the global yaml. Tried to rerun the jobs, but maybe I need to make a new commit for this to come into effect. Anyway, we can still merge if we want.

similato87 · 2024-03-05T03:37:33Z

Hi @ATheorell , this looks fantastic! I've noticed instances where the LLMs yield no differences, leading to hit 'no diff' exceptions for users. My workaround has been advising users to reselect files and modify their prompts. However, ensuring that the LLMs consistently offer guidance on changes would enhance user experience, as that's the primary reason users engage with this feature.
Additionally, introducing an option for users to specify their preference for the extent of changes—via a flag—could align well with the new configuration feature from @ErikBjare. This approach could make the tool more user-centric and adaptable to individual user needs.

gpt_engineer/preprompts/improve

ATheorell requested review from TheoMcCabe, similato87 and ErikBjare March 4, 2024 14:10

TheoMcCabe reviewed Mar 5, 2024

View reviewed changes

gpt_engineer/preprompts/improve Show resolved Hide resolved

ATheorell added 4 commits March 9, 2024 13:42

aligning improve prompt with generate prompt

83ed29e

clarified file format prompt

6e3f074

updated cache

36063bc

updating cache

fc12cdc

ATheorell force-pushed the main branch from 1a51cd0 to fc12cdc Compare March 9, 2024 12:47

ATheorell merged commit 178d17d into AntonOsika:main Mar 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improved preprompts for improve flow #1045

Improved preprompts for improve flow #1045

ATheorell commented Mar 4, 2024

codecov bot commented Mar 4, 2024 •

edited

Loading

ATheorell commented Mar 4, 2024

ErikBjare commented Mar 4, 2024 •

edited

Loading

ATheorell commented Mar 4, 2024

similato87 commented Mar 5, 2024

Improved preprompts for improve flow #1045

Improved preprompts for improve flow #1045

Conversation

ATheorell commented Mar 4, 2024

codecov bot commented Mar 4, 2024 • edited Loading

Codecov Report

ATheorell commented Mar 4, 2024

ErikBjare commented Mar 4, 2024 • edited Loading

ATheorell commented Mar 4, 2024

similato87 commented Mar 5, 2024

codecov bot commented Mar 4, 2024 •

edited

Loading

ErikBjare commented Mar 4, 2024 •

edited

Loading