Summary 💡
Currently someone needs to intentionally modify the code to declare "I have beaten challenge A".
But it's possible that an improvement aimed at challenge A also improves challenge B.
We need to attempt all challenges any time there is a prompt change.
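A rough sketch of what that could look like as a test suite (the challenge names, the `challenges.runner` module, and the `result.success` attribute are hypothetical placeholders, not Auto-GPT's actual API):

```python
import pytest

from challenges.runner import run_challenge  # hypothetical entry point

# Every known challenge is attempted on each prompt change, so an
# improvement aimed at challenge A is also measured against challenge B.
CHALLENGES = ["memory_a", "memory_b", "information_retrieval_a"]

@pytest.mark.parametrize("challenge", CHALLENGES)
def test_challenge_still_beaten(challenge):
    result = run_challenge(challenge)
    assert result.success, f"prompt change regressed {challenge}"
```

Wired into CI on any change to the prompt files, this would replace manually flipping a "challenge beaten" flag in the code.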
Examples 🌈
No response
Motivation 🔦
No response
waynehamadi changed the title "How do we know if a prompt improved Auto-GPT ?" → "How do we know **AUTOMATICALLY** if a prompt improved Auto-GPT ?" on May 14, 2023
waynehamadi changed the title "How do we know **AUTOMATICALLY** if a prompt improved Auto-GPT ?" → "How do we know AUTOMATICALLY if a prompt improved Auto-GPT ?" on May 14, 2023
For starters, by keeping track of the costs spent to arrive at a solution?
In other words, at least steps/API tokens + time?
In the future, maybe by tracking CPU/RAM utilization as well.
But in general we should gather data for different prompts so that we can use gnuplot to plot performance for each version/commit.
And we should probably start by using GPT to come up with N mutations for a given task (that we know works) and then use those as a baseline for future benchmarking.
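A minimal sketch of that metric collection (the `run_challenge` callable and its `success`/`steps`/`api_tokens` attributes are hypothetical; the CSV layout is illustrative):

```python
import csv
import subprocess
import time
from pathlib import Path

RESULTS_FILE = Path("benchmark_results.csv")
FIELDS = ["commit", "challenge", "success", "steps", "api_tokens", "seconds"]

def current_commit() -> str:
    """Return the short hash of the checked-out commit."""
    return subprocess.check_output(
        ["git", "rev-parse", "--short", "HEAD"], text=True
    ).strip()

def record_run(challenge: str, run_challenge) -> None:
    """Run one challenge and append its cost metrics to the CSV.

    `run_challenge` is a hypothetical callable returning an object
    with `success`, `steps`, and `api_tokens` attributes.
    """
    start = time.monotonic()
    result = run_challenge(challenge)
    elapsed = time.monotonic() - start

    new_file = not RESULTS_FILE.exists()
    with RESULTS_FILE.open("a", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS)
        if new_file:
            writer.writeheader()
        writer.writerow({
            "commit": current_commit(),
            "challenge": challenge,
            "success": result.success,
            "steps": result.steps,
            "api_tokens": result.api_tokens,
            "seconds": round(elapsed, 2),
        })
```

One CSV row per (commit, challenge) run keeps the data trivially plottable with gnuplot (or matplotlib) as performance-over-commits curves.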
This issue has automatically been marked as stale because it has not had any activity in the last 50 days. You can unstale it by commenting or removing the label. Otherwise, this issue will be closed in 10 days.