
Introducing Looping Heuristics / Detection #3668

Closed
1 task done
Boostrix opened this issue May 2, 2023 · 11 comments
Labels: AI efficacy, enhancement, meta

Comments

@Boostrix
Contributor

Boostrix commented May 2, 2023

Duplicates

  • I have searched the existing issues

Summary 💡

This is a "meta" issue to keep track of issues relating to redundant/unnecessary (infinite/endless) looping and the idea to keep track of previous arguments to detect such situations, as per: #3444 (comment)

Idea: maintain a "call stack" that contains hashed values of each query/prompt. Whenever the same hash comes up again, increment a counter to detect whether we're inside a loop without making much progress (the arguments will remain the same, and so will the hash). If we are not making any progress at all, the response will also be the same, so hash that too.

This should also help the agent determine if it's trying to re-solve a task that was previously tackled.
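For illustration, a minimal sketch of this idea in Python (the class and method names are hypothetical, not AutoGPT's actual API): hash each prompt/response pair and count how often the same pair recurs.

```python
import hashlib
from collections import Counter

class LoopDetector:
    """Counts repeated prompt/response pairs to flag a likely loop."""

    def __init__(self, max_repeats: int = 3):
        self.max_repeats = max_repeats
        self.seen = Counter()

    @staticmethod
    def _digest(*parts: str) -> str:
        h = hashlib.sha256()
        for part in parts:
            h.update(part.encode("utf-8"))
        return h.hexdigest()

    def record(self, prompt: str, response: str) -> bool:
        """Record one step; return True once it has repeated too often."""
        key = self._digest(prompt, response)
        self.seen[key] += 1
        return self.seen[key] >= self.max_repeats
```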

Solution: a sub-agent should notify its parent agent, either via the messaging API or by raising the equivalent of an exception, so that it can be terminated/restarted: #1548 (comment)
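A hypothetical sketch of that escalation path (the agent interface shown here is assumed, not AutoGPT's real one): the sub-agent raises a dedicated exception when the detector trips, and the parent catches it to terminate or restart the child.

```python
class LoopDetected(Exception):
    """Raised by a sub-agent that has detected it is stuck in a loop."""

def run_sub_agent(agent, detector) -> None:
    # Hypothetical driver loop: step the agent, bail out on a detected loop.
    while not agent.done:
        prompt = agent.next_prompt()
        response = agent.step(prompt)
        if detector.record(prompt, response):
            raise LoopDetected(f"{agent.name}: same step repeated too often")

# The parent agent can then decide what to do:
#   try:
#       run_sub_agent(child, LoopDetector())
#   except LoopDetected:
#       ...  # terminate or restart the child, or ask the user
```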

For top-level agents, it's probably best to interrupt the loop and pursue an alternate option, which may involve human feedback: #3396
The cleanest method might be offering the user a list of options (inspired by the current state of things, as per #1548), including an option for open-ended feedback.

This feedback should be serialized in some form, so that the agent can easily refer back to it, as per #1377.
The goal is to provide a means of doing some form of self-assessment, as per #305.
This may involve telling the agent to log its progress to a task-specific log file, so that a parent agent can evaluate the log file and compare it to the stated long-term goal.

Examples 🌈

This is just based on hashing full thoughts + new decision (command + args) and incrementing a counter every time we see the same "situation":
[screenshot: bailout]

Motivation 🔦

  • detect whether an agent is trying to tackle a task that it tackled previously
  • detect whether it's using the same arguments and seeing the same response (=being stuck)
  • get rid of unnecessary looping
  • allow an agent to detect whether its work is in line with the stated goal or not
  • provide a means to bail out if necessary, either informing the parent agent and/or asking for human feedback
  • at the very least, use this as a means to change the problem-solving strategy
@zachary-kaelan

We could have a compact list of previously completed tasks fed into the prompt every iteration. But a more robust solution would be to make memory queries happen automatically every iteration and feed their top N results into the prompt as, "You remember X, Y, Z."
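A rough sketch of that second idea (`memory.get_relevant` mirrors the old Auto-GPT memory interface, but treat the exact signature as an assumption):

```python
def augment_prompt(prompt: str, memory, n: int = 3) -> str:
    """Query memory every iteration and prepend the top-N hits to the prompt."""
    results = memory.get_relevant(prompt, n)
    if not results:
        return prompt
    remembered = "; ".join(str(r) for r in results)
    return f"You remember: {remembered}\n\n{prompt}"
```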

@Boostrix
Contributor Author

Boostrix commented May 5, 2023

I've tried using a separate interpretation step to interpret the result of an action and modify the plan accordingly; that worked at least somewhat better than before. However, a number of folks have now mentioned two issues relating to Pinecone memory and self-feedback not working, so maybe what I'm seeing isn't currently representative.

@anonhostpi

I think it would be worth creating an issue label for this as well. That may prevent the need for this "meta" issue.

@anonhostpi

anonhostpi commented May 5, 2023

There should also be a tag for JSON issues: https://github.com/Significant-Gravitas/Auto-GPT/labels/invalid_json

Every 3rd notification I get is about issues with JSON

@Boostrix
Contributor Author

Boostrix commented May 11, 2023

This is just based on hashing full thoughts + new decision (command + args) and incrementing a counter every time we see the same "situation":
[screenshot: bailout]

For starters:

  • should add a counter to track the number of agent invocations where the same request/response results in the same local action
  • support an agent-specific setting to restrict this to MAX_ITERATIONS_IDENTICAL_STEPS (or via the env file)
  • also, should probably consider a configurable TIMEOUT_SECS, so that the action is interrupted with a timeout error and an exact message stating that it's doing something redundant (see the sketch below)

And this stuff needs to work per agent instance, so that sub-agents can be set up accordingly.
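A sketch of what such a per-instance guard could look like, reading the two settings named above from the environment (class name and defaults are assumptions):

```python
import os
import time

class AgentLoopGuard:
    """One guard per agent instance, so sub-agents can be configured independently."""

    def __init__(
        self,
        max_identical: int = int(os.getenv("MAX_ITERATIONS_IDENTICAL_STEPS", "3")),
        timeout_secs: float = float(os.getenv("TIMEOUT_SECS", "120")),
    ):
        self.max_identical = max_identical
        self.timeout_secs = timeout_secs
        self.started = time.monotonic()
        self.last_key = None
        self.repeats = 0

    def check(self, key: str) -> None:
        # Wall-clock timeout for the whole action.
        if time.monotonic() - self.started > self.timeout_secs:
            raise TimeoutError("action exceeded TIMEOUT_SECS")
        # Count consecutive invocations with the identical request/response key.
        self.repeats = self.repeats + 1 if key == self.last_key else 1
        self.last_key = key
        if self.repeats >= self.max_identical:
            raise TimeoutError(
                f"identical step repeated {self.repeats} times; "
                "the agent appears to be doing something redundant"
            )
```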

We could have a compact list of previously completed tasks fed into the prompt every iteration.

This is interesting stuff, touching on keeping track of "experiences": the agent being able to remember its actions by maintaining a history of command/param tuples that did or didn't work, along with the associated errors/interpretation, as per #3835 (comment)

@eyalk11
Contributor

eyalk11 commented Jun 30, 2023

There could be two sets:

  • a set of already executed commands+args
  • a set of already executed commands+args that we asked the user about and they approved

If a command was already executed, then the agent stops and asks the user. If the user confirms, then we know we are good to go with this command and we won't stop next time. If the user rejects or adds feedback, we will stop next time.
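A minimal sketch of that logic (hypothetical helper, not the actual #3914 code):

```python
executed: set[tuple] = set()  # commands+args that were already run
approved: set[tuple] = set()  # repeats the user has explicitly approved

def should_stop_and_ask(command: str, args: tuple) -> bool:
    key = (command, args)
    if key in approved:
        return False  # approved repeat: good to go, don't stop next time
    if key in executed:
        return True   # unapproved repeat: stop and ask the user
    executed.add(key)
    return False      # first execution: proceed

def on_user_confirmed(command: str, args: tuple) -> None:
    approved.add((command, args))  # we won't stop for this command again
```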

I originally introduced the first one in #3914. So I think I will delete this section and open a new PR.

8139493 is the old (deleted) version.

@eyalk11
Contributor

eyalk11 commented Jul 1, 2023

Following the discussion with @Boostrix, I did some work on the subject. I allowed every command to have its own calculate_hash function, so that it can return the hash of the file instead of the hash of the command arguments (which is the default case): eyalk11@25d694f
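An illustrative sketch of that override pattern (names are made up here; the actual implementation is in the linked commit):

```python
import hashlib
from pathlib import Path

class Command:
    def calculate_hash(self, **kwargs) -> str:
        # Default case: hash the command arguments.
        raw = repr(sorted(kwargs.items())).encode("utf-8")
        return hashlib.sha256(raw).hexdigest()

class ReadFileCommand(Command):
    def calculate_hash(self, filename: str, **kwargs) -> str:
        # Override: hash the file contents instead, so the step only counts
        # as a repeat while the file itself is unchanged.
        return hashlib.sha256(Path(filename).read_bytes()).hexdigest()
```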

@Boostrix
Contributor Author

Boostrix commented Jul 1, 2023

As I mentioned on Discord, I believe the first step is coming up with ideas/challenges that trigger redundant looping, and then using those as a baseline for any fixes we can come up with, no matter whether it's my original hashing-based approach or something that you came up with.

Therefore, I'm going to ping @merwanehamadi (head of our challenges department) to keep him in the loop.

@Pwuts Pwuts self-assigned this Jul 1, 2023
@Pwuts Pwuts added this to the v0.5.0 Release milestone Jul 1, 2023
@Boostrix
Contributor Author

Boostrix commented Jul 3, 2023

FWIW, this was recently posted on Discord, and the article covers our looping issue: https://lorenzopieri.com/autogpt_fix/

Do they work? Nope!
The problem is … AI agents do not work. The typical session of AutoGPT ends up stuck in an infinite cycle of actions, such as google something, write it to file, read the file, google again… In general, goals requiring more than 4-5 actions seem to be out of reach.

@github-actions
Contributor

github-actions bot commented Sep 6, 2023

This issue has automatically been marked as stale because it has not had any activity in the last 50 days. You can unstale it by commenting or removing the label. Otherwise, this issue will be closed in 10 days.

@github-actions github-actions bot added the Stale label Sep 6, 2023
@Pwuts Pwuts added the meta Meta-issue about a topic that multiple issues already exist for label Sep 14, 2023
@Pwuts Pwuts moved this to ⏩ In Progress in AutoGPT development kanban Sep 14, 2023
@Pwuts
Member

Pwuts commented Sep 14, 2023

Partial solution in e437065
