
Split bash commands by the new line character #4462

Open · wants to merge 18 commits into main

Conversation

@tofarr (Collaborator) commented Oct 17, 2024

End-user friendly description of the problem this fixes or functionality that this introduces

  • Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below

When a bash command spans multiple lines, the output is really confusing, so we split each command on the newline character for better readability. My initial approach was to split the command into multiple separate commands and add these to the event stream individually, but this did not play well with the LLM. The behavior was non-deterministic, but sometimes it would basically say "Thank you for running the first command; what about the others?" and then resubmit the commands that had not yet been run, resulting in duplicates in the stream!

So my approach was to improve the output by splitting commands in the runtime client, while still treating them as a single CmdRunAction.
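The approach can be sketched as follows. This is a minimal illustration, not the actual runtime-client code: run_single is a hypothetical stand-in for executing one command in the persistent shell, and the naive newline split stands in for the real split_bash_commands, which also has to handle quoting and heredocs.

```python
def run_cmd_action(command: str, run_single) -> str:
    """Split a multi-line command, run each piece, and aggregate the
    output so the whole thing still reads back as one observation."""
    # Naive split for illustration; the real splitter is smarter.
    parts = [line for line in command.split("\n") if line.strip()]
    chunks = []
    for part in parts:
        output = run_single(part)
        # Echo the command before its output, like an interactive shell.
        chunks.append(f"$ {part}\n{output}")
    return "\n".join(chunks)
```

The key point is that the pieces are executed separately but returned as one aggregated observation, so the event stream still sees a single action/observation pair.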

Example 1:

Print the word "OpenHands" in the bash console in ascii art. Use only the echo command to do this. New lines of text should use the same invocation of the echo command

[screenshot]

Example 2


* ls -lah
* echo "May the force be with you!"
* echo "May the odds be ever in your favor!"
* echo "Live long and prosper!"

[screenshot]

Example 3


* ls -lah
* echo "hello world"
* echo a sample kubernetes deployment yaml into deploy.yaml

(This looks a little uglier because the console interprets each tab as 8 spaces.)
[screenshot]

Example 4 (With confirmation mode!)

Please do the following in a single bash command:

* sleep for 5 seconds
* print "You must Narfle the Garthok!"
* Print the current working directory

[screenshot]

Example 5 (Running user commands)

[screenshot]

@tofarr force-pushed the feat_split_commands_by_new_line branch from e4b2264 to 5b62bff on October 17, 2024 20:26
@tofarr marked this pull request as ready for review on October 17, 2024 20:44
@xingyaoww (Contributor) left a comment:

We actually do something like this on the backend already. What do you think about moving it to the controller here? That is, we break one CmdRunAction into multiple ones and send them over for execution in the agent controller:

commands = split_bash_commands(action.command)
all_output = ''
for command in commands:
    if command == '':
        output, exit_code = self._continue_bash(

            new_action.thought = ''
        else:
            new_action.thought = action.thought
        self.event_stream.add_event(new_action, EventSource.AGENT)
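The controller-side fan-out being proposed could look roughly like this. This is a hedged sketch with a stripped-down stand-in for OpenHands' CmdRunAction; the real plumbing around the event stream is more involved.

```python
from dataclasses import dataclass

@dataclass
class CmdRunAction:
    """Stripped-down stand-in for OpenHands' CmdRunAction."""
    command: str
    thought: str = ''

def fan_out(action, split_bash_commands, event_stream, source):
    """Break one CmdRunAction into several and queue them on the stream."""
    commands = split_bash_commands(action.command)
    for i, cmd in enumerate(commands):
        new_action = CmdRunAction(command=cmd)
        # Keep the thought only on the last piece so the UI does not
        # repeat it once per sub-command.
        new_action.thought = action.thought if i == len(commands) - 1 else ''
        event_stream.add_event(new_action, source)
```

The open question in the thread below is how this interacts with _pending_action and delegate controllers once the sub-actions are in flight.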
Collaborator:

This is an interesting idea, how will it work with longer running commands though? Here _pending_action is set. When the first new action is executed the runtime will put an obs in the stream which will make it unset here... this places them in the stream and continues looping here. Then the step would pass here before all are executed, no? What prevents it?

Collaborator:

Yeah this probably messes with _pending_action quite a bit....

@tofarr you can probably test this by asking the agent something like:

Please run echo hello and ls -lah in the same <execute_bash> block

openhands/controller/agent_controller.py (outdated; resolved)
@enyst (Collaborator) left a comment:

I kept thinking about this PR since the other day, for some odd reason. I'd appreciate it if we take some time to make sure this could work. It seems to me it won't work right, or I just don't see why it would.

For added fun, let's take a delegate's case: if the action is split in 5 in a delegate controller, and mini-action 1 of 5 has an error, then... okay, I think something else is bugged currently, but even with a fix, say like this, the delegate ends here. In this PR, is it possible for obs 2-5 to arrive in the stream after the delegate ends? If not, can you please point out what would prevent it?

Because it seems like the child obs could end up in the parent's history, which is a tad messy. 😅

        continue
    new_action = CmdRunAction(command=cmd)
    if i < len(commands) - 1:
        new_action.thought = ''
Collaborator:

what's this about?

Collaborator (Author):

This was from the original prompt:
.. Each instance should have a blank 'thought' attribute, except for the last one which should have the thought from the original action...

The reason for this is that it prevents the frontend UI from printing the thought multiple times, e.g. (without this functionality):
[screenshot]

@enyst (Collaborator) commented Oct 18, 2024

It is possible that the action includes the same bash command several times, right? Like echo *, I don't know.

I wonder what would happen here. If they're separate in the stream, like mini-action 1, mini-obs 1, mini-action 2... it seems like they can be detected as repeated actions / obs, and the agent should stop with stuck in a loop error. 🤔 We do this for some cases when the LLM gets trapped and responds with the same thing over and over.
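The stuck-in-a-loop check being described could be sketched roughly like this. The names here are illustrative; the real detector in OpenHands is more nuanced than simply comparing the last few pairs.

```python
def looks_stuck(history, n=3):
    """Return True if the last n (action, observation) pairs in the
    history are identical, i.e. the agent appears to be repeating itself.

    history: list of (action_repr, observation_repr) tuples.
    """
    if len(history) < n:
        return False
    tail = history[-n:]
    return all(pair == tail[0] for pair in tail)
```

The concern above is that legitimately repeated sub-commands from one split action could trip a check like this.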

@rbren (Collaborator) commented Oct 18, 2024

It is possible that the action includes the same bash command several times, right? Like echo *, I don't know.

Yeah this is possible, but I can't imagine a scenario where the agent would want to do this :)

@rbren (Collaborator) commented Oct 21, 2024

So this seems to work on the backend, but the frontend isn't splitting things up correctly

[screenshot]

here's my sample prompt:

please run the following commands all at once:

* ls -lah
* echo "hello world"
* echo a sample kubernetes deployment yaml into deploy.yaml

it seems like we get all the CmdRunActions first, then we get all the Observations. Instead, we should get them collated properly
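A hypothetical sketch of the collation being asked for, with invented event shapes: it pairs each command action with its observation so the transcript interleaves them, assuming observations arrive in the same order as their actions.

```python
def collate(events):
    """Interleave command actions with their observations, even if the
    stream delivers all actions before all observations.

    events: list of dicts, each either
      {"kind": "action", "command": ...} or {"kind": "obs", "output": ...}
    """
    actions, observations = [], []
    for ev in events:
        (actions if ev["kind"] == "action" else observations).append(ev)
    lines = []
    # Pair by arrival order; assumes observations come back in order.
    for act, obs in zip(actions, observations):
        lines.append(f"$ {act['command']}")
        lines.append(obs["output"])
    return "\n".join(lines)
```

This produces the shell-transcript style shown later in the thread, with each command's output directly under its prompt line.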

@rbren (Collaborator) commented Oct 21, 2024

also notice the mangled YAML content on the last command there; not sure what the issue could be...

@tofarr (Collaborator, Author) commented Oct 21, 2024

The command posted by @rbren seems to be working now - though the bash console seems to turn tabs into 8 spaces, which makes it hard to read:
[screenshot]

@tofarr (Collaborator, Author) commented Oct 21, 2024

Confirmation mode is now behaving appropriately too:
[screenshot]

@rbren (Collaborator) commented Oct 21, 2024

@tofarr looks like it's still not working right based on your screenshot. The console in the UI should show

$ ls -lah
./foo
./bar
$ echo "hello world"
hello world

etc. That is, we should see the output of one command before the start of the next.

  message.args.is_confirmed !== "rejected"
) {
  store.dispatch(appendInput(message.args.command));
}
Collaborator (Author):

Commands are only output on completion.
