Write good prompts
Computer-use agents are still not very reliable. To improve reliability, write GOOD prompts and SPLIT big prompts/tasks into smaller chunks. Good prompts generally have the following properties:
- STOP conditions (e.g., You should ALWAYS stop when you see the bottom of the page)
- INTERACTION conditions (e.g., You should NEVER scroll up)
- RETURN format/structure (e.g., Always return a JSON object with this structure: {"example": ["value1", "value2"]})
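
The properties above can be sketched as a small prompt builder. This is only an illustration in plain Python, not part of the spongecake API: `build_prompt`, `parse_result`, and the example subtasks are hypothetical names made up for this sketch, and how you pass the resulting prompt to your agent depends on your setup.

```python
import json


def build_prompt(task, stop_conditions, interaction_rules, return_example):
    """Assemble a prompt with explicit STOP, INTERACTION, and RETURN sections.

    Hypothetical helper for illustration; not a spongecake function.
    """
    lines = [f"TASK: {task}", "", "STOP conditions:"]
    lines += [f"- {condition}" for condition in stop_conditions]
    lines += ["", "INTERACTION conditions:"]
    lines += [f"- {rule}" for rule in interaction_rules]
    lines += [
        "",
        "RETURN format:",
        "Always return a JSON object with this structure:",
        json.dumps(return_example),
    ]
    return "\n".join(lines)


def parse_result(raw, required_keys):
    """Parse the agent's reply and check that the expected keys are present."""
    data = json.loads(raw)
    missing = [key for key in required_keys if key not in data]
    if missing:
        raise ValueError(f"missing keys: {missing}")
    return data


# Split one big task into smaller chunks, one prompt per chunk.
# The subtasks themselves are made-up examples.
subtasks = [
    "Open the pricing page.",
    "Collect every plan name listed on the page.",
]

prompts = [
    build_prompt(
        task=subtask,
        stop_conditions=["You should ALWAYS stop when you see the bottom of the page"],
        interaction_rules=["You should NEVER scroll up"],
        return_example={"example": ["value1", "value2"]},
    )
    for subtask in subtasks
]

if __name__ == "__main__":
    print(prompts[0])
```

Running each small prompt separately and checking the reply with something like `parse_result(reply, ["example"])` makes it easier to see which step failed, instead of debugging one long prompt.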
Have a VNC viewer open as you develop
When building your agents with spongecake, we recommend having your VM pulled up in a VNC viewer so you can jump in and control the desktop if needed. See Connecting to desktop in the quickstart.