[Bug]: Workarena L2 fails with #89

@xhluca

Browsergym Version

0.14.1

Playwright Version

1.44.0

Operating System Type

Ubuntu

Operating System Version

Ubuntu 22.04 LTS (Jammy Jellyfish)

Affected Browsers

Chromium

What happened?

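Running WorkArena L2 fails with `ValueError: not enough values to unpack (expected 2, got 1)`. The error is raised in BeautifulSoup's lxml builder (`_getNsTag`) while `parse_document` in `browsergym/utils/obs.py` parses the page HTML. See Relevant Logs for the traceback.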

Reproduction Steps

I ran the WorkArena L2 benchmark. The run captured in the logs used GenericAgent and wrote its results under agentlab_results/.

Relevant Logs

2025-07-28 10:30:00,845 - <PID> - browsergym.experiments.loop - INFO - Running experiment GenericAgent-checkpoints_<model_name> on <task_name> in:
  agentlab_results/<timestamp>_<experiment_name>/<timestamp>_<experiment_name>
2025-07-28 10:30:00,846 - <PID> - root - WARNING - Model <checkpoint_path> not found in the pricing information, prices are set to 0. Maybe try upgrading langchain_community.
2025-07-28 10:30:00,889 - <PID> - browsergym.experiments.loop - DEBUG - Agent created.
2025-07-28 10:30:00,890 - <PID> - browsergym.experiments.loop - DEBUG - Environment created.
2025-07-28 10:30:00,891 - <PID> - urllib3.connectionpool - DEBUG - Starting new HTTPS connection (1): <redacted>.service-now.com:443
2025-07-28 10:30:01,184 - <PID> - urllib3.connectionpool - DEBUG - https://<redacted>.service-now.com:443 "GET / HTTP/1.1" 200 None
...
...truncated middle of the log...
...
    html = parse_document(document_idx=0)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File ".../browsergym/utils/obs.py", line 177, in parse_document
    soup = BeautifulSoup(html, "lxml")
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File ".../bs4/__init__.py", line 473, in __init__
    self._feed()
  File ".../bs4/__init__.py", line 658, in _feed
    self.builder.feed(self.markup)
  File ".../bs4/builder/_lxml.py", line 484, in feed
    self.parser.feed(markup)
  ...
  File ".../bs4/builder/_lxml.py", line 196, in _getNsTag
    namespace, name = tag[1:].split("}", 1)
                      ^^^^^^^^^^^^^^^^^^^^^
ValueError: not enough values to unpack (expected 2, got 1)

2025-07-28 10:30:19,423 - <PID> - browsergym.experiments.loop - INFO - Saving summary info.
2025-07-28 10:30:19,425 - <PID> - urllib3.connectionpool - DEBUG - Starting new HTTPS connection (1): <redacted>.service-now.com:443
2025-07-28 10:30:19,637 - <PID> - urllib3.connectionpool - DEBUG - https://<redacted>.service-now.com:443 "GET /api/now/table/problem?sysparm_query=sys_id%3D<redacted> HTTP/1.1" 200 None
2025-07-28 10:30:19,954 - <PID> - urllib3.connectionpool - DEBUG - https://<redacted>.service-now.com:443 "DELETE /api/now/table/problem/<redacted> HTTP/1.1" 204 0
...
(repeats of GET/DELETE redacted for brevity)
...
2025-07-28 10:30:21,760 - <PID> - root - DEBUG - Tearing down the task
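
The last frame of the traceback can be reproduced in isolation. The sketch below is a hypothetical stand-in, not bs4's actual implementation: `split_ns_tag` and `try_split` are names made up for illustration, and the `startswith("{")` guard is an assumption inferred from the `tag[1:]` slice. Only the unpacking statement is copied from the traceback. It suggests that the parser received a tag name that begins with `{` but has no closing `}`, although the log does not show which element on the page triggered it.

```python
def split_ns_tag(tag: str):
    """Hypothetical stand-in for the `_getNsTag` frame shown in the traceback."""
    # Assumption: only names starting with "{" are treated as "{namespace}name".
    if tag.startswith("{"):
        # Same statement as the failing line in the traceback.
        namespace, name = tag[1:].split("}", 1)
        return namespace, name
    return None, tag


def try_split(tag: str):
    """Return the split result, or the error text if unpacking fails."""
    try:
        return split_ns_tag(tag)
    except ValueError as exc:
        return f"ValueError: {exc}"


# A well-formed namespaced name splits into two parts.
well_formed = try_split("{http://www.w3.org/1999/xhtml}div")
# A name with "{" but no "}" yields one part, so the unpacking fails.
malformed = try_split("{div")

print(well_formed)
print(malformed)
```

If this reading is right, the second call reports the same message as the log, which would point at malformed markup in the page snapshot rather than at the agent or the task setup.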

Additional Context

No response

Code of Conduct

  • I agree to follow this project's Code of Conduct

Labels

bug (Something isn't working)
