sec(cli): introduce secure command execution wrapper and cross-platfo… by vedant-kawale-27 · Pull Request #348 · sreerevanth/AgentWatch

vedant-kawale-27 · 2026-06-16T09:20:41Z

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Security patch (fixes a vulnerability or enhances system safety)
Documentation update

Description

This PR addresses a potential command-injection vulnerability in the AgentWatch CLI by introducing a secure, centralized command execution wrapper (agentwatch/cli/_utils/run_cmd.py) and integrating it with cross-platform speech synthesis alerts.

Proposed Changes

Central Secure Command Utility (agentwatch/cli/_utils/run_cmd.py):
- Implements a wrapper around subprocess.run(..., shell=False).
- Validates command arguments against an allowed characters regex whitelist, blocking shell control characters like ;, |, <, >, ` to prevent command injection.
- Exposes a check_args parameter (default True) to allow bypassing validation for internally-controlled scripts (like PowerShell commands) that require semicolons.
- Raises a custom CommandError exception carrying the returncode, stdout, and stderr on command failures.
- Logs execution details at the DEBUG level.
Speech Synthesis with Graceful Fallback (agentwatch/cli/_utils/speech.py):
- Implements speak(message) across platform speech commands:
  - macOS: say command.
  - Windows: PowerShell SpeechSynthesis.
  - Linux/Unix: espeak command.
  - Falls back silently and returns False if voice synthesis tools are not installed or fail.
Visual Spinner & Audible Blocks (agentwatch/cli/animator.py):
- Created CLIAnimator supplying visual spinner animations and safety block voice triggers.
- Integrated speak alerts into verify_env.py (diagnostics completion), main.py (tool block trigger), and demo.py (safety demo tool block).
Documentation:
- Updated the README.md file with a usage example of the new run helper under the Safety Engine section.

Related Issue

Closes #332

How Has This Been Tested?

Automated Unit Tests: Added unit tests in tests/test_run_cmd.py covering validation rules, exceptions, and platform speech mocks.
- Command: .venv\Scripts\pytest (All 449 unit tests passed)
Linting & Code Style: Checked and reformatted the codebase.
- Command: .venv\Scripts\ruff check agentwatch/ / .venv\Scripts\ruff format --check agentwatch/ (All checks passed)
Manual Verification: Successfully ran CLI commands (agentwatch.exe serve, agentwatch.exe safety) and the interactive demo (demo.py) on Windows to confirm no encoding crashes.

Checklist

My code follows the code style of this project.
I have updated the documentation accordingly.
I have added tests to cover my changes.
All new and existing tests passed.

Summary by CodeRabbit

New Features
- Added a documented, secure CLI subprocess execution wrapper with argument allowlisting and a dedicated CommandError for failures.
Bug Fixes
- Status command now enforces a minimum refresh rate of 0.1s.
Improvements
- Updated demo console banner/output formatting and added UTF-8 stdout/stderr support where available.
- Environment diagnostics now print the populated variables table and a “Diagnostics complete” message.
- Removed speech synthesis from the demo/animation startup.
Tests
- Expanded coverage for secure command execution and added a CLI test for invalid refresh rate.

…rm speech alerts

Copilot

Copilot was unable to review this pull request because the user who requested the review has reached their quota limit.

coderabbitai · 2026-06-16T09:20:56Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

Adds agentwatch/cli/_utils/run_cmd.py, a centralized subprocess wrapper that enforces shell=False, validates arguments against an ALLOWED_ARG_RE regex allowlist, and raises CommandError on failures. Removes the speak_welcome() Windows speech synthesis function from animator.py, hardens the status command refresh_rate option with a min=0.1 constraint, updates demo.py output formatting, and adds verify_env.py diagnostic output. Accompanied by unit tests and README documentation.

Changes

Secure subprocess wrapper and CLI hardening

Layer / File(s)	Summary
run_cmd: ALLOWED_ARG_RE, CommandError, and run() `agentwatch/cli/_utils/run_cmd.py`	Defines `ALLOWED_ARG_RE` compiled regex allowlist, `CommandError` exception with optional `returncode`/`stdout`/`stderr` fields, and `run()` that validates arguments, executes with `shell=False`/`capture_output=True`/`text=True`, and raises `CommandError` on validation failures, non-zero exits, or caught OS exceptions.
_utils package init and README docs `agentwatch/cli/_utils/__init__.py`, `README.md`	Creates the `agentwatch.cli._utils` package `__init__.py` re-exporting `run` and `CommandError` via `__all__`. Adds a README "Secure Subprocess Wrapper" subsection with a usage example and `CommandError` handling pattern.
Tests for run_cmd `tests/test_run_cmd.py`	Covers valid command invocation parameters (`shell=False`, `capture_output=True`, `text=True`), forbidden metacharacter rejection, `check_args=False` bypass, allowlisted character acceptance, non-zero exit `CommandError` with attached context, `FileNotFoundError`-to-`CommandError` conversion, and allowed vs blocked shell metacharacter patterns.
Remove speak_welcome(), harden refresh_rate, update demo/verify_env `agentwatch/cli/animator.py`, `agentwatch/cli/main.py`, `agentwatch/cli/demo.py`, `agentwatch/cli/verify_env.py`	Removes `speak_welcome()` and its `os`/`subprocess` imports from `animator.py`. Adds `min=0.1` constraint to `status` command's `refresh_rate`. Refactors `[BLOCKED]` `status_str` assignment. Adds UTF-8 stdout/stderr reconfiguration and ASCII banner/next-steps formatting to `demo.py`. Prints `var_table` and "Diagnostics complete" in `verify_environment()`.
CLI status test and test_cost.py housekeeping `tests/test_cli_status.py`, `tests/test_cost.py`	Adds `test_status_command_invalid_refresh_rate` asserting `server status --refresh 0` returns non-zero. Moves `import time` to top of `test_cost.py` and removes the duplicate inline occurrence.

Sequence Diagram

sequenceDiagram
  participant Caller
  participant run as run_cmd.run()
  participant subprocess
  Caller->>run: run(args, check_args=True)
  run->>run: validate args against ALLOWED_ARG_RE
  alt forbidden chars found
    run-->>Caller: raise CommandError(invalid chars)
  end
  run->>subprocess: subprocess.run(args, shell=False, capture_output=True, text=True)
  alt returncode != 0
    subprocess-->>run: CompletedProcess(returncode>0)
    run-->>Caller: raise CommandError(returncode, stdout, stderr)
  else FileNotFoundError / OSError
    subprocess-->>run: raise exception
    run-->>Caller: raise CommandError("execution failed")
  else success
    subprocess-->>run: CompletedProcess(returncode=0)
    run-->>Caller: return CompletedProcess
  end

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

sreerevanth/AgentWatch#184: Touches the same verify_environment() output flow in agentwatch/cli/verify_env.py, specifically the environment table printing and "Diagnostics complete" message.
sreerevanth/AgentWatch#385: Modifies the same CLI subprocess wrapper implementation area—agentwatch/cli/_utils/run_cmd.py plus agentwatch/cli/_utils/__init__.py exports—to enforce secure, validated command execution behavior.

Suggested labels

security, infra, Hard, SSoC26, high-priority

Suggested reviewers

sreerevanth

Poem

🐰 Hoppity-hop, no more shell=True,
The rabbit checked each arg right through!
ALLOWED_ARG_RE guards the gate,
CommandError raised if args aren't straight.
speak_welcome's gone, the CLI's clean—
The safest subprocess you've ever seen! 🔒

🚥 Pre-merge checks | ✅ 3 | ❌ 2

❌ Failed checks (1 warning, 1 inconclusive)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 52.94% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.
Title check	❓ Inconclusive	The title is truncated and incomplete (ends with '…'), making it vague about what security improvements are included beyond 'cross-platfo' suggestion.	Complete the title to clearly specify the main change, e.g. 'sec(cli): introduce secure command execution wrapper' or include the full intended message.

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Linked Issues check	✅ Passed	The PR implements all core requirements from issue `#332`: secure wrapper, argument validation, error handling, logging, tests, and README updates, successfully addressing command-injection vulnerabilities.
Out of Scope Changes check	✅ Passed	All changes are directly aligned with the narrowed scope of issue `#332`; speech synthesis was removed per reviewer feedback and only command-execution hardening remains.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@agentwatch/cli/_utils/run_cmd.py`:
- Around line 78-104: The logging calls are exposing sensitive information that
could persist in logs. At line 78, the logger.debug call logs the full command
string which could contain tokens/secrets in the arguments. More critically, at
lines 99-104, the logger.error call logs complete stdout and stderr outputs
which could leak sensitive data. Remove or redact the sensitive payload details
from these logging calls. Either eliminate the full command string logging
entirely, log only the command name (args[0]) instead of the full args, and
reduce the stderr/stdout logging to a summary message at ERROR level (or move it
to DEBUG level) without including the raw output payloads.

In `@agentwatch/cli/_utils/speech.py`:
- Around line 33-52: The TTS subprocess calls in the speech synthesis function
lack timeouts, which can cause them to hang indefinitely. Add a timeout
parameter to all three run_cmd.run() calls: the macOS 'say' command call, the
Windows PowerShell SpeechSynthesis call, and the Linux espeak call. Choose an
appropriate timeout value (in seconds) that allows sufficient time for speech
synthesis to complete while preventing indefinite hangs. Apply the same timeout
consistently across all three platforms to ensure uniform behavior.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: d5e149ad-48b3-41e1-b6fe-0045727cd601

📥 Commits

Reviewing files that changed from the base of the PR and between d8dc2e8 and 7bdc28f.

📒 Files selected for processing (12)

README.md
agentwatch/cli/_utils/__init__.py
agentwatch/cli/_utils/run_cmd.py
agentwatch/cli/_utils/speech.py
agentwatch/cli/animator.py
agentwatch/cli/demo.py
agentwatch/cli/main.py
agentwatch/cli/verify_env.py
tests/test_channels.py
tests/test_cli_status.py
tests/test_cost.py
tests/test_run_cmd.py

💤 Files with no reviewable changes (1)

tests/test_channels.py

vedant-kawale-27 · 2026-06-16T13:29:28Z

hee @sreerevanth review the pr, and if everything looks good approve it
add SSoC badge and hardness level

github-actions · 2026-06-16T20:30:51Z

🧪 PR Test Results

Check	Result
Tests (`pytest tests/`)	❌ failure
Lint (`ruff check .`)	❌ failure
Coverage (`agentwatch`)	72.88%

_{Python 3.12 · commit 854bae2}

vedant-kawale-27 · 2026-06-17T13:06:33Z

hee @sreerevanth if everything is good to go then merge this pr and add SSocbadge and hardness level badge

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@agentwatch/cli/animator.py`:
- Around line 83-92: The subprocess.Popen call that executes the PowerShell
command is building the command string with unsanitized user input interpolated
directly, creating a script-injection vulnerability. Replace this legacy direct
subprocess.Popen execution with a secure wrapper function designed for safe
PowerShell command execution. Ensure that the user variable is properly escaped
and quoted within the PowerShell command to prevent injection attacks, or use
the secure wrapper that handles this escaping automatically. Reference the
secure execution path mentioned in the review and apply it to this code block
instead of the direct Popen call at line 87.

In `@agentwatch/cli/main.py`:
- Around line 851-852: The refresh_rate parameter in the function signature
(around line 851) uses typer.Option without any constraints, but the code
divides by this value later at lines 937-938, causing a division-by-zero crash
if --refresh 0 is passed. Add validation to the typer.Option call for the
refresh_rate parameter by using the gt (greater than) constraint to ensure the
value must be greater than 0, which will prevent invalid inputs at the CLI level
rather than causing runtime errors.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 61c0e89d-1c10-4736-8287-157dd96f8352

📥 Commits

Reviewing files that changed from the base of the PR and between 29be66c and f21a2f5.

📒 Files selected for processing (3)

agentwatch/cli/animator.py
agentwatch/cli/main.py
agentwatch/cli/verify_env.py

🚧 Files skipped from review as they are similar to previous changes (1)

agentwatch/cli/verify_env.py

coderabbitai

Caution

Inline review comments failed to post. This is likely due to GitHub's internal server error or limits when posting large numbers of comments. If you are seeing this consistently it is likely a permissions issue. Please check "Moderation" -> "Code review limits" under your organization settings.

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@agentwatch/cli/animator.py`:
- Around line 83-92: The subprocess.Popen call that executes the PowerShell
command is building the command string with unsanitized user input interpolated
directly, creating a script-injection vulnerability. Replace this legacy direct
subprocess.Popen execution with a secure wrapper function designed for safe
PowerShell command execution. Ensure that the user variable is properly escaped
and quoted within the PowerShell command to prevent injection attacks, or use
the secure wrapper that handles this escaping automatically. Reference the
secure execution path mentioned in the review and apply it to this code block
instead of the direct Popen call at line 87.

In `@agentwatch/cli/main.py`:
- Around line 851-852: The refresh_rate parameter in the function signature
(around line 851) uses typer.Option without any constraints, but the code
divides by this value later at lines 937-938, causing a division-by-zero crash
if --refresh 0 is passed. Add validation to the typer.Option call for the
refresh_rate parameter by using the gt (greater than) constraint to ensure the
value must be greater than 0, which will prevent invalid inputs at the CLI level
rather than causing runtime errors.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 61c0e89d-1c10-4736-8287-157dd96f8352

📥 Commits

Reviewing files that changed from the base of the PR and between 29be66c and f21a2f5.

📒 Files selected for processing (3)

agentwatch/cli/animator.py
agentwatch/cli/main.py
agentwatch/cli/verify_env.py

🚧 Files skipped from review as they are similar to previous changes (1)

agentwatch/cli/verify_env.py

🛑 Comments failed to post (2)

agentwatch/cli/animator.py (1)

83-92: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Replace legacy subprocess.Popen PowerShell execution with the secure wrapper and escape user input.

Line 84 builds a PowerShell command string with interpolated user, and Line 87 executes it via direct subprocess.Popen. This bypasses the new hardened execution path and leaves a script-injection surface in PowerShell command parsing.

Proposed direction

-    if sys.platform == "win32":
-        cmd = f"Add-Type -AssemblyName System.speech; $synth = New-Object System.Speech.Synthesis.SpeechSynthesizer; $synth.Rate = 1; $synth.Speak('Welcome {user}, to Agent Watch.')"
-        subprocess.Popen(
-            ["powershell", "-WindowStyle", "Hidden", "-Command", cmd],
-            stdout=subprocess.DEVNULL,
-            stderr=subprocess.DEVNULL,
-            creationflags=subprocess.CREATE_NO_WINDOW,
-        )
+    if sys.platform == "win32":
+        from agentwatch.cli._utils.run_cmd import run
+        safe_user = user.replace("'", "''")
+        cmd = (
+            "Add-Type -AssemblyName System.Speech; "
+            "$synth = New-Object System.Speech.Synthesis.SpeechSynthesizer; "
+            "$synth.Rate = 1; "
+            f"$synth.Speak('Welcome {safe_user}, to Agent Watch.')"
+        )
+        run(["powershell", "-WindowStyle", "Hidden", "-Command", cmd], check_args=False)

🧰 Tools

🪛 ast-grep (0.43.0)

[error] 86-91: Command coming from incoming request
Context: subprocess.Popen( # nosec # noqa: S603, S607
["powershell", "-WindowStyle", "Hidden", "-Command", cmd], # noqa: S607
stdout=subprocess.DEVNULL,
stderr=subprocess.DEVNULL,
creationflags=subprocess.CREATE_NO_WINDOW,
)
Note: [CWE-20].

(subprocess-from-request)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@agentwatch/cli/animator.py` around lines 83 - 92, The subprocess.Popen call
that executes the PowerShell command is building the command string with
unsanitized user input interpolated directly, creating a script-injection
vulnerability. Replace this legacy direct subprocess.Popen execution with a
secure wrapper function designed for safe PowerShell command execution. Ensure
that the user variable is properly escaped and quoted within the PowerShell
command to prevent injection attacks, or use the secure wrapper that handles
this escaping automatically. Reference the secure execution path mentioned in
the review and apply it to this code block instead of the direct Popen call at
line 87.

agentwatch/cli/main.py (1)

851-852: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Validate --refresh to prevent runtime crash.

Line 937 divides by refresh_rate, but Line 851 does not constrain the value. Passing --refresh 0 crashes with division-by-zero.

Suggested fix

-    refresh_rate: float = typer.Option(1.0, "--refresh", help="Refresh rate in seconds"),
+    refresh_rate: float = typer.Option(
+        1.0, "--refresh", min=0.1, help="Refresh rate in seconds (must be > 0)"
+    ),

Also applies to: 937-938

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@agentwatch/cli/main.py` around lines 851 - 852, The refresh_rate parameter in
the function signature (around line 851) uses typer.Option without any
constraints, but the code divides by this value later at lines 937-938, causing
a division-by-zero crash if --refresh 0 is passed. Add validation to the
typer.Option call for the refresh_rate parameter by using the gt (greater than)
constraint to ensure the value must be greater than 0, which will prevent
invalid inputs at the CLI level rather than causing runtime errors.

vedant-kawale-27 · 2026-06-17T18:55:21Z

hee @sreerevanth check it

vedant-kawale-27 · 2026-06-17T19:32:22Z

hee @sreerevanth everything is green and if everything is good merge it and add SSoC badge and hardness badge

sreerevanth · 2026-06-17T19:49:39Z

hey @vedant-kawale-27 Thanks for the contribution. The centralized command wrapper is a good direction, but this PR currently mixes security hardening with speech synthesis, CLI UX changes, demo updates, and documentation updates. I'd like to keep the scope focused on the command-execution hardening itself.

Before merge:

Remove or separate unrelated speech/animation changes into a follow-up PR.
Ensure all subprocess entry points consistently use the wrapper where appropriate.
Clarify the security model around check_args=False and document when bypassing validation is acceptable.
Confirm which existing command execution paths are now protected by the wrapper.

Once those are addressed, I'll take another look.

vedant-kawale-27 · 2026-06-18T02:29:05Z

Summary of Work Done

Speech Synthesis Removal: Deleted the speech.py module and removed all imports, invocations, and speech synthesis integrations across animator.py, verify_env.py, main.py, and demo.py.
Wrapper Documentation: Updated the docstring of the central command wrapper run() in run_cmd.py to document the security model of argument validation and when check_args=False is acceptable.
Test Cleanup: Removed speech-related tests from test_run_cmd.py while keeping all command verification tests.
Verification: Ran pytest locally; all 501 test cases passed successfully.

hee @sreerevanth check it

coderabbitai

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

tests/test_run_cmd.py (1)
35-60: ⚠️ Potential issue | 🟠 Major

Document or restrict characters allowed by ALLOWED_ARG_RE regex.

The current regex pattern allows several characters ($, &, parentheses, braces) that enable command substitution and chaining patterns ($(command), ${var}, &&, ||). While subprocess.run(..., shell=False) prevents shell interpretation at the subprocess level, the defense-in-depth validation should either:

Add explicit tests documenting that these characters are intentionally allowed (with a comment explaining why they are safe despite their shell metacharacter status), or

Restrict the regex to remove $, &, and other unnecessary characters if they are not required by legitimate use cases

The current test coverage for ;, |, >, and backticks should be complemented with tests clarifying the behavior of $(...), ${...}, &&, and || patterns to prevent future confusion about what the validator actually blocks.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/test_run_cmd.py` around lines 35 - 60, The
test_run_cmd_invalid_arguments function tests forbidden characters like
semicolons, pipes, redirects, and backticks, but does not test command
substitution patterns like `$(command)`, `${var}`, or logical operators like
`&&` and `||` that may be allowed by the ALLOWED_ARG_RE regex. Add test cases in
test_run_cmd_invalid_arguments that verify the behavior for these patterns
(either confirming they raise CommandError or documenting with comments why they
are intentionally allowed despite being shell metacharacters). If these
characters should be blocked, update the ALLOWED_ARG_RE regex pattern to exclude
`$`, `&`, parentheses, and braces, ensuring the validation strategy is explicit
and well-tested for all potentially dangerous shell metacharacters.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@agentwatch/cli/main.py`:
- Around line 851-858: Remove the redundant runtime validation check for
refresh_rate that checks if refresh_rate <= 0. The typer.Option definition with
min=0.1 already enforces that refresh_rate must be greater than or equal to 0.1
before the command function executes, making the subsequent if statement that
raises typer.BadParameter unnecessary. Delete the entire if block that validates
refresh_rate <= 0 since Typer will handle this validation at the parameter
level.

---

Outside diff comments:
In `@tests/test_run_cmd.py`:
- Around line 35-60: The test_run_cmd_invalid_arguments function tests forbidden
characters like semicolons, pipes, redirects, and backticks, but does not test
command substitution patterns like `$(command)`, `${var}`, or logical operators
like `&&` and `||` that may be allowed by the ALLOWED_ARG_RE regex. Add test
cases in test_run_cmd_invalid_arguments that verify the behavior for these
patterns (either confirming they raise CommandError or documenting with comments
why they are intentionally allowed despite being shell metacharacters). If these
characters should be blocked, update the ALLOWED_ARG_RE regex pattern to exclude
`$`, `&`, parentheses, and braces, ensuring the validation strategy is explicit
and well-tested for all potentially dangerous shell metacharacters.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: b5632d8f-92f2-4a02-bce2-3788629e6393

📥 Commits

Reviewing files that changed from the base of the PR and between f21a2f5 and 1168494.

📒 Files selected for processing (8)

agentwatch/cli/_utils/__init__.py
agentwatch/cli/_utils/run_cmd.py
agentwatch/cli/animator.py
agentwatch/cli/demo.py
agentwatch/cli/main.py
agentwatch/cli/verify_env.py
tests/test_cli_status.py
tests/test_run_cmd.py

💤 Files with no reviewable changes (2)

agentwatch/cli/verify_env.py
agentwatch/cli/demo.py

🚧 Files skipped from review as they are similar to previous changes (1)

agentwatch/cli/_utils/run_cmd.py

vedant-kawale-27 · 2026-06-18T12:46:01Z

hee @sreerevanth check it

coderabbitai

Actionable comments posted: 4

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@agentwatch/cli/_utils/__init__.py`:
- Around line 9-11: Remove the stale import of run_validated_command from the
run_cmd module on line 9, as this symbol does not exist in
agentwatch/cli/_utils/run_cmd.py and is causing an ImportError. Update the
__all__ list on line 11 to export only the symbols that actually exist in the
run_cmd module (such as run and CommandError instead of the non-existent
run_validated_command) to properly maintain the import contract for the _utils
package.

In `@agentwatch/cli/_utils/run_cmd.py`:
- Around line 149-154: The validation logic in the cmd_args checks currently
only enforces that arguments are provided as a list of strings, but is missing
the allowlist-based character validation that was part of the original security
model. Add validation after the existing type checks that enforces an allowlist
of permitted characters for each argument in cmd_args to properly harden the
command execution against untrusted CLI input. This allowlist validation should
examine each string argument and ensure all characters match the allowed set
defined by the module's security model before the subprocess.run call occurs.
- Around line 169-174: The exception handling in the except blocks for
subprocess.TimeoutExpired and subprocess.CalledProcessError is re-raising the
raw subprocess exceptions, which breaks the wrapper's contract. Instead of
re-raising these exceptions, convert them to CommandError with structured fields
that capture the relevant error information (such as the timeout duration from
TimeoutExpired and the return code and stderr from CalledProcessError). This
ensures callers only need to handle CommandError and have access to the
underlying error details through CommandError's structured fields.

In `@agentwatch/cli/main.py`:
- Around line 1554-1564: The RollbackEngine is instantiated with no state
initialization before calling rollback_session(), which causes the method to
fail because _session_checkpoints is empty and undiscoverable. Before calling
rollback_session() on the newly created RollbackEngine instance, add a
state-hydration step to load or index the existing checkpoints (such as invoking
a load method or similar initialization routine that populates
_session_checkpoints), or alternatively refactor to use the existing API or
server-based rollback path that already maintains active engine state to avoid
creating an empty RollbackEngine instance.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: e89a2cf9-7bdd-4974-8e60-cc82dcdcc779

📥 Commits

Reviewing files that changed from the base of the PR and between 1168494 and 854bae2.

📒 Files selected for processing (6)

README.md
agentwatch/cli/_utils/__init__.py
agentwatch/cli/_utils/run_cmd.py
agentwatch/cli/demo.py
agentwatch/cli/main.py
tests/test_run_cmd.py

✅ Files skipped from review due to trivial changes (1)

README.md

🚧 Files skipped from review as they are similar to previous changes (2)

agentwatch/cli/demo.py
tests/test_run_cmd.py

coderabbitai

Caution

Inline review comments failed to post. This is likely due to GitHub's internal server error or limits when posting large numbers of comments. If you are seeing this consistently it is likely a permissions issue. Please check "Moderation" -> "Code review limits" under your organization settings.

Actionable comments posted: 4

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@agentwatch/cli/_utils/__init__.py`:
- Around line 9-11: Remove the stale import of run_validated_command from the
run_cmd module on line 9, as this symbol does not exist in
agentwatch/cli/_utils/run_cmd.py and is causing an ImportError. Update the
__all__ list on line 11 to export only the symbols that actually exist in the
run_cmd module (such as run and CommandError instead of the non-existent
run_validated_command) to properly maintain the import contract for the _utils
package.

In `@agentwatch/cli/_utils/run_cmd.py`:
- Around line 149-154: The validation logic in the cmd_args checks currently
only enforces that arguments are provided as a list of strings, but is missing
the allowlist-based character validation that was part of the original security
model. Add validation after the existing type checks that enforces an allowlist
of permitted characters for each argument in cmd_args to properly harden the
command execution against untrusted CLI input. This allowlist validation should
examine each string argument and ensure all characters match the allowed set
defined by the module's security model before the subprocess.run call occurs.
- Around line 169-174: The exception handling in the except blocks for
subprocess.TimeoutExpired and subprocess.CalledProcessError is re-raising the
raw subprocess exceptions, which breaks the wrapper's contract. Instead of
re-raising these exceptions, convert them to CommandError with structured fields
that capture the relevant error information (such as the timeout duration from
TimeoutExpired and the return code and stderr from CalledProcessError). This
ensures callers only need to handle CommandError and have access to the
underlying error details through CommandError's structured fields.

In `@agentwatch/cli/main.py`:
- Around line 1554-1564: The RollbackEngine is instantiated with no state
initialization before calling rollback_session(), which causes the method to
fail because _session_checkpoints is empty and undiscoverable. Before calling
rollback_session() on the newly created RollbackEngine instance, add a
state-hydration step to load or index the existing checkpoints (such as invoking
a load method or similar initialization routine that populates
_session_checkpoints), or alternatively refactor to use the existing API or
server-based rollback path that already maintains active engine state to avoid
creating an empty RollbackEngine instance.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: e89a2cf9-7bdd-4974-8e60-cc82dcdcc779

📥 Commits

Reviewing files that changed from the base of the PR and between 1168494 and 854bae2.

📒 Files selected for processing (6)

README.md
agentwatch/cli/_utils/__init__.py
agentwatch/cli/_utils/run_cmd.py
agentwatch/cli/demo.py
agentwatch/cli/main.py
tests/test_run_cmd.py

✅ Files skipped from review due to trivial changes (1)

README.md

🚧 Files skipped from review as they are similar to previous changes (2)

agentwatch/cli/demo.py
tests/test_run_cmd.py

🛑 Comments failed to post (4)

agentwatch/cli/_utils/__init__.py (1)

9-11: ⚠️ Potential issue | 🔴 Critical | ⚡ Quick win

Remove stale run_validated_command export; it breaks _utils import contract.

Line 9 imports a symbol that is not present in agentwatch/cli/_utils/run_cmd.py, and Line 11 overwrites __all__ to that stale name. This can raise ImportError and hides run/CommandError from package exports.
Suggested fix
-from .run_cmd import run_validated_command
-
-__all__ = ["run_validated_command"]
+__all__ = ["run", "CommandError"]
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@agentwatch/cli/_utils/__init__.py` around lines 9 - 11, Remove the stale
import of run_validated_command from the run_cmd module on line 9, as this
symbol does not exist in agentwatch/cli/_utils/run_cmd.py and is causing an
ImportError. Update the __all__ list on line 11 to export only the symbols that
actually exist in the run_cmd module (such as run and CommandError instead of
the non-existent run_validated_command) to properly maintain the import contract
for the _utils package.

agentwatch/cli/_utils/run_cmd.py (2)

149-154: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Restore allowlist-based argument validation before subprocess.run.

The current validation only enforces list[str]; it no longer enforces the argument-character allowlist described by this module’s security model. That weakens command hardening for untrusted CLI input paths.

Suggested patch

 def run_validated_command(
     cmd_args: list[str], check: bool = True, timeout: int | float | None = 30.0
 ) -> tuple[int, str, str]:
@@
     if not all(isinstance(arg, str) for arg in cmd_args):
         raise ValueError("All command arguments must be strings.")
+
+    for arg in cmd_args:
+        if not ALLOWED_ARG_RE.fullmatch(arg):
+            raise CommandError(f"Invalid characters in command argument: {arg!r}")

Also applies to: 157-164

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@agentwatch/cli/_utils/run_cmd.py` around lines 149 - 154, The validation
logic in the cmd_args checks currently only enforces that arguments are provided
as a list of strings, but is missing the allowlist-based character validation
that was part of the original security model. Add validation after the existing
type checks that enforces an allowlist of permitted characters for each argument
in cmd_args to properly harden the command execution against untrusted CLI
input. This allowlist validation should examine each string argument and ensure
all characters match the allowed set defined by the module's security model
before the subprocess.run call occurs.

169-174: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Use CommandError consistently instead of re-raising raw subprocess exceptions.

Re-raising TimeoutExpired / CalledProcessError breaks the wrapper’s contract and forces callers to handle multiple exception types. Convert these to CommandError with structured fields.

Suggested patch

-    except subprocess.TimeoutExpired as e:
-        logger.error(f"Command timed out after {timeout} seconds: {e}")
-        raise
-    except subprocess.CalledProcessError as e:
-        logger.error(f"Command failed with exit code {e.returncode}: {e.stderr}")
-        raise
+    except subprocess.TimeoutExpired as e:
+        raise CommandError(
+            f"Command timed out after {timeout} seconds",
+            stdout=e.stdout,
+            stderr=e.stderr,
+        ) from e
+    except subprocess.CalledProcessError as e:
+        raise CommandError(
+            f"Command failed with exit code {e.returncode}",
+            returncode=e.returncode,
+            stdout=e.stdout,
+            stderr=e.stderr,
+        ) from e

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@agentwatch/cli/_utils/run_cmd.py` around lines 169 - 174, The exception
handling in the except blocks for subprocess.TimeoutExpired and
subprocess.CalledProcessError is re-raising the raw subprocess exceptions, which
breaks the wrapper's contract. Instead of re-raising these exceptions, convert
them to CommandError with structured fields that capture the relevant error
information (such as the timeout duration from TimeoutExpired and the return
code and stderr from CalledProcessError). This ensures callers only need to
handle CommandError and have access to the underlying error details through
CommandError's structured fields.

agentwatch/cli/main.py (1)

1554-1564: 🛠️ Refactor suggestion | 🟠 Major | 🏗️ Heavy lift

session rollback likely always fails due to unhydrated RollbackEngine state.

Line 1554 constructs a new RollbackEngine() with empty indexes, and Line 1563 calls rollback_session() immediately. In agentwatch/rollback/engine.py, rollback_session() reads _session_checkpoints; without a load/index step, existing checkpoints are not discoverable.

A reliable fix is to add a state-hydration path before rollback (or invoke the API/server rollback path that already has active engine state), then call rollback_session().
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@agentwatch/cli/main.py` around lines 1554 - 1564, The RollbackEngine is
instantiated with no state initialization before calling rollback_session(),
which causes the method to fail because _session_checkpoints is empty and
undiscoverable. Before calling rollback_session() on the newly created
RollbackEngine instance, add a state-hydration step to load or index the
existing checkpoints (such as invoking a load method or similar initialization
routine that populates _session_checkpoints), or alternatively refactor to use
the existing API or server-based rollback path that already maintains active
engine state to avoid creating an empty RollbackEngine instance.

sreerevanth · 2026-06-20T17:52:33Z

hee @sreerevanth check it

could have pinged me in discord

vedant-kawale-27 · 2026-06-20T18:34:18Z

hee @sreerevanth cleared the problem that cause the CI to fail

sec(cli): introduce secure command execution wrapper and cross-platfo…

7bdc28f

…rm speech alerts

Copilot AI review requested due to automatic review settings June 16, 2026 09:20

Copilot AI reviewed Jun 16, 2026

coderabbitai Bot reviewed Jun 16, 2026

View reviewed changes

Comment thread agentwatch/cli/_utils/run_cmd.py Outdated

Comment thread agentwatch/cli/_utils/speech.py Outdated

Fix

29be66c

Merge branch 'main' into main

f21a2f5

coderabbitai Bot reviewed Jun 17, 2026

View reviewed changes

Fix

87a64a1

Changes

1168494

coderabbitai Bot reviewed Jun 18, 2026

View reviewed changes

Comment thread agentwatch/cli/main.py Outdated

Fix

66152f7

Merge branch 'main' into main

854bae2

coderabbitai Bot reviewed Jun 19, 2026

View reviewed changes

sreerevanth added the not-ready-to-merge label Jun 20, 2026

vedant-kawale-27 added 2 commits June 20, 2026 23:46

Merge branch 'sreerevanth:main' into main

3fe58b9

Fix

5a6595a

Conversation

vedant-kawale-27 commented Jun 16, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Type of Change

Description

Proposed Changes

Related Issue

How Has This Been Tested?

Checklist

Summary by CodeRabbit

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot commented Jun 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

❌ Failed checks (1 warning, 1 inconclusive)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

vedant-kawale-27 commented Jun 16, 2026

Uh oh!

github-actions Bot commented Jun 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🧪 PR Test Results

Uh oh!

vedant-kawale-27 commented Jun 17, 2026

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

vedant-kawale-27 commented Jun 17, 2026

Uh oh!

vedant-kawale-27 commented Jun 17, 2026

Uh oh!

sreerevanth commented Jun 17, 2026

Uh oh!

vedant-kawale-27 commented Jun 18, 2026

Summary of Work Done

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

vedant-kawale-27 commented Jun 18, 2026

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

sreerevanth commented Jun 20, 2026

Uh oh!

vedant-kawale-27 commented Jun 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

vedant-kawale-27 commented Jun 16, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Jun 16, 2026 •

edited

Loading

github-actions Bot commented Jun 16, 2026 •

edited

Loading