feat(governance): tamper-evident audit log for RBAC + policy changes (#395) by Prateeks16 · Pull Request #450 · sreerevanth/AgentWatch

Prateeks16 · 2026-06-20T19:24:27Z

Addresses the outstanding acceptance criterion of #395 (CMP-005).

Context

The RBAC role model, permission map, deny-by-default checks, and SAML/RBAC API enforcement already shipped in #387 (Closes #377). The one #395 criterion left unmet was "any changes to RBAC or policies must be logged to a tamper-evident audit table" — RBACEngine recorded nothing, and the existing governance/engine.py audit log is append-only (not cryptographically tamper-evident).

Change

governance/audit_log.py — a hash-chained AuditLog. Each AuditRecord.record_hash is SHA-256 over its own fields and the previous record's hash, seeded from a genesis hash. Altering or reordering any past record breaks every subsequent hash, so AuditLog.verify() detects tampering even in memory.
RBACEngine now owns an audit log and records every add_user, set_role, and set_team_policy change (with an optional actor). Added set_role so role changes are captured explicitly with before/after values. Mutating methods stay backward compatible (actor is keyword-only and optional).
9 tests: chain linkage, tamper + reorder detection, and that each RBAC mutation is audited with an intact chain.

Acceptance criteria (#395)

✅ Audit logs capture role changes (tamper-evident).
Authentication via SAML SSO and endpoint role enforcement were already delivered in feat(api): wire CMP-005 SAML/RBAC enforcement into the API layer (Closes #377) #387.

Test

pytest tests/test_rbac_audit.py   # 9 passed
ruff check / format --check        # clean

Scope kept to the audit-log gap to avoid duplicating the merged #387 work. Full-fidelity OIDC/SAML (replacing the HMAC token stand-in) would be a separate, larger PR.

Note: the unrelated test_cli_* failures on main are the duplicate session_app issue fixed in #449, not introduced here.

Summary by CodeRabbit

New Features
- Added tamper-evident audit logging with SHA-256 hash chain verification for governance operations
- Integrated audit tracking into role-based access control for user management, role changes, and policy updates
Tests
- Added comprehensive test suite covering audit chain integrity and RBAC audit event validation

sreerevanth#395) CMP-005 already ships roles, permissions, and SAML/RBAC enforcement (sreerevanth#387). The remaining acceptance criterion — "any changes to RBAC or policies must be logged to a tamper-evident audit table" — was unaddressed. Add a hash-chained AuditLog where each record's hash covers its fields plus the previous record's hash, so altering or reordering any past entry breaks the chain and AuditLog.verify() returns False. RBACEngine now records every add_user, set_role, and set_team_policy change (with optional actor) to this log; the mutating methods stay backward compatible. set_role is added so role changes are captured explicitly with before/after.

Copilot

Copilot was unable to review this pull request because the user who requested the review has reached their quota limit.

coderabbitai · 2026-06-20T19:24:39Z

Warning

Review limit reached

@Prateeks16, we couldn't start this review because you've reached your PR review rate limit.

More reviews will be available in 42 minutes and 20 seconds. Learn how PR review limits work.

Your organization has used up its prepaid credits, and credit purchases are no longer available. Enable the review add-on in the billing tab to keep reviews running — you're only billed for reviews past your plan's rate limits ($0.25/file).

⌛ How to resolve this issue?

After more reviews become available, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

To avoid repeated limits, reduce automatic review volume by pausing incremental auto-reviews earlier, using label-based review opt-in, excluding WIP or generated PR titles, or requesting reviews manually when the PR is ready. If your team needs uninterrupted high-volume reviews, an organization admin can enable usage-based credits.

🚦 How do rate limits work?

CodeRabbit enforces per-developer PR review limits for each organization. Most developers receive the normal plan refill rate.

For paid Pro and Pro+ PR reviews, CodeRabbit uses adaptive limits for sustained high-volume activity. When a developer's recent PR review activity reaches the 95th percentile or higher among CodeRabbit users, the refill rate gradually slows as usage increases. The highest same-day bursts are limited more strictly.

Please see our Fair Usage Limits Policy for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 02ffc94d-cfbc-4133-aff7-3e10a6a8fb37

📥 Commits

Reviewing files that changed from the base of the PR and between 960ce15 and f07d1ff.

📒 Files selected for processing (2)

agentwatch/governance/audit_log.py
tests/test_rbac_audit.py

📝 Walkthrough

Walkthrough

Adds a new agentwatch/governance/audit_log.py module implementing an append-only, SHA-256 hash-chained in-memory audit log (AuditLog, AuditRecord, GENESIS_HASH). RBACEngine in rbac.py is updated to initialize an AuditLog and emit audit entries from add_user, a new set_role, and set_team_policy. Tests in tests/test_rbac_audit.py cover chain integrity, tampering detection, and RBAC event emission.

Changes

CMP-005 Hash-Chained Audit Log and RBAC Integration

Layer / File(s)	Summary
AuditLog data types, digest, and chain implementation `agentwatch/governance/audit_log.py`	Defines `GENESIS_HASH`, `_digest` (SHA-256 over canonical JSON), `AuditRecord` dataclass with `to_dict`, and `AuditLog` with `head_hash`, `append`, `records`, `verify`, and `__len__`. Each `append` computes and chains `record_hash` from all fields plus `prev_hash`; `verify` recomputes and checks every link.
RBACEngine audit wiring and mutator signature updates `agentwatch/governance/rbac.py`	Imports `AuditLog`, initializes `self.audit` in `RBACEngine.__init__`, adds optional `actor` keyword to `add_user` and `set_team_policy`, and implements `set_role` returning a boolean while logging the previous-to-new role transition.
Audit chain integrity and RBAC integration tests `tests/test_rbac_audit.py`	Tests genesis state, correct `prev_hash`/`record_hash` chaining, tampering via payload mutation and record reordering (both must fail `verify`), and RBAC integration events for `add_user`, `set_role` (found and not-found no-op), `set_team_policy`, and a multi-step history asserting record count and full chain integrity.

Sequence Diagram(s)

sequenceDiagram
  participant Client
  participant RBACEngine
  participant AuditLog
  participant _digest

  rect rgba(70, 130, 180, 0.5)
    note over Client,RBACEngine: RBAC Mutation
    Client->>RBACEngine: add_user(user, actor="admin")
    RBACEngine->>AuditLog: append("user.add", user_id, actor="admin", details={role})
    AuditLog->>_digest: SHA-256(fields + prev_hash)
    _digest-->>AuditLog: record_hash
    AuditLog-->>RBACEngine: AuditRecord
    RBACEngine-->>Client: None
  end

  rect rgba(60, 179, 113, 0.5)
    note over Client,RBACEngine: Role Change
    Client->>RBACEngine: set_role(user_id, new_role, actor="admin")
    RBACEngine->>AuditLog: append("role.change", user_id, details={from, to})
    AuditLog->>_digest: SHA-256(fields + prev_hash)
    _digest-->>AuditLog: record_hash
    AuditLog-->>RBACEngine: AuditRecord
    RBACEngine-->>Client: True
  end

  rect rgba(210, 105, 30, 0.5)
    note over Client,AuditLog: Chain Verification
    Client->>AuditLog: verify()
    loop each AuditRecord
      AuditLog->>_digest: recompute expected_hash
      _digest-->>AuditLog: expected_hash
      AuditLog->>AuditLog: check prev_hash linkage and record_hash match
    end
    AuditLog-->>Client: True / False
  end

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Poem

🐇 Hop, hop, the ledger grows,
Each record hash-chained as it goes,
A genesis zero starts the trail,
Tampering? Verify will fail!
The rabbit signs each audit right,
CMP-005 shines bright tonight. ✨

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 26.09% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly and concisely describes the main change: introducing a tamper-evident audit log for RBAC and policy changes, with issue reference.
Linked Issues check	✅ Passed	The PR implements the tamper-evident audit log requirement for RBAC changes specified in `#377`, completing the compliance/governance objective with cryptographic hash-chaining for immutable record-keeping.
Out of Scope Changes check	✅ Passed	All changes are focused on audit logging and RBAC integration; no unrelated modifications detected outside the stated scope of adding tamper-evident audit trails.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

github-actions · 2026-06-20T19:25:35Z

🧪 PR Test Results

Check	Result
Tests (`pytest tests/`)	❌ failure
Lint (`ruff check .`)	❌ failure
Coverage (`agentwatch`)	72.93%

_{Python 3.12 · commit f07d1ff}

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (2)

agentwatch/governance/audit_log.py (1)

117-135: ⚡ Quick win

Also verify sequence continuity.

verify() currently does not enforce seq monotonic continuity. Adding an index check strengthens integrity semantics and forensic reliability.

Suggested verification check

-        for record in self._records:
+        for expected_seq, record in enumerate(self._records):
+            if record.seq != expected_seq:
+                return False
             if record.prev_hash != prev_hash:
                 return False

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@agentwatch/governance/audit_log.py` around lines 117 - 135, The verify()
method in the audit log currently checks hash integrity but does not enforce
sequence number continuity. Add a check to verify that each record's seq field
is monotonically increasing by one from the previous record, starting from
sequence 0 or 1 after the GENESIS_HASH. Initialize an expected_seq variable
before the loop over self._records, and for each record, verify that record.seq
equals the expected_seq value before proceeding with the existing hash checks.
If the sequence is not continuous, return False. Increment expected_seq after
each successful record verification to ensure the next record has the correct
sequence number.

tests/test_rbac_audit.py (1)

36-36: ⚡ Quick win

Avoid making public mutability part of the test contract.

Line 36 tampering through records() implicitly requires that public accessor to expose mutable internals. Since Line 45 already uses private-state tampering for simulation, prefer the same pattern here.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/test_rbac_audit.py` at line 36, The test on line 36 is mutating log
records through the public records() accessor method, which violates
encapsulation by requiring public methods to expose mutable internals. To fix
this, replace the public accessor mutation pattern on line 36 with the same
private-state tampering approach already being used on line 45, directly
accessing the private internal state instead of going through the public
records() method. This maintains test isolation without exposing mutable
internals through the public API.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@agentwatch/governance/audit_log.py`:
- Around line 63-73: The to_dict() method exposes the live details dictionary
and the code returning AuditRecord objects at line 113 expose mutable internal
state that callers can modify, breaking the append-only contract. Fix this by
creating a deep copy of the details dictionary in the to_dict() method before
returning it, and by returning copies of AuditRecord objects instead of live
references where they are returned to callers. This ensures that external
mutations cannot corrupt the in-memory audit log.
- Around line 79-111: The AuditLog class has race conditions where concurrent
calls to append() and property accesses to head_hash can interleave, causing
sequence numbers and hash linkage to become inconsistent. Fix this by adding a
threading.RLock instance variable to the __init__ method, then wrap all access
to the _records attribute with the lock in the append() method, the head_hash
property, and any other methods that read or write _records (such as records()
and verify() if they exist) to ensure atomic operations and maintain chain
integrity.

---

Nitpick comments:
In `@agentwatch/governance/audit_log.py`:
- Around line 117-135: The verify() method in the audit log currently checks
hash integrity but does not enforce sequence number continuity. Add a check to
verify that each record's seq field is monotonically increasing by one from the
previous record, starting from sequence 0 or 1 after the GENESIS_HASH.
Initialize an expected_seq variable before the loop over self._records, and for
each record, verify that record.seq equals the expected_seq value before
proceeding with the existing hash checks. If the sequence is not continuous,
return False. Increment expected_seq after each successful record verification
to ensure the next record has the correct sequence number.

In `@tests/test_rbac_audit.py`:
- Line 36: The test on line 36 is mutating log records through the public
records() accessor method, which violates encapsulation by requiring public
methods to expose mutable internals. To fix this, replace the public accessor
mutation pattern on line 36 with the same private-state tampering approach
already being used on line 45, directly accessing the private internal state
instead of going through the public records() method. This maintains test
isolation without exposing mutable internals through the public API.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 722530f0-cf8c-4e92-a09a-1c6fd1d45947

📥 Commits

Reviewing files that changed from the base of the PR and between 7f01274 and 960ce15.

📒 Files selected for processing (3)

agentwatch/governance/audit_log.py
agentwatch/governance/rbac.py
tests/test_rbac_audit.py

…y check Address review feedback on the tamper-evident audit log: - records() and append() now return detached copies, and to_dict() copies the details dict, so callers cannot mutate stored records and bypass the append-only / tamper-evident contract. - Guard all _records access with a threading.RLock so concurrent appends keep sequence numbers and hash linkage consistent. - verify() now also enforces monotonic seq continuity, not just hash linkage. - Test tampering goes through private state; added a test that mutating a returned record leaves the stored chain intact.

Copilot AI review requested due to automatic review settings June 20, 2026 19:24

Copilot AI reviewed Jun 20, 2026

coderabbitai Bot reviewed Jun 20, 2026

View reviewed changes

Comment thread agentwatch/governance/audit_log.py

Comment thread agentwatch/governance/audit_log.py Outdated

Prateeks16 mentioned this pull request Jun 20, 2026

fix(cli): route all session subcommands through one Typer group #449

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(governance): tamper-evident audit log for RBAC + policy changes (#395)#450

feat(governance): tamper-evident audit log for RBAC + policy changes (#395)#450
Prateeks16 wants to merge 2 commits into
sreerevanth:mainfrom
Prateeks16:feat/rbac-audit-log-395

Prateeks16 commented Jun 20, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

coderabbitai Bot commented Jun 20, 2026 •

edited

Loading

Review limit reached

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Poem

❌ Failed checks (1 warning)

Uh oh!

github-actions Bot commented Jun 20, 2026 •

edited

Loading

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Prateeks16 commented Jun 20, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Context

Change

Acceptance criteria (#395)

Test

Summary by CodeRabbit

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot commented Jun 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review limit reached

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Poem

❌ Failed checks (1 warning)

Uh oh!

github-actions Bot commented Jun 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🧪 PR Test Results

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Prateeks16 commented Jun 20, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Jun 20, 2026 •

edited

Loading

github-actions Bot commented Jun 20, 2026 •

edited

Loading