feat: Enterprise: Add agentwatch shield command to block zero-day prompt injections dynamically #433
SHAURYASANYAL3 wants to merge 2 commits into
🧪 PR Test Results
Python 3.12 · commit 718851f
5cc5cab to 718851f
agentwatch shield command to block zero-day prompt injections dynamically
Resolves #419
Overview
This pull request adds the highly requested Enterprise feature, the `agentwatch shield` command, to the AgentWatch CLI so that zero-day prompt injections can be blocked dynamically.
Why do we need this?
For a 5-year-old: We need a superhero shield command to block bad guys who try to trick our robot into doing bad things!
For developers: Static safety checks aren't enough for advanced prompt injections (e.g., DAN, jailbreaks). We need dynamic, LLM-based firewalling for incoming requests.
What is it?
A new CLI command, `agentwatch shield enable`, that activates an inline threat-intelligence gateway. It intercepts prompts before they hit the model. This is a PAID Enterprise feature.
Suggestions for Implementation
`RiskLevel.CRITICAL` to `SafetyCheckData` when a zero-day payload is detected.
Implementation Notes 🛠️
`typer` framework in `agentwatch/cli/main.py`.
`rich`.
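The interception flow described in this PR (inspect each prompt before it reaches the model, and mark detected payloads as `RiskLevel.CRITICAL`) could look roughly like the sketch below. Everything here is an assumption made for illustration: the `RiskLevel` and `SafetyCheckData` definitions are stand-ins and may not match the real AgentWatch types, and `keyword_classifier`, `shield_check`, and `guarded_call` are hypothetical names. The keyword matcher is only a placeholder for the LLM-based detector the description calls for.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Optional, Tuple


class RiskLevel(Enum):
    # Stand-in enum (assumption); the real AgentWatch RiskLevel may differ.
    LOW = "low"
    CRITICAL = "critical"


@dataclass
class SafetyCheckData:
    # Stand-in fields (assumption); the real SafetyCheckData may differ.
    risk_level: RiskLevel
    reason: str


def keyword_classifier(prompt: str) -> bool:
    """Placeholder detector; a real shield would call an LLM-based classifier."""
    markers = ("ignore previous instructions", "you are now dan")
    lowered = prompt.lower()
    return any(marker in lowered for marker in markers)


def shield_check(
    prompt: str,
    is_injection: Callable[[str], bool] = keyword_classifier,
) -> SafetyCheckData:
    """Inspect a prompt and report CRITICAL when the detector flags it."""
    if is_injection(prompt):
        return SafetyCheckData(RiskLevel.CRITICAL, "possible prompt injection")
    return SafetyCheckData(RiskLevel.LOW, "no injection markers found")


def guarded_call(
    prompt: str,
    model: Callable[[str], str],
    is_injection: Callable[[str], bool] = keyword_classifier,
) -> Tuple[Optional[str], SafetyCheckData]:
    """Forward the prompt to the model only when the shield does not flag it."""
    result = shield_check(prompt, is_injection)
    if result.risk_level is RiskLevel.CRITICAL:
        return None, result  # blocked: the model is never invoked
    return model(prompt), result
```

Passing the detector as a callable keeps the gateway independent of any particular model provider, so the placeholder can be swapped for a real classifier without touching the blocking logic.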