feat(api): detect duplicate/repeat PRs via GitHub Search API and appl… by ljahier · Pull Request #17 · stratorys/pratrol

ljahier · 2026-03-02T02:24:08Z

…y history penalty

Note

Adds a history penalty system that detects duplicate/repeat PRs via GitHub Search API. Three parallel API calls fetch rejected PRs by author (repo-scoped), by title keywords, and by author globally. Results feed a penalty calculator that reduces the combined score and can apply a patrol:repeat-offender label.

Written by Mendral for commit e5162c7.
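As a rough illustration of the penalty step described in the note above, here is a minimal sketch of how three rejection counts might feed a penalty that reduces the combined score. The struct and function names, the weights, the cap, and the repeat-offender threshold are all assumptions made for this example; the PR's actual calculator is not shown on this page.

```rust
/// Counts of previously rejected (closed, unmerged) PRs from the three searches.
/// Field names are hypothetical.
#[derive(Debug, Clone, Copy)]
struct RejectionHistory {
    by_author_in_repo: u32,
    by_title_keywords: u32,
    by_author_global: u32,
}

#[derive(Debug, PartialEq)]
struct HistoryResult {
    penalty: u32,
    is_repeat_offender: bool,
}

// Assumed weights and limits, chosen only for illustration.
const REPO_WEIGHT: u32 = 5;
const TITLE_WEIGHT: u32 = 3;
const GLOBAL_WEIGHT: u32 = 1;
const MAX_PENALTY: u32 = 30;
const REPEAT_OFFENDER_THRESHOLD: u32 = 3;

/// Turns rejection counts into a capped penalty and a repeat-offender flag.
fn evaluate_history(h: RejectionHistory) -> HistoryResult {
    let raw = h
        .by_author_in_repo
        .saturating_mul(REPO_WEIGHT)
        .saturating_add(h.by_title_keywords.saturating_mul(TITLE_WEIGHT))
        .saturating_add(h.by_author_global.saturating_mul(GLOBAL_WEIGHT));
    HistoryResult {
        penalty: raw.min(MAX_PENALTY),
        is_repeat_offender: h.by_author_in_repo >= REPEAT_OFFENDER_THRESHOLD,
    }
}

/// Reduces the combined score by the penalty without going below zero.
fn apply_penalty(combined_score: u32, penalty: u32) -> u32 {
    combined_score.saturating_sub(penalty)
}

fn main() {
    let history = RejectionHistory {
        by_author_in_repo: 3,
        by_title_keywords: 1,
        by_author_global: 4,
    };
    let result = evaluate_history(history);
    // 3*5 + 1*3 + 4*1 = 22 with the assumed weights.
    println!("{result:?}");
    println!("adjusted score: {}", apply_penalty(78, result.penalty));
}
```

Keeping the calculator a pure function of the three counts means it can be unit-tested without touching the GitHub Search API.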

pratrol

Pratrol Triage Brief: 🟢 High Confidence

Pratrol assessed this pull request across the primary risk vectors below.

Risk Vector	Score	Tier
Contributor Context	72/100	🟢 High
Logic Risk (Mistral)	82/100	🟢 High
Overall Confidence	78/100	🟢 High

Summary: Adds functionality to detect duplicate/repeat PRs and apply history penalties.

Key Signal: The PR introduces a new history evaluation system to penalize repeat PR rejections.

Recommendation: Review the new history evaluation logic and ensure it integrates well with existing systems.

cloudflare-workers-and-pages · 2026-03-02T02:24:55Z

Deploying pratrol with Cloudflare Pages

Latest commit:	`e5162c7`
Status:	✅ Deploy successful!
Preview URL:	https://5e8b4b14.pratrol.pages.dev
Branch Preview URL:	https://feat-triage-with-history.pratrol.pages.dev

mendral-app

🤖 Code Review

Assessment ⚠️

Two issues worth fixing before merge:

1. Query injection via PR title keywords: the keywords string is interpolated directly into a GitHub Search query, protected only by the surrounding quotes. A keyword string such as foo" repo:attacker/evil would break out of the quoted term and redirect the search to an arbitrary repo. The extract_title_keywords function strips non-alphanumeric chars, which mitigates most cases, but search_rejected_prs_by_title does no validation of its own before building the query. The quotes there only protect against whitespace, not against a keyword that itself contains a double quote (possible if the split/filter logic ever changes).
2. Repeat-offender label applied even when ensure_label fails: if ensure_label for the repeat-offender label returns an error, the code logs a warning and continues. labels_to_apply still contains REPEAT_OFFENDER_LABEL, so add_labels will attempt to apply a label that may not exist, and the failed GitHub API call surfaces only as a warn log. The tier label has the same structure but was pre-existing; the repeat-offender label is created on demand, so this failure mode is real.

mendral-app · 2026-03-02T02:30:22Z

+    ) -> Result<RejectedPrSearchResult, GitHubError> {
+        let query = format!("is:pr is:closed is:unmerged repo:{owner}/{repo} \"{keywords}\"");
+        self.search_rejected_prs(&query).await


🔒 security (medium)

The keywords string is interpolated directly into the search query. Although extract_title_keywords currently strips non-alphanumeric chars, the query construction relies on surrounding double-quotes for safety. If a keyword ever contains " (e.g. after a future change to the extractor), an attacker-controlled PR title could inject arbitrary search qualifiers like repo:other/repo. Escape or validate the keywords before interpolation.

📝 Suggested change

-    ) -> Result<RejectedPrSearchResult, GitHubError> {
-        let query = format!("is:pr is:closed is:unmerged repo:{owner}/{repo} \"{keywords}\"");
-        self.search_rejected_prs(&query).await
+    async fn search_rejected_prs_by_title(
+        &self,
+        keywords: &str,
+        owner: &str,
+        repo: &str,
+    ) -> Result<RejectedPrSearchResult, GitHubError> {
+        // Ensure keywords contain only alphanumeric/space chars before embedding in query
+        let safe_keywords: String = keywords
+            .chars()
+            .filter(|c| c.is_alphanumeric() || *c == ' ')
+            .collect();
+        let query = format!("is:pr is:closed is:unmerged repo:{owner}/{repo} \"{safe_keywords}\"");
+        self.search_rejected_prs(&query).await
+    }
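To make the risk concrete, the following standalone sketch builds the same query string with and without the character filter and feeds both a hostile keyword string. The helper names build_query_unsafe, build_query_safe, and sanitize_keywords are hypothetical and exist only for this illustration; no GitHub API call is made.

```rust
/// Builds the title search query by direct interpolation, as in the reviewed diff.
fn build_query_unsafe(owner: &str, repo: &str, keywords: &str) -> String {
    format!("is:pr is:closed is:unmerged repo:{owner}/{repo} \"{keywords}\"")
}

/// Keeps only alphanumeric characters and spaces (same filter as the suggestion).
fn sanitize_keywords(keywords: &str) -> String {
    keywords
        .chars()
        .filter(|c| c.is_alphanumeric() || *c == ' ')
        .collect()
}

/// Builds the query after filtering the keywords.
fn build_query_safe(owner: &str, repo: &str, keywords: &str) -> String {
    let safe_keywords = sanitize_keywords(keywords);
    format!("is:pr is:closed is:unmerged repo:{owner}/{repo} \"{safe_keywords}\"")
}

fn main() {
    // A keyword string that closes the quoted term and adds its own qualifier.
    let hostile = "foo\" repo:attacker/evil \"bar";

    // The injected repo: qualifier survives as a separate search term.
    println!("{}", build_query_unsafe("stratorys", "pratrol", hostile));

    // The quote, colon, and slash are dropped, so everything stays inside one quoted term.
    println!("{}", build_query_safe("stratorys", "pratrol", hostile));
}
```

Filtering inside the query builder, rather than relying on the caller, keeps the function safe even if the keyword extractor changes later.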

mendral-app · 2026-03-02T02:30:22Z

+                if history_result.is_repeat_offender
+                    && let Err(error) = client
+                        .ensure_label(
+                            owner,
+                            repo,
+                            REPEAT_OFFENDER_LABEL.to_owned(),
+                            REPEAT_OFFENDER_COLOR.to_owned(),
+                            REPEAT_OFFENDER_DESCRIPTION.to_owned(),
+                        )
+                        .await
+                {
+                    warn!(
+                        message = "Failed to ensure repeat-offender label exists.",
+                        triage_id = %triage_id,
+                        pr_number,
+                        %error,
+                    );
+                }
+
                if let Err(error) = client
-                    .add_labels(owner, repo, pr_number, vec![label_name])
+                    .add_labels(owner, repo, pr_number, labels_to_apply)


🐛 bug (medium)

When ensure_label for REPEAT_OFFENDER_LABEL fails, the code logs a warning but labels_to_apply still contains the label. The subsequent add_labels call will then attempt to apply a non-existent label, causing a silent API failure. Remove the label from labels_to_apply on ensure_label error.

📝 Suggested change

-                if history_result.is_repeat_offender
-                    && let Err(error) = client
-                        .ensure_label(
-                            owner,
-                            repo,
-                            REPEAT_OFFENDER_LABEL.to_owned(),
-                            REPEAT_OFFENDER_COLOR.to_owned(),
-                            REPEAT_OFFENDER_DESCRIPTION.to_owned(),
-                        )
-                        .await
-                {
-                    warn!(
-                        message = "Failed to ensure repeat-offender label exists.",
-                        triage_id = %triage_id,
-                        pr_number,
-                        %error,
-                    );
-                }
+                if history_result.is_repeat_offender {
+                    match client
+                        .ensure_label(
+                            owner,
+                            repo,
+                            REPEAT_OFFENDER_LABEL.to_owned(),
+                            REPEAT_OFFENDER_COLOR.to_owned(),
+                            REPEAT_OFFENDER_DESCRIPTION.to_owned(),
+                        )
+                        .await
+                    {
+                        Ok(()) => {}
+                        Err(error) => {
+                            warn!(
+                                message = "Failed to ensure repeat-offender label exists.",
+                                triage_id = %triage_id,
+                                pr_number,
+                                %error,
+                            );
+                            labels_to_apply.retain(|l| l != REPEAT_OFFENDER_LABEL);
+                        }
+                    }
+                }

                 if let Err(error) = client
                     .add_labels(owner, repo, pr_number, labels_to_apply)
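The core of this fix can be isolated in a small, synchronous sketch: when ensuring the label fails, drop it from the list before anything is sent to add_labels. The function labels_after_ensure and the use of String as the error type are assumptions made for this example; the label name comes from the PR description, while the real client and error types are not shown on this page.

```rust
const REPEAT_OFFENDER_LABEL: &str = "patrol:repeat-offender";

/// Returns the labels that are safe to apply, given the outcome of ensuring
/// that the repeat-offender label exists. Hypothetical helper for illustration.
fn labels_after_ensure(
    mut labels_to_apply: Vec<String>,
    ensure_result: Result<(), String>,
) -> Vec<String> {
    if let Err(error) = ensure_result {
        // Stand-in for the warn! call in the reviewed code.
        eprintln!("Failed to ensure repeat-offender label exists: {error}");
        // Drop the label so the later add_labels call cannot reference it.
        labels_to_apply.retain(|label| label.as_str() != REPEAT_OFFENDER_LABEL);
    }
    labels_to_apply
}

fn main() {
    let labels = vec![
        "patrol:trusted".to_owned(),
        REPEAT_OFFENDER_LABEL.to_owned(),
    ];

    // Label was created or already exists: both labels are kept.
    println!("{:?}", labels_after_ensure(labels.clone(), Ok(())));

    // Label could not be ensured: only the tier label remains.
    println!(
        "{:?}",
        labels_after_ensure(labels, Err("label creation failed".to_owned()))
    );
}
```

With this shape the tier label is still applied when creating the repeat-offender label fails, instead of the whole add_labels request being rejected.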

feat(api): detect duplicate/repeat PRs via GitHub Search API and appl…

e5162c7

…y history penalty

pratrol Bot reviewed Mar 2, 2026

pratrol Bot added the patrol:trusted PR author and content look trustworthy label Mar 2, 2026

ljahier merged commit 271f411 into main Mar 2, 2026
6 checks passed

ljahier deleted the feat/triage_with_history branch March 2, 2026 02:28

mendral-app Bot reviewed Mar 2, 2026

ljahier merged 1 commit into main from feat/triage_with_history
