indexer,api: log non-actionable failures at warn, not error #679

Open
nikw9944 wants to merge 1 commit into main from indexer-api/lower-nonactionable-log-levels

nikw9944 commented Jul 1, 2026

Contributor

Summary

The lake-api-errors / lake-indexer-errors alerts fire on true ERR-level log lines. A few code paths log at Error for conditions that are self-recovering or already handled, so they raise alerts even though no human action is needed. This PR lowers those to Warn so that only genuinely actionable failures alert.

  • "workflow interrupted, reattaching" (dzingest.go, rollup.go, solingest.go): the long-running workflow future returned an error and the code reattaches, which is routine on worker restarts and deploys. Real crash-loops are already covered by the separate lake-indexer-crash-loop / -down alerts. Error → Warn.
  • "swap rate: oracle fetch failed, serving stale cache" (swap_rate.go): a handled fallback that serves the cached rate and returns success. Error → Warn. The sibling "no cached fallback" path (which returns 502) stays at Error.

No behavior change beyond log severity.

Testing Verification

  • Verified against a 24h Loki inventory of true ERR lines: these messages appear there today (i.e. they currently alert) and are non-actionable.

Follow-ups (not in this PR)

  • Rollup activity failures emit Temporal Activity error at ERR (staging). The clean fix (mirror dzingest's return-nil + failure-threshold) must stay backfill-safe (BackfillRollupWorkflow relies on error propagation/retries), so it needs a live-vs-backfill-aware design + tests.

@nikw9944 nikw9944 requested a review from thijsvanemmerik July 1, 2026 17:53