Ruler does not consistently restore `for` state #6465
Labels
component/rules
Bits & bobs to do with rules and alerts: the ruler, config service, etc.
Description
Currently, the Prometheus rule manager only restores the `for` state of rule groups after a restart. This is fine for Prometheus. However, in Cortex, rule groups can move from one ruler instance (r1) to another (r2) due to resharding. If r2 is already evaluating rule groups for that tenant, the manager will not restore the `for` state, which leaves alerts in an incorrect state. For example, an alert can go from `FIRING` to `PENDING`.
To Reproduce
FIRING
PENDING
Expected behavior
`FIRING` state
Additional Context
There is a PR open for Prometheus to address this issue. Until that PR is approved, this issue is difficult to fix.