signal
political discourse · cross-community mapping
live
Reddit·Twitter/X
Why this benchmark exists
Signal treats evaluation as falsification, not aesthetics. This page records where models fail, how failures appear in outputs, and what safeguards are applied so analyst decisions remain transparent and auditable.
Failure modes and mitigations
AreaKnown limitationCurrent mitigation
Stance detectionZero-shot NLI marks most informal posts as ambiguous.Expose ambiguity as a finding and pair with blind-spot examples + subreddit priors.
CoordinationTemporal+URL synchrony can include non-malicious cross-post behavior.Show confidence labels, normalization mode, and explicit innocent explanations.
Trend rankingViral score can overweight one-off posts.Show score + velocity + spread + post_count rationale side by side.
Origin inferenceFirst observed subreddit may differ from true narrative birthplace.Report confidence and time-to-mainstream with explicit uncertainty tier.