Skip to main content

Run Modes control how DeepRails executes evaluations across Monitor and Defend. Every run uses two different LLMs in parallel to reduce bias and improve accuracy. The selected Run Mode determines which models are selected — from compact cost-efficient models to advanced reasoning models — so you can optimize your workflow.

Why Run Modes Matter

Not every task requires the same evaluation depth. A simple summarization prompt can be tested cost-effectively with smaller models, while multi-step reasoning (math generation, chained steps, or multi-task prompts) benefits from reasoning-capable models. DeepRails’ Run Modes let you tune this balance.
  • Always two models in parallel: Every evaluation uses two distinct LLMs to generate scores, avoiding single-model bias.
  • Reasoning vs. non-reasoning models: For complex prompts, modes that include reasoning models yield better accuracy and interpretability.
  • Available everywhere: Run Modes function the same across the Monitor and Defend APIs on all plans.

The Six Run Modes

Super Fast

High-speed lightweight checks. Rapid evaluations with minimal overhead, ideal for large-scale screening.
Accuracy  ●●○○○
Speed      ●●●●○

Fast (default)

Balanced speed and accuracy. The default mode — a strong balance of evaluation quality and throughput for most use cases.
Accuracy  ●●●●○
Speed      ●●●○○

Precision

Deep multi-model analysis. Recommended for complex prompts that benefit from reasoning.
Accuracy  ●●●●○
Speed      ●●○○○

Precision Codex

Code-optimized deep analysis. Recommended for code-based workflows that benefit from specialized code analysis.
Accuracy  ●●●●○
Speed      ●●○○○

Precision Max

Exhaustive multi-pass verification. Two reasoning models in parallel — best for mission-critical use cases where accuracy outweighs cost or latency.
Accuracy  ●●●●●
Speed      ●○○○○

Precision Max Codex

Ultimate code-aware verification. Two reasoning models with Codex-optimized deep analysis for the highest accuracy on code-based workflows.
Accuracy  ●●●●●
Speed      ●○○○○
Choosing whether to use reasoning models is often part of the prompt engineering process. If your task involves multi-step logic, mathematics, or complex instructions, Precision or Precision Max are recommended. If your task requires complex code analysis, use Precision Codex or Precision Max Codex.

Choosing the Right Run Mode

At a Glance

Run ModeIDAccuracySpeedStreamingWeb / File Search
Super Fastsuper_fast●●○○○●●●●○YesNo
Fast (default)fast●●●●○●●●○○YesYes
Precisionprecision●●●●○●●○○○YesYes
Precision Codexprecision_codex●●●●○●●○○○YesYes
Precision Maxprecision_max●●●●●●○○○○NoYes
Precision Max Codexprecision_max_codex●●●●●●○○○○NoYes

Detailed Comparison

NameDescriptionWhen to UseExample Use Case
Super FastHigh-speed lightweight checks; rapid evaluations with minimal overhead.Large-scale screening, early exploration, low-stakes triage.Screening 10,000 code-gen outputs to flag potential safety risks.
Fast (default)Balanced speed and accuracy; the default mode for most use cases.General-purpose workflows that need a good balance of quality and throughput.Monitoring daily regressions in a customer support chatbot.
PrecisionDeep multi-model analysis; strong reasoning coverage with balanced cost/latency.Complex prompts with logic/calculations or multi-step reasoning.Monitoring daily regressions in a legal research bot.
Precision CodexCode-optimized deep analysis; specialized for code-based workflows.Code reviews, software development tasks, and technical documentation requiring code analysis.Evaluating AI-generated code snippets for correctness and best practices.
Precision MaxExhaustive multi-pass verification; two reasoning models in parallel (very high cost and latency).Mission-critical evaluations, final QA sweeps, regulated or safety-sensitive domains.Compliance evaluation on a healthcare agent before production.
Precision Max CodexUltimate code-aware verification (highest accuracy, lowest speed).Complex code creation or refactoring tasks, debugging large systems, or similar software development tasks requiring the highest accuracy.Final security and correctness review of AI-generated code before merging to production.
Super Fast does not support Web Search or File Search capabilities. If your workflow or monitor has these enabled, requests using super_fast will be rejected. To resolve this, either switch to a run mode that supports these capabilities (fast, precision, precision_codex, precision_max, or precision_max_codex) or edit the workflow/monitor to disable Web Search and File Search. This mode does support Context Awareness — you can still pass context in your model_input.