Run Modes control how DeepRails executes evaluations across Monitor and Defend. Every run uses two different LLMs in parallel to reduce bias and improve accuracy. The Run Mode you choose determines which models are used, from compact cost-efficient models to advanced reasoning models, so you can trade off accuracy, speed, and cost for each workflow.
Why Run Modes Matter
Not every task requires the same evaluation depth. A simple summarization prompt can be tested cost-effectively with smaller models, while multi-step reasoning (math generation, chained steps, or multi-task prompts) benefits from reasoning-capable models. DeepRails’ Run Modes let you tune this balance.
- Always two models in parallel: Every evaluation uses two distinct LLMs to generate scores, avoiding single-model bias.
- Reasoning vs. non-reasoning models: For complex prompts, modes that include reasoning models yield better accuracy and interpretability.
- Available everywhere: Run Modes function the same across the Monitor and Defend APIs on all plans.
The Six Run Modes
Super Fast
High-speed lightweight checks. Rapid evaluations with minimal overhead, ideal for large-scale screening.
Accuracy ●●○○○
Speed ●●●●○
Fast (default)
Balanced speed and accuracy. The default mode — a strong balance of evaluation quality and throughput for most use cases.
Accuracy ●●●●○
Speed ●●●○○
Precision
Deep multi-model analysis. Recommended for complex prompts that benefit from reasoning.
Accuracy ●●●●○
Speed ●●○○○
Precision Codex
Code-optimized deep analysis. Recommended for code-based workflows that benefit from specialized code analysis.
Accuracy ●●●●○
Speed ●●○○○
Precision Max
Exhaustive multi-pass verification. Two reasoning models in parallel — best for mission-critical use cases where accuracy outweighs cost or latency.
Accuracy ●●●●●
Speed ●○○○○
Precision Max Codex
Ultimate code-aware verification. Two reasoning models with Codex-optimized deep analysis for the highest accuracy on code-based workflows.
Accuracy ●●●●●
Speed ●○○○○
Choosing whether to use reasoning models is often part of the prompt engineering process. If your task involves multi-step logic, mathematics, or complex instructions, use Precision or Precision Max. If your task requires complex code analysis, use Precision Codex or Precision Max Codex.
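The guidance above can be sketched as a small decision helper. This is only an illustration: the function name `recommend_run_mode` and its boolean parameters are hypothetical and not part of the DeepRails API. The returned strings are the Run Mode IDs listed in the At a Glance table.

```python
def recommend_run_mode(
    needs_reasoning: bool = False,
    code_heavy: bool = False,
    mission_critical: bool = False,
    bulk_screening: bool = False,
) -> str:
    """Return a Run Mode ID following the guidance in this section.

    Hypothetical helper for illustration; the flags are assumptions
    about how a caller might describe a task.
    """
    # Code-based workflows map to the Codex variants.
    if code_heavy:
        return "precision_max_codex" if mission_critical else "precision_codex"
    # Mission-critical work favors accuracy over cost or latency.
    if mission_critical:
        return "precision_max"
    # Multi-step logic, mathematics, or complex instructions.
    if needs_reasoning:
        return "precision"
    # Large-scale, low-stakes screening.
    if bulk_screening:
        return "super_fast"
    # Otherwise fall back to the default mode.
    return "fast"


if __name__ == "__main__":
    print(recommend_run_mode(needs_reasoning=True))
    print(recommend_run_mode(code_heavy=True, mission_critical=True))
```

The order of the checks is a design choice in this sketch: code analysis and mission-critical needs take priority over speed, so a task flagged as both `bulk_screening` and `mission_critical` still resolves to a Precision Max mode.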
Choosing the Right Run Mode
At a Glance
| Run Mode | ID | Accuracy | Speed | Streaming | Web / File Search |
|---|---|---|---|---|---|
| Super Fast | super_fast | ●●○○○ | ●●●●○ | Yes | No |
| Fast (default) | fast | ●●●●○ | ●●●○○ | Yes | Yes |
| Precision | precision | ●●●●○ | ●●○○○ | Yes | Yes |
| Precision Codex | precision_codex | ●●●●○ | ●●○○○ | Yes | Yes |
| Precision Max | precision_max | ●●●●● | ●○○○○ | No | Yes |
| Precision Max Codex | precision_max_codex | ●●●●● | ●○○○○ | No | Yes |
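
If streaming or web / file search is a hard requirement, the table above can be encoded as a lookup and filtered. The following is a minimal sketch: the `RunMode` type, the `RUN_MODES` mapping, and `modes_supporting` are hypothetical names used only for this example. The values are copied from the table and are not fetched from any DeepRails endpoint.

```python
from typing import Dict, List, NamedTuple


class RunMode(NamedTuple):
    name: str
    streaming: bool  # "Streaming" column
    search: bool     # "Web / File Search" column


# Values transcribed from the At a Glance table.
RUN_MODES: Dict[str, RunMode] = {
    "super_fast": RunMode("Super Fast", streaming=True, search=False),
    "fast": RunMode("Fast (default)", streaming=True, search=True),
    "precision": RunMode("Precision", streaming=True, search=True),
    "precision_codex": RunMode("Precision Codex", streaming=True, search=True),
    "precision_max": RunMode("Precision Max", streaming=False, search=True),
    "precision_max_codex": RunMode("Precision Max Codex", streaming=False, search=True),
}


def modes_supporting(require_streaming: bool = False,
                     require_search: bool = False) -> List[str]:
    """Return Run Mode IDs that satisfy every required capability."""
    return [
        mode_id
        for mode_id, mode in RUN_MODES.items()
        if (not require_streaming or mode.streaming)
        and (not require_search or mode.search)
    ]


if __name__ == "__main__":
    # Modes that offer both streaming and web / file search.
    print(modes_supporting(require_streaming=True, require_search=True))
```
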
Detailed Comparison
| Name | Description | When to Use | Example Use Case |
|---|---|---|---|
| Super Fast | High-speed lightweight checks; rapid evaluations with minimal overhead. | Large-scale screening, early exploration, low-stakes triage. | Screening 10,000 code-gen outputs to flag potential safety risks. |
| Fast (default) | Balanced speed and accuracy; the default mode for most use cases. | General-purpose workflows that need a good balance of quality and throughput. | Monitoring daily regressions in a customer support chatbot. |
| Precision | Deep multi-model analysis; strong reasoning coverage with balanced cost/latency. | Complex prompts with logic/calculations or multi-step reasoning. | Monitoring daily regressions in a legal research bot. |
| Precision Codex | Code-optimized deep analysis; specialized for code-based workflows. | Code reviews, software development tasks, and technical documentation requiring code analysis. | Evaluating AI-generated code snippets for correctness and best practices. |
| Precision Max | Exhaustive multi-pass verification; two reasoning models in parallel (very high cost and latency). | Mission-critical evaluations, final QA sweeps, regulated or safety-sensitive domains. | Compliance evaluation on a healthcare agent before production. |
| Precision Max Codex | Ultimate code-aware verification (highest accuracy, lowest speed). | Complex code creation or refactoring tasks, debugging large systems, or similar software development tasks requiring the highest accuracy. | Final security and correctness review of AI-generated code before merging to production. |
