Reasoning Modes

Crucible exposes three reasoning modes that let you control the tradeoff between speed, depth, and cost per request.

Standard

The default mode. The model processes your input and returns a response without an explicit reasoning chain. Best for classification, labeling, summarization, and extraction tasks where the answer is relatively direct.

Deep

The model works through the problem step by step before producing a final output. The full reasoning chain is returned in the reasoning_trace field of the response. Deep mode is best for multi-step analysis, complex document review, and tasks where accuracy matters more than latency.

Fast

Optimized for latency-sensitive applications. The model produces a response with minimal processing overhead. Best for high-volume, lower-complexity tasks where throughput is the priority.

Choosing a mode

Task type

Recommended mode

Document extraction

Deep

Classification at volume

Fast

Research synthesis

Deep

Real-time query answering

Standard or Fast

Contract review

Deep

Structured data parsing

Standard

You can switch modes per request by passing the reasoning_mode parameter in your API call.

Create a free website with Framer, the website builder loved by startups, designers and agencies.