LOADING_

INITIALIZING SYSTEM...

Reasoning Modes Explained: Direct, CoT, GoT, and ReAct

CONCLAVE Inc./Jan 22, 2026/5 min read

COD3X agents have four reasoning modes. Each one changes how the AI thinks — not what it thinks about, but the process it uses to arrive at a decision. Choosing the right mode for the right task is the difference between an efficient agent and one that burns credits for no reason.

The Four Modes#

Direct#

Single-pass execution. No reasoning chain. The AI reads the goal and context, makes a decision, and acts.

How it works: One inference call. Goal + context in, decision out. No intermediate steps.

Credit cost: Lowest. One API call per decision.

Speed: Fastest. Sub-second decisions.

When to use:

Routine order placement ("place a limit buy at $60,000")
Simple data lookups ("what's the current funding rate?")
Pre-defined actions with no ambiguity
High-frequency triggers where speed matters more than analysis

When not to use:

Complex multi-factor trade analysis
Situations where you want to review the AI's logic
Anything where the wrong decision is costly

Direct mode is a hammer. Fast, cheap, no nuance. Perfect for nails, terrible for everything else.

Chain-of-Thought (CoT)#

Step-by-step reasoning. The AI walks through its analysis sequentially before making a decision.

How it works: The AI writes out its reasoning: "Step 1: Check trend direction. BTC is in an uptrend on 4h based on higher highs and higher lows. Step 2: Evaluate entry timing. RSI is at 52, pulling back from 68 — momentum is cooling but not bearish. Step 3: Check volume. Above 20-period average, confirming interest. Step 4: Decision — enter long with 3x leverage."

Credit cost: Moderate. One extended inference call with a longer output.

Speed: A few seconds. The reasoning chain takes time to generate.

When to use:

Complex trade analysis with multiple factors
When you want to review why the agent made a decision
Multi-indicator evaluation (ADX + RSI + EMA + volume)
Any situation where the reasoning matters as much as the outcome

When not to use:

Simple, unambiguous tasks (use Direct)
When you need to compare multiple strategies (use GoT)
Open-ended discovery tasks (use ReAct)

CoT is the workhorse. Most trading decisions benefit from step-by-step analysis.

Graph-of-Thought (GoT)#

Parallel strategy evaluation. The AI explores multiple approaches simultaneously and picks the strongest branch.

How it works: The AI spawns multiple reasoning paths:

Path A: "Momentum entry — ADX is strong, enter on pullback"
Path B: "Mean reversion — price is extended, fade the move"
Path C: "No trade — conflicting signals, sit this one out"

Each path is evaluated independently. The AI then compares them and selects the highest-conviction option.

Credit cost: High. Multiple parallel inference paths, each generating a full analysis.

Speed: Slower. Multiple paths must complete before comparison.

When to use:

When the optimal approach isn't obvious
Comparing strategies (momentum vs. mean reversion vs. hold)
Multi-timeframe analysis (15m says buy, 4h says sell — which wins?)
High-value decisions where credit cost is justified by decision quality

When not to use:

When the strategy is well-defined (use CoT)
Simple tasks (use Direct)
When credit budget is tight

GoT is the strategist. Use it when you genuinely don't know which approach is best and want the AI to evaluate alternatives.

ReAct (Reason + Act)#

Iterative observe-reason-act loops. The AI gathers information, reasons about it, takes an action, observes the result, and repeats.

How it works:

Observe: Check BTC funding rate → 0.04%
Reason: Elevated funding suggests longs are crowded. Check if there's a trade.
Act: Pull order book depth for BTC-PERP
Observe: Large ask wall at $63,000, thin bids below $62,500
Reason: Order book confirms potential for a flush. Short setup forming.
Act: Check correlated assets (ETH, SOL funding and positioning)
Observe: ETH funding also elevated, SOL neutral
Reason: BTC/ETH correlation high, both showing crowded longs. SOL uncorrelated — skip.
Act: Enter short BTC-PERP and ETH-PERP, skip SOL

Credit cost: Highest. Multiple inference calls per decision, each with tool invocations.

Speed: Slowest. Multiple round-trips between reasoning and action.

When to use:

Open-ended market scanning ("find the best setup across 50 pairs")
Complex research tasks ("analyze what top traders are doing and find patterns")
Adaptive strategies that need to gather information before deciding
Situations where the agent needs to explore before committing

When not to use:

Well-defined strategies with clear rules (use CoT)
Simple execution tasks (use Direct)
When credit budget is limited

ReAct is the researcher. It's expensive but thorough. Use it for discovery and exploration, not routine execution.

Comparison Table#

Mode	Inference Calls	Avg Credit Cost	Speed	Best For
Direct	1	$2	<1s	Routine execution, simple lookups
CoT	1 (extended)	$8	2-5s	Complex trade analysis, reviewable logic
GoT	3-5 (parallel)	$15	5-10s	Strategy comparison, multi-approach evaluation
ReAct	4-8 (iterative)	$22	10-30s	Market scanning, adaptive research, exploration

One Goal, One Mode#

With Sonnet 4.6 and Opus 4.6, a single goal can handle an entire scan-analyze-execute workflow in one run. The model holds the full context — scanning markets, analyzing setups, evaluating risk, and executing trades — without losing coherence. Most users now run one goal per trigger or asset.

Pick the reasoning mode that fits the job:

CoT for most single-asset strategies — scan, analyze, and execute in one pass
ReAct when the agent needs to explore unfamiliar territory or scan many markets
GoT when comparing fundamentally different approaches to the same opportunity
Direct for pure execution where the decision is already made externally

Goal chaining — wiring multiple goals together — is still available for emergency exits, cross-asset coordination, and failure recovery, but is no longer needed for typical trading workflows.

The Cost-Quality Tradeoff#

More expensive modes produce better decisions — but with diminishing returns. The jump from Direct to CoT is dramatic: the agent goes from blind execution to reasoned analysis. The jump from CoT to GoT is meaningful but smaller: the agent considers alternatives instead of committing to the first approach. The jump from GoT to ReAct is situational: valuable for discovery, marginal for well-defined strategies.

For most users, CoT is the default. It's the best ratio of decision quality to credit cost. Use GoT when the stakes are high or the approach is unclear. Use ReAct for scanning and discovery. Use Direct for execution-only tasks. For a data-driven comparison of all four modes on the same strategy, see our reasoning modes benchmark.

Four reasoning modes: Direct for speed, Chain-of-Thought for analysis, Graph-of-Thought for comparison, ReAct for discovery. Match the mode to the task.

ShareTwitter

Continue reading3 related

Education

Risk Management for AI Agents: How COD3X Protects Capital

AI agents operate 24/7 without fatigue. That is why they need harder risk limits than human traders. Ten configurable parameters, five defense layers.

Sep 22, 20255 min

Technical

Comparing Reasoning Modes: A Performance Analysis

Feb 6, 20266 min

Education

ICT Concepts for AI Agents: FVG, CHoCH, and Smart Money

Feb 14, 20265 min

←All posts

LOADING_

Reasoning Modes Explained: Direct, CoT, GoT, and ReAct

CONCLAVE Inc./Jan 22, 2026/5 min read

The Four Modes#

Direct#

Single-pass execution. No reasoning chain. The AI reads the goal and context, makes a decision, and acts.

How it works: One inference call. Goal + context in, decision out. No intermediate steps.

Credit cost: Lowest. One API call per decision.

Speed: Fastest. Sub-second decisions.

When to use:

Routine order placement ("place a limit buy at $60,000")
Simple data lookups ("what's the current funding rate?")
Pre-defined actions with no ambiguity
High-frequency triggers where speed matters more than analysis

When not to use:

Complex multi-factor trade analysis
Situations where you want to review the AI's logic
Anything where the wrong decision is costly

Direct mode is a hammer. Fast, cheap, no nuance. Perfect for nails, terrible for everything else.

Chain-of-Thought (CoT)#

Step-by-step reasoning. The AI walks through its analysis sequentially before making a decision.

Credit cost: Moderate. One extended inference call with a longer output.

Speed: A few seconds. The reasoning chain takes time to generate.

When to use:

Complex trade analysis with multiple factors
When you want to review why the agent made a decision
Multi-indicator evaluation (ADX + RSI + EMA + volume)
Any situation where the reasoning matters as much as the outcome

When not to use:

Simple, unambiguous tasks (use Direct)
When you need to compare multiple strategies (use GoT)
Open-ended discovery tasks (use ReAct)

CoT is the workhorse. Most trading decisions benefit from step-by-step analysis.

Graph-of-Thought (GoT)#

Parallel strategy evaluation. The AI explores multiple approaches simultaneously and picks the strongest branch.

How it works: The AI spawns multiple reasoning paths:

Path A: "Momentum entry — ADX is strong, enter on pullback"
Path B: "Mean reversion — price is extended, fade the move"
Path C: "No trade — conflicting signals, sit this one out"

Each path is evaluated independently. The AI then compares them and selects the highest-conviction option.

Credit cost: High. Multiple parallel inference paths, each generating a full analysis.

Speed: Slower. Multiple paths must complete before comparison.

When to use:

When the optimal approach isn't obvious
Comparing strategies (momentum vs. mean reversion vs. hold)
Multi-timeframe analysis (15m says buy, 4h says sell — which wins?)
High-value decisions where credit cost is justified by decision quality

When not to use:

When the strategy is well-defined (use CoT)
Simple tasks (use Direct)
When credit budget is tight

GoT is the strategist. Use it when you genuinely don't know which approach is best and want the AI to evaluate alternatives.

ReAct (Reason + Act)#

Iterative observe-reason-act loops. The AI gathers information, reasons about it, takes an action, observes the result, and repeats.

How it works:

Observe: Check BTC funding rate → 0.04%
Reason: Elevated funding suggests longs are crowded. Check if there's a trade.
Act: Pull order book depth for BTC-PERP
Observe: Large ask wall at $63,000, thin bids below $62,500
Reason: Order book confirms potential for a flush. Short setup forming.
Act: Check correlated assets (ETH, SOL funding and positioning)
Observe: ETH funding also elevated, SOL neutral
Reason: BTC/ETH correlation high, both showing crowded longs. SOL uncorrelated — skip.
Act: Enter short BTC-PERP and ETH-PERP, skip SOL

Credit cost: Highest. Multiple inference calls per decision, each with tool invocations.

Speed: Slowest. Multiple round-trips between reasoning and action.

When to use:

Open-ended market scanning ("find the best setup across 50 pairs")
Complex research tasks ("analyze what top traders are doing and find patterns")
Adaptive strategies that need to gather information before deciding
Situations where the agent needs to explore before committing

When not to use:

Well-defined strategies with clear rules (use CoT)
Simple execution tasks (use Direct)
When credit budget is limited

ReAct is the researcher. It's expensive but thorough. Use it for discovery and exploration, not routine execution.

Comparison Table#

Mode	Inference Calls	Avg Credit Cost	Speed	Best For
Direct	1	$2	<1s	Routine execution, simple lookups
CoT	1 (extended)	$8	2-5s	Complex trade analysis, reviewable logic
GoT	3-5 (parallel)	$15	5-10s	Strategy comparison, multi-approach evaluation
ReAct	4-8 (iterative)	$22	10-30s	Market scanning, adaptive research, exploration

One Goal, One Mode#

Pick the reasoning mode that fits the job:

CoT for most single-asset strategies — scan, analyze, and execute in one pass
ReAct when the agent needs to explore unfamiliar territory or scan many markets
GoT when comparing fundamentally different approaches to the same opportunity
Direct for pure execution where the decision is already made externally

Goal chaining — wiring multiple goals together — is still available for emergency exits, cross-asset coordination, and failure recovery, but is no longer needed for typical trading workflows.

The Cost-Quality Tradeoff#

Four reasoning modes: Direct for speed, Chain-of-Thought for analysis, Graph-of-Thought for comparison, ReAct for discovery. Match the mode to the task.

ShareTwitter

Continue reading3 related

Education

Risk Management for AI Agents: How COD3X Protects Capital

AI agents operate 24/7 without fatigue. That is why they need harder risk limits than human traders. Ten configurable parameters, five defense layers.

Sep 22, 20255 min

Technical

Comparing Reasoning Modes: A Performance Analysis

Feb 6, 20266 min

Education

ICT Concepts for AI Agents: FVG, CHoCH, and Smart Money

Feb 14, 20265 min

←All posts