LangGraph Architecture in Cadence¶

Overview¶

The Cadence system uses LangGraph to orchestrate multi-agent conversations through a sophisticated workflow that dynamically routes between different plugin agents. This document provides a comprehensive guide to understanding the architectural design, how the graph is conceptually constructed, and how the conversation flow is designed to be flexible and extensible.

Architecture Layers¶

The LangGraph implementation in Cadence follows a layered architecture approach:

graph TB
    subgraph "Application Layer"
        A[User Query] --> B[Orchestrator Entry Point]
        B --> C[Graph Execution]
        C --> D[Final Response]
    end

    subgraph "Orchestration Layer"
        E[Coordinator Node] --> F[Control Tools Node]
        F --> G[Plugin Agent Nodes]
        G --> H[Plugin Tool Nodes]
        H --> I[Synthesizer Node]
        E --> J[Suspend Node]
        E --> K[Timeout Handler]
    end

    subgraph "Plugin Layer"
        K[Math Agent] --> L[Math Tools]
        M[Search Agent] --> N[Search Tools]
        O[Info Agent] --> P[Info Tools]
    end

    subgraph "Infrastructure Layer"
        Q[State Management] --> R[LLM Factory]
        S[Plugin Manager] --> T[Graph Builder]
        U[ToolExecutionLogger] --> V[Agent Hop Counting]
        W[Message Filtering] --> X[Safety Validation]
        Y[Structured Response Handler] --> Z[Response Context Builder]
        AA[Model Factory] --> BB[Timeout Handler]
    end

Graph Construction Design¶

The graph construction follows a systematic 6-phase design approach that ensures proper setup and integration of all components.

Phase 1: Graph Initialization¶

The process begins with creating a new state graph instance that will manage the conversation flow.

Design Principles:

Creates a new state graph instance with conversation state schema
Associates it with the agent state schema for type safety
Prepares the graph for dynamic node and edge additions

Phase 2: Core Node Registration¶

The orchestrator starts by adding four essential nodes that form the backbone of the conversation flow:

graph LR
    subgraph "Core Nodes"
        A[coordinator] --> B[control_tools]
        A --> C[suspend]
        A --> D[finalizer]
    end

    A --> E[Entry Point]
    C --> F[END]
    D --> F

Core Node Design:

Coordinator Node: Main decision-making hub that analyzes user queries and routes to appropriate agents
Control Tools Node: Manages routing tools that direct conversation flow to specific plugin agents
Suspend Node: Handles graceful termination when hop limits are exceeded with tone-aware messaging
Synthesizer Node: Synthesizes conversation results into coherent final responses with structured handling

Phase 3: Plugin Node Integration¶

Dynamic plugin nodes are discovered and integrated based on registered plugins in the system:

graph TB
    subgraph "Plugin Integration"
        A[Plugin Manager] --> B[Get Plugin Bundles]
        B --> C[Extract Nodes & Edges]
        C --> D[Add Plugin Nodes]
        C --> E[Add Plugin Edges]
    end

    subgraph "Plugin Nodes Example"
        F[math_agent] --> G[math_tools]
        H[search_agent] --> I[search_tools]
        J[info_agent] --> K[info_tools]
    end

Plugin Integration Design:

Dynamic Discovery: Plugin manager discovers available plugin bundles
Node Extraction: Each plugin bundle provides agent and tool nodes
Graph Integration: Nodes are dynamically added to the conversation graph
Edge Configuration: Plugin bundles define their own routing logic

Phase 4: Routing Edge Establishment¶

The routing network creates the decision tree that guides conversation flow:

graph TB
    subgraph "Coordinator Routing"
        A[coordinator] --> B{Decision Logic}
        B -->|continue| C[control_tools]
        B -->|suspend| D[suspend]
        B -->|done| E[finalizer]
    end

    subgraph "Control Tools Routing"
        C --> F{Tool Result}
        F -->|math_agent| G[math_agent]
        F -->|search_agent| H[search_agent]
        F -->|info_agent| I[info_agent]
        F -->|finalize| E
    end

    subgraph "Plugin Flow with Conditional Routing"
        G --> J{should_continue}
        H --> K{should_continue}
        I --> L{should_continue}
        J -->|continue| M[math_tools]
        J -->|back| N[coordinator]
        K -->|continue| O[search_tools]
        K -->|back| N
        L -->|continue| P[info_tools]
        L -->|back| N
        M --> N
        O --> N
        P --> N
    end

    D --> Q[END]
    E --> Q

Routing Design Principles:

Conditional Edges: Agent routing decisions based on should_continue method
Direct Edges: Tools always route to coordinator (prevents circular routing)
No Circular Routing: Eliminated the tools → agent edge that caused infinite loops
Dynamic Edge Creation: Plugin bundles define their own routing logic

Phase 5: Entry Point Configuration¶

The graph needs a starting point for all conversations.

Design Principles:

Every conversation starts at the coordinator node
The coordinator analyzes the user query and makes routing decisions
This creates a consistent entry point for all conversations

Phase 6: Graph Compilation¶

The final step compiles the graph for execution.

graph LR
    subgraph "Compilation"
        A[Raw Graph] --> B[Compile]
        B --> C[Add Checkpointer]
        C --> D[Final Compiled Graph]
    end

    subgraph "Execution Ready"
        D --> E[Can be invoked]
        D --> F[Supports async operations]
        D --> G[State management ready]
    end

Compilation Design:

Graph Compilation: Converts the raw graph into an executable workflow
Checkpointer Integration: Optional state persistence for conversation continuity
Debug Information: Graph structure logging for development and debugging

Decision Logic Design:

Agent Decision Making¶

The system implements agent decision-making through a standardized decision method:

Decision Logic Design:

If the agent's response has tool calls → routes to tools for execution
If the agent's response has NO tool calls → returns control to coordinator
This ensures consistent routing behavior across all agents

Routing Implementation¶

The implementation is simple and elegant:

Implementation in BaseAgent:

@staticmethod
def should_continue(state: Dict[str, Any]) -> str:
    """Simple routing decision based on tool calls presence"""
    last_msg = state.get("messages", [])[-1] if state.get("messages") else None
    if not last_msg:
        return "back"

    tool_calls = getattr(last_msg, "tool_calls", None)
    return "continue" if tool_calls else "back"

Design Principles:

Pure Decision Logic: Check if agent response has tool_calls
Consistent Flow: All agent responses follow the same routing path through should_continue
Simple and Reliable: Clean routing logic without complexity
Standardized Interface: All plugins use the same decision method

Plugin Bundle Edge Configuration¶

The plugin bundles define their own routing logic through a standardized interface.

Edge Configuration (from SDKPluginBundle.get_graph_edges()):

def get_graph_edges(self) -> Dict[str, Any]:
    normalized_agent_name = str.lower(self.metadata.name).replace(" ", "_")
    return {
        "conditional_edges": {
            f"{normalized_agent_name}_agent": {
                "condition": self.agent.should_continue,  # Static method reference
                "mapping": {
                    "continue": f"{normalized_agent_name}_tools",  # Route to decorators
                    "back": "coordinator",  # Return to coordinator
                },
            }
        },
        "direct_edges": [(f"{normalized_agent_name}_tools", "coordinator")],  # Tools always return
    }

Edge Configuration Design:

Conditional Edges: Based on should_continue static method result
Direct Edges: Tools always route back to coordinator (prevents circular routing)
No Agent-to-Agent Routing: All routing goes through the coordinator
Standardized Naming: Plugin names normalized to {plugin_name}_agent and {plugin_name}_tools

Back Tool Integration¶

Each plugin bundle automatically includes a "back" tool created in the SDKPluginBundle constructor:

Implementation:

# From SDKPluginBundle.__init__()
@tool
def back() -> str:
    """Return control back to the coordinator."""
    return "back"


all_tools = tools + [back]  # Add to agent's decorators
self.tool_node = ToolNode(all_tools)  # Create ToolNode with all decorators

Design Principles:

Automatic Addition: Every plugin bundle gets a back tool automatically
Simple Implementation: Just returns "back" string for routing
ToolNode Integration: Included in the ToolNode along with agent's tools
Consistent Behavior: All plugins have the same back tool functionality

Structured Response Handling¶

The new orchestrator implementation includes sophisticated structured response handling capabilities that enhance the quality and consistency of conversation outputs.

Response Context Builder¶

The ResponseContextBuilder prepares context for different conversation nodes:

Key Features:

Tone Instruction: Extracts and formats tone preferences from conversation metadata
Plugin Suggestions: Collects response suggestions from plugins that were used during the conversation
Used Plugins Tracking: Maintains a list of plugins that participated in the conversation
Context Preparation: Builds comprehensive context for suspend and synthesizer nodes

Implementation:

def prepare_response_context(self, state: AgentState) -> tuple[str, list[str], str]:
    """Prepare common response context for suspend and synthesizer nodes."""
    metadata = StateHelpers.safe_get_metadata(state)
    requested_tone = metadata.get("tone", "natural") or "natural"
    tone_instruction = ResponseTone.get_description(requested_tone)

    plugin_context = StateHelpers.get_plugin_context(state)
    routing_history = plugin_context.get(PluginContextFields.ROUTING_HISTORY, [])
    used_plugins = list(set(routing_history))

    plugin_suggestions = self._collect_plugin_suggestions(used_plugins)
    suggestions_text = self._format_plugin_suggestions(plugin_suggestions)

    return tone_instruction, used_plugins, suggestions_text

Structured Response Handler¶

The StructuredResponseHandler provides multiple modes for generating structured responses:

Response Modes:

Model-based: Uses structured models with Pydantic schemas
Prompt-based: Uses JSON schema prompting with retry logic
Fallback: Direct model invocation when structured mode fails

Key Features:

Plugin Schema Integration: Automatically incorporates plugin response schemas
Retry Logic: Implements backoff retry for prompt-based structured responses
Response Extraction: Extracts content from structured responses
Error Handling: Graceful fallback to direct model invocation

Response Tone System¶

The system supports multiple response tones with detailed descriptions:

Available Tones:

Natural: Friendly, conversational style with casual language
Explanatory: Detailed, educational explanations with examples
Formal: Professional, structured language with clear organization
Concise: Brief, to-the-point responses focusing on essentials
Learning: Teaching approach with step-by-step guidance

Tone Implementation:

class ResponseTone(Enum):
    """Available response styles for conversation finalization."""

    NATURAL = "natural"
    EXPLANATORY = "explanatory"
    FORMAL = "formal"
    CONCISE = "concise"
    LEARNING = "learning"

    @property
    def description(self) -> str:
        """Return detailed description for this tone."""
        descriptions = {
            "natural": "Respond in a friendly, conversational way as if talking to a friend...",
            "explanatory": "Provide detailed, educational explanations that help users understand concepts...",
            # ... other tone descriptions
        }
        return descriptions.get(self.value, descriptions["natural"])

Message Compaction¶

The synthesizer includes intelligent message compaction for efficiency:

Compaction Modes:

Tool Mode: Compacts tool call/result chains into a single system message
System Mode: Injects compacted content directly into the system prompt
None: No compaction, uses all messages as-is

Compaction Features:

Smart Splitting: Splits messages at the last human message
Content Truncation: Limits compacted content to configurable character limits
Tool Call Processing: Handles AI messages with tool calls appropriately
Result Summarization: Summarizes tool results for context

Suspend Node Implementation¶

The suspend node provides intelligent handling of hop limits with context awareness.

Key Design Features:

Hop Detection: Hop limit detection with state tracking
Smart Hop Counting: Only agent calls increment the hop counter, not finalization calls
Context Preservation: Maintains conversation context while explaining the limit situation
Tone Adaptation: Respects user's requested tone preference in the suspension message
Safe Message Filtering: Prevents validation errors by filtering incomplete tool call sequences

Hop Limit Prompt Design:

The suspend node uses a prompt that provides better user experience:

User-Friendly Language: Explains limits without technical jargon
Accomplishment Acknowledgment: Explains what was accomplished based on gathered information
Best Possible Answer: Provides the best answer with available data
Continuation Suggestions: Suggests how to continue if the answer is incomplete
Tone Adaptation: Maintains the user's requested conversation tone

Hop Counting Logic:

The hop counting system ensures that only agent calls increment the hop counter:

Finalization Exclusion: goto_finalize calls don't increment hop counter
Agent Call Tracking: Only agent routing calls increment the counter
Accurate Limits: Prevents premature hop limit triggering

Coordinator Guardrails and Routing Limits¶

Consecutive Same-Agent Route Guard¶

The coordinator implements a guard to prevent repeatedly routing to the same agent too many times in a row.

Purpose: avoid unproductive loops where the coordinator keeps handing control to the same agent without progress
Trigger: when the same agent is selected consecutively beyond a configurable limit
Behavior: routes to the suspend node instead of continuing
Configuration: coordinator_consecutive_agent_route_limit (env: CADENCE_COORDINATOR_CONSECUTIVE_AGENT_ROUTE_LIMIT)

State tracking is maintained in plugin_context:

plugin_context.same_agent_consecutive_routes: running count of consecutive routes to the same agent
plugin_context.last_routed_agent: last selected agent name
Reset conditions: any goto_finalize decision or a change in selected agent

Coordinator Workflow Implementation¶

The coordinator follows a strict workflow implemented in _coordinator_node():

Coordinator Logic:

def _coordinator_node(self, state: AgentState) -> AgentState:
    # 1. Build dynamic prompt with available plugins
    plugin_descriptions = self._build_plugin_descriptions()
    tool_options = self._build_tool_options()
    coordinator_prompt = COORDINATOR_INSTRUCTIONS.format(...)

    # 2. Get coordinator's routing decision
    request_messages = [SystemMessage(content=coordinator_prompt)] + messages
    coordinator_response = self.coordinator_model.invoke(request_messages)

    # 3. Process routing decision and update counters
    if self.has_tool_calls({"messages": [coordinator_response]}):
        # Agent routing - increment hop counter and update consecutive routing
        current_agent_hops = self.calculate_agent_hops(current_agent_hops, tool_calls)
        plugin_context = self._update_consecutive_routes_counter(plugin_context, tool_calls)
    else:
        # No routing decision - force finalization
        coordinator_response.content = ""
        coordinator_response.tool_calls = [ToolCall(name="goto_finalize", args={})]
        plugin_context = self._reset_route_counters(plugin_context)

    return self._create_state_update(coordinator_response, current_agent_hops, updated_state)

Coordinator Safety Checks (in _coordinator_routing_logic()):

Hop Limit Check: if self._is_hop_limit_reached(state): return SUSPEND
Consecutive Agent Check: if self._is_consecutive_agent_route_limit_reached(state): return SUSPEND
Tool Calls Check: if self._has_tool_calls(state): return CONTINUE
Default: return DONE (route to finalizer)

Coordinator Prompt Contract (Strict Rules):

Choose exactly one route from available tools or finalize
Do not invent agents/tools, and do not perform tool work directly
Use the full conversation history; avoid redundant work if results already exist
Prefer continuity when the last agent is still the best fit

Complete Conversation Flow¶

High-Level Flow with Enhanced Conditional Routing¶

sequenceDiagram
    participant U as User
    participant O as Orchestrator
    participant C as Coordinator
    participant CT as Control Tools
    participant PA as Plugin Agent
    participant PT as Plugin Tools
    participant F as Finalizer
    participant S as Suspend Node

    U->>O: Ask Question
    O->>C: Start Conversation
    C->>C: Analyze & Decide Route

    alt Route to Control Tools
        C->>CT: Execute Routing Tool
        CT->>PA: Activate Plugin Agent
        PA->>PA: Process with LLM

        alt Agent has tool calls
            PA->>PA: should_continue = "continue"
            PA->>PT: Route to tools
            PT->>C: Execute tools & return to coordinator
        else Agent answers directly
            PA->>PA: Create fake "back" tool call
            PA->>PA: should_continue = "continue"
            PA->>PT: Route to tools
            PT->>C: Execute "back" tool & return to coordinator
        end

        C->>C: Evaluate Next Step
    else Route to Finalizer
        C->>F: Finalize Response
    else Route to Suspend
        C->>S: Handle Hop Limit
        S->>O: Return Response
    end

    O->>U: Return Response

Detailed Node Interactions with Routing¶

graph TB
    subgraph "Conversation Flow with Conditional Routing"
        A[User Input] --> B[Coordinator]
        B --> C{Decision}

        C -->|continue| D[control_tools]
        C -->|suspend| E[suspend]
        C -->|done| K[Finalizer]

        D --> G{Plugin Selection}
        G -->|math| H[math_agent]
        G -->|search| I[search_agent]
        G -->|info| J[info_agent]
        G -->|finalize| K

        subgraph "Agent Decision Making"
            H --> L{should_continue}
            I --> M{should_continue}
            J --> N{should_continue}

            L -->|continue| O[math_tools]
            L -->|back| P[coordinator]
            M -->|continue| Q[search_tools]
            M -->|back| P
            N -->|continue| R[info_tools]
            N -->|back| P

            O --> P
            Q --> P
            R --> P
        end

        E --> END[END]
        K --> END
    end

            subgraph "State Management"
        S[AgentState] --> T[agent_hops]
        S --> U[messages]
        S --> V[current_agent]
        S --> W[tone]
        S --> X[message_filtering]
        S --> Y[tool_execution_logging]
        S --> Z[plugin_context]
    end

Practical Examples¶

Example 1: Agent Answers Directly¶

User Query: "What is 2+2?"

Execution Flow:

sequenceDiagram
    participant U as User
    participant C as Coordinator
    participant CT as Control Tools
    participant MA as Math Agent
    participant MT as Math Tools

    U->>C: "What is 2+2?"
    C->>C: Analyze query, decide route
    C->>CT: goto_math_agent()
    CT->>MA: Activate math agent
    MA->>MA: Generate response "2+2=4"
    MA->>MA: Create fake "back" tool call
    MA->>MA: should_continue = "continue"
    MA->>MT: Route to tools
    MT->>MT: Execute "back" tool
    MT->>C: Return "back" to coordinator
    C->>C: Evaluate completion
    C->>CT: goto_finalize()
    CT->>F: Finalize response
    F->>U: "2+2=4"

Key Design Features:

Agent Decision: Uses standardized decision method for routing decisions
Consistent Flow: Always goes through tools node before coordinator
No Circular Routing: Tools route directly to coordinator, not back to agent
Suspend Node: Hop limit handling with user-friendly messages

Example 2: Agent Uses Tools¶

User Query: "Calculate 15 * 23"

Execution Flow:

sequenceDiagram
    participant U as User
    participant C as Coordinator
    participant CT as Control Tools
    participant MA as Math Agent
    participant MT as Math Tools

    U->>C: "Calculate 15 * 23"
    C->>C: Analyze query, decide route
    C->>CT: goto_math_agent()
    CT->>MA: Activate math agent
    MA->>MA: Generate tool call to calculator
    MA->>MA: should_continue = "continue"
    MA->>MT: Route to tools
    MT->>MT: Execute calculator tool
    MT->>C: Return result to coordinator
    C->>C: Evaluate completion
    C->>CT: goto_finalize()
    CT->>F: Finalize response
    F->>U: "15 * 23 = 345"

Benefits of the Routing System¶

1. Eliminated Circular Routing¶

Before: agent → tools → agent → tools → ... (infinite loop)
After: agent → tools → coordinator (clean, predictable flow)

2. State Management¶

All agent responses go through the same routing path
State updates happen consistently through the tools node
Debugging and monitoring capabilities
Plugin context tracking for routing history

3. Clear Intent Communication¶

Agent routing decisions are explicit through should_continue logic
Easy to understand and debug conversation flow
Predictable system behavior
Logging for routing decisions

4. Error Handling¶

Clear separation between agent decisions and tool execution
Error isolation and recovery
Consistent error handling patterns
Graceful degradation when agents fail

5. Plugin Integration¶

Plugin bundles define their own routing logic
Consistent interface for all plugins
Separation of concerns
Easy plugin development and testing

6. Suspend Node¶

Hop limit detection and counting
User-friendly limit explanations
Tone-aware suspension messages
Safe message filtering to prevent errors
Context preservation

Best Practices¶

1. Enhanced Agent Implementation¶

Always implement the standardized decision method properly
Clear system prompts that guide tool usage
Proper error handling and logging

2. Enhanced Tool Design¶

Tools should return meaningful results
Handle errors gracefully
Provide clear documentation
Include proper validation

3. Enhanced Plugin Structure¶

Follow the established plugin structure
Register plugins properly in the initialization
Include proper metadata and capabilities
Implement proper edge configuration

4. Enhanced Testing¶

Test both tool usage and direct answer scenarios
Verify routing behavior with different agent responses
Test error conditions and edge cases
Validate state management consistency

5. Monitoring¶

Monitor routing decisions and edge creation
Track plugin context and routing history
Monitor tool execution performance
Validate state consistency

Conclusion¶

The conditional routing system in Cadence provides a robust, predictable foundation for multi-agent conversations. By implementing intelligent agent decision-making, proper edge routing, and suspend node handling, we've eliminated circular routing issues while maintaining the flexibility and power of the multi-agent architecture.

The system ensures that:

All agent responses follow a consistent routing path
Circular routing is prevented through proper edge configuration
State management is predictable and debuggable
The conversation flow is clear and maintainable
Plugin integration is seamless and consistent
Error handling is robust and graceful

This implementation makes Cadence reliable, easy to debug, and maintainable while preserving all the features of the multi-agent orchestration system.