Architecture Overview
This page maps how all core concepts relate to each other. Use it as a mental model for understanding how data and control flow through the framework.
The Big Picture
Every component lives inside VoiceSession. Two WebSocket connections bridge the client and LLM provider, with the framework orchestrating everything in between. The LLMTransport interface abstracts provider differences — Gemini Live and OpenAI Realtime are both supported.
Component Ownership
VoiceSession creates and manages every other component, forming a single ownership tree.
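A minimal sketch of that ownership, assuming constructor-based wiring. Only the names VoiceSession, EventBus, and SessionManager come from this page; the fields and construction order shown are hypothetical, not the real API.

```typescript
// Hypothetical sketch: VoiceSession as the root owner of every component.

class SessionManager {
  state = "CREATED"; // initial state, per the Session State Machine table
}

class EventBus {
  private handlers: Array<(event: string, payload: unknown) => void> = [];
  subscribe(fn: (event: string, payload: unknown) => void): void {
    this.handlers.push(fn);
  }
  emit(event: string, payload: unknown): void {
    this.handlers.forEach(fn => fn(event, payload));
  }
}

// VoiceSession constructs and owns everything else.
class VoiceSession {
  readonly events = new EventBus();
  readonly session = new SessionManager();
  // ...agents, LLMTransport, STTProvider, and memory would be owned here too
}

const vs = new VoiceSession();
console.log(vs.session.state); // a fresh session starts in CREATED
```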
How Agents, Tools, and the LLM Interact
Each agent provides its system instructions and tool set to the LLM. When the model calls a tool, the tool's configured execution mode determines how the call is run.
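One way to picture the dispatch is a sketch like the following. The mode names (`"blocking"` and `"background"`) and the function shape are illustrative assumptions, not the framework's real tool API:

```typescript
// Hypothetical execution-mode dispatch for a model-initiated tool call.
type ExecutionMode = "blocking" | "background";

interface ToolCall {
  name: string;
  args: Record<string, unknown>;
}

async function runTool(
  call: ToolCall,
  mode: ExecutionMode,
  impl: (args: Record<string, unknown>) => Promise<string>,
): Promise<string | undefined> {
  if (mode === "blocking") {
    // The turn waits: the result goes straight back to the model.
    return impl(call.args);
  }
  // Background: fire and forget; the result would surface later
  // (e.g. via an EventBus event) rather than blocking the turn.
  void impl(call.args).then(r => console.log(`[bg] ${call.name} -> ${r}`));
  return undefined;
}
```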
Data Flow: A Single Voice Turn
This is what happens when a user speaks and gets a response. Note that audio is forked to both the LLM and the STT provider simultaneously.
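The fork itself can be sketched as a tiny fan-out over audio sinks. The `AudioSink` interface here is an assumption for illustration; the real plumbing lives in the transport layer:

```typescript
// Hypothetical fan-out: every inbound audio chunk reaches every sink.
interface AudioSink {
  write(chunk: Uint8Array): void;
}

function forkAudio(sinks: AudioSink[]): AudioSink {
  return { write: chunk => sinks.forEach(s => s.write(chunk)) };
}

// Toy sinks standing in for the LLM transport and the STT provider.
const received: string[] = [];
const llm: AudioSink = { write: () => received.push("llm") };
const stt: AudioSink = { write: () => received.push("stt") };

forkAudio([llm, stt]).write(new Uint8Array([0, 1, 2]));
console.log(received); // both sinks saw the same chunk
```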
Agent Transfer Flow
When the model calls transferToAgent, the framework handles the transition. For Gemini, this requires a reconnect; for OpenAI, it uses an in-place session.update.
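The provider split can be summarized as a small strategy function. This is a sketch of the decision only; the function and provider-string names are assumptions:

```typescript
// Hypothetical strategy selection for an agent transfer.
type Provider = "gemini" | "openai";

function transferStrategy(provider: Provider): "reconnect" | "session.update" {
  // Gemini Live requires tearing down and reconnecting with the new agent's
  // config; OpenAI Realtime can swap instructions in place via session.update.
  return provider === "gemini" ? "reconnect" : "session.update";
}

console.log(transferStrategy("gemini")); // "reconnect"
console.log(transferStrategy("openai")); // "session.update"
```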
Memory Extraction Pipeline
The memory system runs alongside the conversation, extracting durable facts about the user.
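A sketch of what such an extraction pass might look like. The `Memory` shape and the idea of passing in a durability predicate are assumptions; the real pipeline presumably uses an LLM, not a predicate:

```typescript
// Hypothetical extraction pass: keep only durable facts, drop chit-chat.
interface Memory {
  fact: string;
  turn: number;
}

function extractMemories(
  transcript: string[],
  isDurable: (line: string) => boolean,
): Memory[] {
  return transcript
    .map((line, turn) => ({ fact: line, turn }))
    .filter(m => isDurable(m.fact));
}

const memories = extractMemories(
  ["My name is Ada", "What's the weather?"],
  line => line.startsWith("My name is"), // toy stand-in for the real extractor
);
console.log(memories.length); // 1 — only the durable fact survives
```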
Transcription Pipeline
User speech is transcribed through a dual-layer system: Chrome STT provides instant visual feedback on the client, while a server-side STTProvider produces the authoritative transcript stored in conversation history.
Key behaviors:
- Chrome STT shows what the user is saying in real-time (interim text, opacity 60%)
- When the server sends its authoritative transcript, it replaces the Chrome STT text in-place
- Orphaned Chrome STT interims (e.g., from assistant echo) are automatically removed on turn boundaries
- Exactly one server-side STT path is active at a time: either the transport's built-in transcription or an external STTProvider
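The replace-in-place and orphan-pruning behaviors above can be sketched as follows. The `TranscriptEntry` shape and function names are assumptions for illustration:

```typescript
// Hypothetical dual-layer transcript handling.
interface TranscriptEntry {
  id: string;
  text: string;
  interim: boolean; // true while only Chrome STT text is available
}

function applyServerTranscript(
  entries: TranscriptEntry[],
  id: string,
  text: string,
): void {
  const entry = entries.find(e => e.id === id);
  if (entry) {
    entry.text = text;     // authoritative text replaces the interim in place
    entry.interim = false; // rendered at full opacity from here on
  }
}

function pruneOrphans(entries: TranscriptEntry[]): TranscriptEntry[] {
  // On a turn boundary, interims that never received a server transcript
  // (e.g. assistant echo picked up by Chrome STT) are dropped.
  return entries.filter(e => !e.interim);
}

const entries: TranscriptEntry[] = [
  { id: "t1", text: "hel", interim: true },
  { id: "t2", text: "echo?", interim: true },
];
applyServerTranscript(entries, "t1", "hello world");
const finalEntries = pruneOrphans(entries); // t2 is orphaned and removed
```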
EventBus Wiring
All framework components communicate through the EventBus; hooks expose a curated subset of its events.
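One way this curation could work is an allowlist over bus topics. The topic names and `registerHook` function are hypothetical, not the framework's actual hook API:

```typescript
// Hypothetical EventBus with an allowlisted hook surface.
type Handler = (payload: unknown) => void;

class EventBus {
  private topics = new Map<string, Handler[]>();
  on(topic: string, fn: Handler): void {
    const list = this.topics.get(topic) ?? [];
    list.push(fn);
    this.topics.set(topic, list);
  }
  emit(topic: string, payload: unknown): void {
    (this.topics.get(topic) ?? []).forEach(fn => fn(payload));
  }
}

// Hooks may only subscribe to a curated subset of topics.
const HOOK_TOPICS: readonly string[] = ["turn.start", "turn.end", "tool.call"];

function registerHook(bus: EventBus, topic: string, fn: Handler): void {
  if (!HOOK_TOPICS.includes(topic)) {
    throw new Error(`"${topic}" is internal and not exposed to hooks`);
  }
  bus.on(topic, fn);
}

const bus = new EventBus();
const seen: unknown[] = [];
registerHook(bus, "tool.call", p => seen.push(p));
bus.emit("tool.call", { name: "search" });
```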
Transport Layer
The LLMTransport interface abstracts provider differences. An optional STTProvider handles user speech transcription independently from the LLM.
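A sketch of what these two abstractions might look like. The method names here are assumptions chosen to match this page's descriptions, not the real interface definitions; `FakeSTT` is a toy stand-in for testing:

```typescript
// Hypothetical transport-layer interfaces.
interface LLMTransport {
  connect(agentConfig: { instructions: string }): Promise<void>;
  sendAudio(chunk: Uint8Array): void;
  close(): Promise<void>;
}

interface STTProvider {
  transcribe(chunk: Uint8Array): void;
  onTranscript(fn: (text: string, final: boolean) => void): void;
}

// In-memory stand-in: echoes a fixed final transcript for any audio.
class FakeSTT implements STTProvider {
  private handler: ((text: string, final: boolean) => void) | null = null;
  transcribe(_chunk: Uint8Array): void {
    this.handler?.("hello", true);
  }
  onTranscript(fn: (text: string, final: boolean) => void): void {
    this.handler = fn;
  }
}

const sttProvider = new FakeSTT();
let heard = "";
sttProvider.onTranscript((text, final) => { if (final) heard = text; });
sttProvider.transcribe(new Uint8Array());
console.log(heard); // "hello"
```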
Session State Machine
The SessionManager tracks the connection lifecycle:
| State | ClientTransport | LLMTransport |
|---|---|---|
| CREATED | Not started | Not connected |
| CONNECTING | Listening | Connecting |
| ACTIVE | Forwarding audio | Streaming |
| TRANSFERRING | Buffering audio (Gemini) / Brief pause (OpenAI) | Reconnecting / session.update |
| RECONNECTING | Buffering audio | Reconnecting |
| CLOSED | Stopped | Disconnected |
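The table above can be mirrored as a transition map. The set of allowed transitions below is an assumption consistent with the listed states, not the SessionManager's verified rules:

```typescript
// Hypothetical transition map for the session lifecycle.
type SessionState =
  | "CREATED" | "CONNECTING" | "ACTIVE"
  | "TRANSFERRING" | "RECONNECTING" | "CLOSED";

const TRANSITIONS: Record<SessionState, SessionState[]> = {
  CREATED:      ["CONNECTING", "CLOSED"],
  CONNECTING:   ["ACTIVE", "CLOSED"],
  ACTIVE:       ["TRANSFERRING", "RECONNECTING", "CLOSED"],
  TRANSFERRING: ["ACTIVE", "CLOSED"],   // agent transfer completes or fails
  RECONNECTING: ["ACTIVE", "CLOSED"],   // buffered audio is flushed on success
  CLOSED:       [],                     // terminal
};

function canTransition(from: SessionState, to: SessionState): boolean {
  return TRANSITIONS[from].includes(to);
}
```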
How Concepts Connect
- Agents → Tools → Subagents: agents expose tools, and a tool can hand long-running work off to a subagent.
- Agents → Memory → Agents (cross-session): facts extracted during one session are surfaced to agents in later sessions.
Reading Order
If you're new to the framework, read the docs in this order:
- VoiceSession — The entry point. Understand how everything is wired.
- Agents — Define personalities and route conversations.
- Tools — Give agents the ability to take actions.
- Memory — Remember users across sessions.
- Events & Hooks — Observe and react to everything happening.
- Transport — Understand the audio and message plumbing.
- Subagent Patterns — Background execution for complex tasks.