feat(realtime): reuse realtime session across agent handoffs if supported #5229

Merged

longcw merged 23 commits into main from longc/reuse-rt-session-across-handoffs
Apr 10, 2026

Conversation

@longcw
Contributor

@longcw longcw commented Mar 26, 2026

When an agent hands off to another agent that uses the same RealtimeModel, reuse the existing WebSocket session instead of closing and reopening a new one. The session is detached from the old activity (event listeners removed), transferred to the new activity, and reconfigured via update_session — updating instructions, chat context, and tools as needed based on the model's mid_session_* capabilities.
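The gating and transfer logic described above can be sketched roughly like this (a minimal illustration, not the actual PR code — the capability names follow the mid_session_* flags discussed below, but the helper and its exact checks are assumptions):

```python
# Hypothetical sketch: a realtime session is handed to the next activity only
# when both agents share the same RealtimeModel instance and the model supports
# mid-session reconfiguration. Field and function names are illustrative.
from dataclasses import dataclass


@dataclass
class Capabilities:
    mid_session_instructions_update: bool = False
    mid_session_chat_ctx_update: bool = False
    mid_session_tools_update: bool = False


def can_reuse_session(old_llm: object, new_llm: object, caps: Capabilities) -> bool:
    """Reuse only if both activities share the same RealtimeModel instance
    and the model can be reconfigured without reconnecting."""
    return (
        old_llm is new_llm
        and caps.mid_session_instructions_update
        and caps.mid_session_chat_ctx_update
        and caps.mid_session_tools_update
    )
```

When this returns True, the old activity detaches its event listeners and the new activity reconfigures the live session via update_session instead of opening a new WebSocket.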

Handoff Benchmark

Test with 10 conversation turns, then handoff — reuse vs fresh session, with copied or empty chat context.

Handoff = time for update_session or new session creation. Reply = time from handoff start to generate_reply response.

CI (good network):

┌──────────────────────┬──────────────┬──────────────┐
│ Mode                 │ Handoff (s)  │ Reply (s)    │
├──────────────────────┼──────────────┼──────────────┤
│ reuse (copy ctx)     │       *0.000 │        0.105 │
│ reuse (empty ctx)    │        0.061 │        0.164 │
│ fresh (copy ctx)     │        0.310 │        0.403 │
│ fresh (empty ctx)    │        0.000 │        0.364 │
└──────────────────────┴──────────────┴──────────────┘

Local network:

┌──────────────────────┬──────────────┬──────────────┐
│ Mode                 │ Handoff (s)  │ Reply (s)    │
├──────────────────────┼──────────────┼──────────────┤
│ reuse (copy ctx)     │        0.001 │        0.623 │
│ reuse (empty ctx)    │        0.296 │        0.866 │
│ fresh (copy ctx)     │        1.845 │        2.427 │
│ fresh (empty ctx)    │        0.000 │        1.681 │
└──────────────────────┴──────────────┴──────────────┘

* Handoff time is zero because we don't wait for update_instructions in the OpenAI realtime plugin.

@chenghao-mou chenghao-mou requested a review from a team March 26, 2026 04:03
@tinalenguyen tinalenguyen force-pushed the longc/reuse-rt-session-across-handoffs branch from 1b817a3 to cc1c8de Compare April 1, 2026 23:46
Member

@chenghao-mou chenghao-mou left a comment


I think it is hitting an issue with function calls when the handoff happens before the tool call result is added:

    11:45:11.071 DEBUG    livekit.agents     executing tool  
                                         {"function": "transfer_to_weather", "arguments": "{}", "speech_id": "speech_a183e40b8c57", "room": 
"console"}
    11:45:11.074 INFO     basic-agent        handing off to WeatherAgent {"room": "console"}
    11:45:11.961 INFO     livekit.agents     RealtimeModel metrics  
                                         {"model_name": "gpt-realtime", "model_provider": "api.openai.com", "ttft": -1.0, "input_tokens": 219,
"cached_input_tokens": 0, "input_text_tokens": 113, "input_cached_text_tokens": 0, "input_image_tokens": 0, "input_cached_image_tokens": 0, 
"input_audio_tokens": 106, "input_cached_audio_tokens": 0, "output_tokens": 12, "output_text_tokens": 12, "output_audio_tokens": 0, 
"output_image_tokens": 0, "total_tokens": 231, "tokens_per_second": 9.8, "room": "console"}
    11:45:11.964 DEBUG    livekit.agents     tools execution completed {"speech_id": "speech_a183e40b8c57", "room": "console"}
    11:45:16.973 WARNING  livekit.agents     failed to update chat context before generating the function calls results  
                                         {"error": "update_chat_ctx timed out.", "room": "console"}
    11:45:16.977 DEBUG    livekit.agents     reusing realtime session from previous activity {"room": "console"}
    11:45:17.490 ERROR    livekit.…ns.openai failed to handle event  
                                           {"event": {"type": "conversation.item.added", "event_id": "event_DRVBjFMStnIRUZry4pSYE", 
"previous_item_id": "item_DRVBe7B7O7BGvRaTMEa50", "item": {"id": "item_e2ca6d129cd5", "type": "function_call_output", "call_id": 
"call_9UoYr1iIQM8DltFx", "output": ""}}, "room": "console"}
Traceback (most recent call last):
  File "/Users/chenghao/Developer/agents-review/livekit-plugins/livekit-plugins-openai/livekit/plugins/openai/realtime/realtime_model.py", 
line 1023, in _recv_task
    self._handle_conversion_item_added(ConversationItemAdded.construct(**event))
    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/chenghao/Developer/agents-review/livekit-plugins/livekit-plugins-openai/livekit/plugins/openai/realtime/realtime_model.py", 
line 1675, in _handle_conversion_item_added
    fut.set_result(None)
    ~~~~~~~~~~~~~~^^^^^^
asyncio.exceptions.InvalidStateError: invalid state
Code
import logging

from dotenv import load_dotenv

from livekit.agents import (
    Agent,
    AgentServer,
    AgentSession,
    JobContext,
    JobProcess,
    MetricsCollectedEvent,
    RunContext,
    cli,
    metrics,
)
from livekit.agents.llm import function_tool
from livekit.plugins import openai, silero

logger = logging.getLogger("basic-agent")

load_dotenv("../agents/.env")


class GreeterAgent(Agent):
    def __init__(self) -> None:
        super().__init__(
            instructions=(
                "You are a friendly greeter named Kelly. "
                "Keep responses concise. No emojis or markdown. "
                "Your job is to greet the user and ask what they need help with. "
                "When the user asks about weather, use the transfer_to_weather tool."
            ),
        )

    async def on_enter(self) -> None:
        logger.info("GreeterAgent entered")
        self.session.generate_reply(instructions="greet the user and ask how you can help")

    @function_tool
    async def transfer_to_weather(self, context: RunContext) -> Agent:
        """Transfer to the weather agent when the user asks about weather."""
        logger.info("handing off to WeatherAgent")
        return WeatherAgent()


class WeatherAgent(Agent):
    def __init__(self) -> None:
        super().__init__(
            instructions=(
                "You are a weather specialist named Sunny. "
                "Keep responses concise. No emojis or markdown. "
                "Help the user with weather questions. "
                "When the user wants to talk about something else, use transfer_to_greeter."
            ),
        )

    async def on_enter(self) -> None:
        logger.info("WeatherAgent entered")
        self.session.generate_reply(
            instructions="introduce yourself as the weather specialist and ask what city"
        )

    @function_tool
    async def lookup_weather(self, context: RunContext, location: str) -> str:
        """Called when the user asks for weather information.

        Args:
            location: The city or region to check weather for
        """
        logger.info(f"Looking up weather for {location}")
        return f"The weather in {location} is sunny, 70 degrees Fahrenheit."

    @function_tool
    async def transfer_to_greeter(self, context: RunContext) -> Agent:
        """Transfer back to the greeter when the user is done with weather."""
        logger.info("handing off back to GreeterAgent")
        return GreeterAgent()


server = AgentServer()


def prewarm(proc: JobProcess) -> None:
    proc.userdata["vad"] = silero.VAD.load()


server.setup_fnc = prewarm


@server.rtc_session()
async def entrypoint(ctx: JobContext) -> None:
    ctx.log_context_fields = {"room": ctx.room.name}

    # shared realtime model -- both agents inherit this from the session,
    # so `self.llm is new_activity.llm` is True and the RT session is reused across handoffs
    rt_model = openai.realtime.RealtimeModel(voice="echo")

    session: AgentSession = AgentSession(
        llm=rt_model,
        vad=ctx.proc.userdata["vad"],
    )

    @session.on("metrics_collected")
    def _on_metrics_collected(ev: MetricsCollectedEvent) -> None:
        metrics.log_metrics(ev.metrics)

    async def log_usage():
        logger.info(f"Usage: {session.usage}")

    ctx.add_shutdown_callback(log_usage)

    await session.start(
        agent=GreeterAgent(),
        room=ctx.room,
    )


if __name__ == "__main__":
    cli.run_app(server)

"""Whether the instructions can be updated mid-session"""
mid_session_tools_update: bool = False
"""Whether the tools can be updated mid-session"""
per_response_tool_choice: bool = False
Member


IMO we should find a better name for per_response_tool_choice

Member


How about mutable_*: mutable_chat_context, mutable_instructions, mutable_tools, and mutable_tool_choice?

Member


i second mutable_*!

Comment on lines +250 to +275
async def update_session(
    self,
    *,
    instructions: NotGivenOr[str] = NOT_GIVEN,
    chat_ctx: NotGivenOr[ChatContext] = NOT_GIVEN,
    tools: NotGivenOr[list[Tool]] = NOT_GIVEN,
) -> None:
    if is_given(instructions):
        try:
            await self.update_instructions(instructions)
        except RealtimeError:
            logger.exception("failed to update the instructions")

    if is_given(chat_ctx):
        try:
            await self.update_chat_ctx(chat_ctx)
        except RealtimeError:
            logger.exception("failed to update the chat_ctx")

    if is_given(tools):
        try:
            await self.update_tools(tools)
        except RealtimeError:
            logger.exception("failed to update the tools")

Member


Any advantage of having this utility?

Contributor Author


It was requested by @tinalenguyen in #5303. I guess some plugins can override this method to customize the init process, e.g. to send all the configs in a single event.

@longcw
Contributor Author

longcw commented Apr 7, 2026

> I think it is hitting an issue with function calls when the handoff happens before the tool call result is added: […]

@chenghao-mou I think the issue is the update_chat_ctx timeout: after the timeout, the futures are cancelled but not removed from _item_create_future, so when the server eventually acknowledges those items, it tries to set_result on an already-cancelled future.

It's not caused by this PR, but I can clean up _item_create_future and _item_delete_future after they're cancelled here.
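The described cleanup can be sketched as follows (illustrative only — the function names and the exact dict shape are assumptions, loosely mirroring _item_create_future; the point is that a timed-out future is removed from the tracking dict so a late server ack never calls set_result on it):

```python
# Sketch of the fix: forget a pending "item created" future once it times out,
# so a late conversation.item.added ack is ignored instead of raising
# asyncio.exceptions.InvalidStateError. Names are hypothetical.
import asyncio


async def wait_for_item_created(
    item_create_futs: dict[str, asyncio.Future], item_id: str, timeout: float
) -> None:
    # register a pending ack for this item id
    fut = item_create_futs.setdefault(item_id, asyncio.get_running_loop().create_future())
    try:
        await asyncio.wait_for(asyncio.shield(fut), timeout)
    except asyncio.TimeoutError:
        fut.cancel()
        item_create_futs.pop(item_id, None)  # forget it so a late ack is a no-op
        raise


def on_item_added(item_create_futs: dict[str, asyncio.Future], item_id: str) -> None:
    # server ack: resolve the pending future only if it is still live
    fut = item_create_futs.pop(item_id, None)
    if fut is not None and not fut.done():  # guards against InvalidStateError
        fut.set_result(None)
```

With the pop-on-timeout in place, the late ack path finds nothing to resolve and returns quietly.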

@longcw longcw force-pushed the longc/reuse-rt-session-across-handoffs branch from 44b21c2 to a7ec954 Compare April 7, 2026 09:10
Member

@chenghao-mou chenghao-mou left a comment


Tested locally and it worked well handing off back and forth. Maybe we need less verbose capability names.


Member

@theomonnom theomonnom left a comment


lgtm, we can try to find a better name for _update_session when we make it public.

Otherwise nit:

per_response_tool_choice -> mutable_tool_choice?

Just thinking out loud, it kind of means the same thing if you edit it mid-session through generate_reply

@longcw
Contributor Author

longcw commented Apr 10, 2026

per_response_tool_choice -> mutable_tool_choice?

Just thinking out loud, it kind of means the same thing if you edit it mid-session through generate_reply

The tool_choice for generate_reply only applies to that single turn; we have update_options and update_tools for changing the session-level tools and tool_choice, and mutable_tool_choice sounds more like it refers to those.

@longcw longcw merged commit 137a3c1 into main Apr 10, 2026
25 of 27 checks passed
@longcw longcw deleted the longc/reuse-rt-session-across-handoffs branch April 10, 2026 09:11
