Every agent turn, you're resending the full context.
Again.
That overhead compounds fast.
WebSocket Mode for the Responses API keeps a persistent connection, sends only incremental inputs, and cuts end-to-end latency by up to 40% on heavy tool-call workflows.