Chat gets fast — token streaming and smarter scrolling
Chat just got a lot faster. Responses now stream token by token instead of arriving all at once, and the scroll keeps you oriented so you can watch answers fill in without losing your place.
Token streaming
Time-to-first-word drops from "the whole reply" to sub-second latency. Incremental tokens stream to the chat panel as the model produces them, with the frontend accumulating the response so it reads naturally without flicker. Streaming is now generally available — the feature flag is gone.
Smarter scrolling
New messages pin to the top of the viewport and reserve a screen of space below. The streaming answer fills that space calmly instead of chasing the bottom of the page. The scroll-up is animated (smooth, not a jump cut), and the message stays visible throughout the entire turn — no more disappearing above the fold mid-response.