- Better stabilty (fewer timeouts)
- Better UX (more information for users so long waits are more forgivable)
We need to expose eventing (ie, feedback what is happening before tokens start streaming back) and streaming tokens from the Apollo end, then plug Lightning into it.
This is becoming increasingly important as the assistant is slowing down and timing out more and more.