The deployment of the VITA-Audio-Plus-Vanilla model employs a non-streaming deployment approach.
For the ASR and TTS tasks, only single-turn dialogues are supported. In the Spoken QA task, generated text is used as dialogue history to reduce the context length.