Based on my experience, I have concerns about the reliability of Text2Semantic. When I modified the Text2Semantic model parameters to stabilize the semantic tokens, the pipeline's processing time increased significantly compared to the standard Text2Speech + Speech2Semantic pipeline run without saving the audio. Therefore, I recommend we proceed with the Text2Speech + Speech2Semantic pipeline. cc @tuanlda78202
dan-menlo
changed the title
task: Multi-lingual Instruct Speech Dataset Creation
task: Instruct Dataset Creation for Multilingual Speech
Nov 27, 2024
hahuyhoang411
changed the title
task: Instruct Dataset Creation for Multilingual Speech
task: Instruct Dataset Creation for Multilingual Speech (Phase 2)
Nov 27, 2024
Goal
Create a speech instruction fine-tuning dataset to make Ichigo better at conversation.
Tasklist