
planning: Ichigo Transcription #90

Open · dan-menlo opened this issue Oct 18, 2024 · 3 comments

dan-menlo (Contributor) commented Oct 18, 2024

Goal

  • The Ichigo Demo should transcribe the user's audio message
  • Likely driven by Engineering
  • Makes data storage easier (i.e. we can train over the stored transcripts)
  • The Whisper Encoder is already in the project (i.e. reuse the matching Decoder)
  • Will not affect latency, since transcription runs as post-processing (see the sketch below)
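A minimal sketch of the post-processing point, assuming an async serving loop; generate_answer, transcribe, and store_transcript are hypothetical placeholders, not Ichigo's actual functions:

import asyncio

# Hypothetical handler: the user-facing answer is produced first; the
# transcription job is scheduled afterwards, off the hot path, so it adds
# no latency to the response.
async def handle_audio_message(audio_path: str) -> str:
    answer = await generate_answer(audio_path)       # user-facing path
    asyncio.create_task(log_transcript(audio_path))  # fire-and-forget post-processing
    return answer

async def log_transcript(audio_path: str) -> None:
    # Run the (blocking) Whisper decode in a worker thread, then persist the
    # result so it can later be used as training data.
    text = await asyncio.to_thread(transcribe, audio_path)
    await store_transcript(audio_path, text)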

dan-menlo converted this from a draft issue on Oct 18, 2024
dan-menlo added this to the Ichigo v0.4 milestone on Oct 18, 2024
dan-menlo changed the title from "epic: Ichigo transcription" to "epic: Ichigo Transcription" on Oct 18, 2024
dan-menlo changed the title from "epic: Ichigo Transcription" to "planning: Ichigo Transcription" on Oct 18, 2024
tikikun (Collaborator) commented Nov 11, 2024

@nguyenhoangthuan99 you can pick this up if you like: just extract the embedding from the encoder and forward it to Whisper for transcription.

jrohsc commented Nov 25, 2024

Hi, how can I do transcription in a Colab notebook? Whenever I give it question audio, it only generates the answer to the question.

PodsAreAllYouNeed commented

> Hi, how can I do transcription in a Colab notebook? Whenever I give it question audio, it only generates the answer to the question.

I've prepared a Colab demo with a transcription example here: https://colab.research.google.com/drive/1req3ByqKS1vVPF_iGD1sNE2DzvMo7Jd0?usp=sharing

The relevant function for transcription is this:

import torch
import torchaudio

# vq_model and device are defined in earlier cells of the notebook.
def audio_to_text(audio_path, target_bandwidth=1.5, device=device):
    # Lazily load the Whisper decoder weights onto the target device.
    vq_model.ensure_whisper(device)
    wav, sr = torchaudio.load(audio_path)
    # Whisper expects 16 kHz input; resample if the file uses another rate.
    if sr != 16000:
        wav = torchaudio.functional.resample(wav, sr, 16000)
    with torch.no_grad():
        # Encode the waveform into VQ codes, then decode the codes back to text.
        codes = vq_model.encode_audio(wav.to(device))
        transcript = vq_model.decode_text(codes[0])
    return transcript[0].text
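
Usage is then a single call; the file name below is only a placeholder:

# Transcribe a local audio file (any sample rate; it is resampled to 16 kHz).
print(audio_to_text("sample_question.wav"))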

Status: Investigating
6 participants