Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Research optimizing media file format for transcriptions #33

Open
citomcclure opened this issue Jun 15, 2024 · 0 comments
Open

Research optimizing media file format for transcriptions #33

citomcclure opened this issue Jun 15, 2024 · 0 comments
Labels
spike Research for features with lots of unknowns voice note Issue directly related to Voice Note feature

Comments

@citomcclure
Copy link
Owner

citomcclure commented Jun 15, 2024

User Story

As a voice note user, I may want the fastest voice note transcription possible instead of a slower one with more confidence.

Overview

Research which, if any, media file formats affect transcription job times in Amazon Transcribe. Currently, WAV with PCM 16-bit encoding is a larger file format than most and may be overkill for simple notes. Ultimately, this could result in toggle between faster, lower quality notes and slower, higher quality ones. If not prior research exists, we can implement 2-3 other file formats (research length of time to do so) and measure/benchmark the transcription times in a repeatable environment/way.

@citomcclure citomcclure added voice note Issue directly related to Voice Note feature spike Research for features with lots of unknowns labels Jun 15, 2024
@citomcclure citomcclure changed the title [SPIKE] Measure, benchmark, and optimize media file format for transcriptions Measure, benchmark, and optimize media file format for transcriptions Jun 15, 2024
@citomcclure citomcclure changed the title Measure, benchmark, and optimize media file format for transcriptions Research optimizing media file format for transcriptions Jun 15, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
spike Research for features with lots of unknowns voice note Issue directly related to Voice Note feature
Projects
Status: Ready
Development

No branches or pull requests

1 participant