Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Signed-off-by: Xinwei Xiong [email protected]
This pull request includes significant updates to the configuration files, command-line interface, and web functionalities of the
voiceflow
project. The main changes involve updating environment variables, adding new commands, and enhancing the web interface for real-time audio processing.Configuration Updates:
.env.example
: Updated MinIO, Azure, AWS, Google, OpenAI, and VOLCENGINE configurations with new placeholders and default values.configs/config.yaml
: Updated server port, added AWS configuration, and made changes to logging and service provider settings. [1] [2] [3] [4] [5]Command-Line Interface Enhancements:
cmd/voiceflow/realtime.go
: Added a new commandrealtime
to listen and translate speech in real-time.cmd/voiceflow/root.go
: Introduced a new sub-commandtranscribe
to transcribe audio files using the configured STT service, along with necessary flag configurations and command logic. [1] [2] [3] [4] [5] [6]Web Interface Improvements:
cmd/voiceflow/web/script.js
: Enhanced real-time audio processing by adding partial transcription display and sending audio chunks to the backend more frequently. [1] [2] [3]Dependency Updates:
go.mod
: Added dependencies for AWS and Microsoft Cognitive Services SDKs and removed unused dependencies. [1] [2] [3]go.work
: Updated Go workspace configuration.These changes collectively enhance the functionality and configurability of the
voiceflow
project, making it more robust and easier to use for real-time and file-based speech processing tasks.FIX #12
FIX #8