Real-time Speech Transcription and Translation

A Next.js application that performs real-time speech transcription and translation using OpenAI's models. This app captures live audio input, provides real-time transcription, and simultaneously translates the content into your selected target language.

Features

🎤 Live audio capture and transcription
🌍 Real-time translation to multiple languages
🔄 Automatic text cleanup option (keep last 3 sentences)
💻 Clean, modern user interface
🎯 Support for 10+ target languages

Prerequisites

Before you begin, ensure you have the following installed:

Node.js 16.8 or later
npm or yarn package manager
A modern web browser with microphone support
An OpenAI API key (for production use)

Installation

Clone the repository:

git clone <repository-url>
cd speech-transcription-app

Install dependencies:

npm install
# or
yarn install

Create a .env.local file in the root directory and add your OpenAI API key:

OPENAI_API_KEY=your_api_key_here

Start the development server:

npm run dev
# or
yarn dev

Open http://localhost:3000 in your browser.

Usage

Select your target translation language from the dropdown menu.
Click the "Start Recording" button to begin capturing audio.
Speak clearly into your microphone.
The app will display the original transcription on the left and the translation on the right.
Toggle the "Keep only last 3 sentences" switch to manage text length.
Click "Stop Recording" when finished.

Supported Languages

English (en)
Spanish (es)
French (fr)
German (de)
Italian (it)
Portuguese (pt)
Russian (ru)
Japanese (ja)
Korean (ko)
Chinese (zh)

Technical Details

This application is built with:

Next.js 13 (App Router)
TypeScript
Tailwind CSS
OpenAI API
Web Audio API
React Icons

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

OpenAI for providing the speech-to-text and translation APIs
Next.js team for the excellent framework
All contributors and users of this application

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
app		app
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
README.md		README.md
next.config.js		next.config.js
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Real-time Speech Transcription and Translation

Features

Prerequisites

Installation

Usage

Supported Languages

Technical Details

Contributing

License

Acknowledgments

About

Releases

Packages

Languages

Paul-Yu-Chun-Chang/live-speech-translated-caption

Folders and files

Latest commit

History

Repository files navigation

Real-time Speech Transcription and Translation

Features

Prerequisites

Installation

Usage

Supported Languages

Technical Details

Contributing

License

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages