🏗️ This documentation is a work in progress! It's also mostly AI generated right now 🏗️
Read my project notes about this app at: https://nathanclonts.com/text-to-audiobook-app-with-kubernetes/
- Introduction
- Features
- Project Structure
- Prerequisites
- Installation and Setup
- Running the Application
- License
The PDF to Speech Converter is a web application that allows users to upload PDF documents and receive high-quality spoken audio as output. Users can choose from multiple voice options, ensuring a personalized listening experience. The application processes the PDF by extracting and cleaning the text before converting it to speech using advanced text-to-speech (TTS) models.
- PDF Upload: Upload PDF files directly through the web interface.
- Voice Selection: Choose from a variety of high-quality voices.
- Text Cleaning: Automatic removal of headers, footers, and hyphenations for smooth narration.
- Audio Playback and Download: Listen to the generated audio online or download it for offline use.
- Asynchronous Processing: Efficient handling of large files without blocking the user interface.
- Modular Design: Separate backend and frontend components for scalability and maintainability.
TODO
TODO
TODO