Skip to content

Latest commit

 

History

History
96 lines (65 loc) · 3.43 KB

README.md

File metadata and controls

96 lines (65 loc) · 3.43 KB

GLM-Translator


English | 中文

Introduction

GLM Translator is a tool that uses AI technology to translate English PDF books into Chinese. This tool utilizes large language models (LLMs), specifically the ChatGLM model by Zhipu. It is built with Python and features a flexible, modular, and object-oriented design.

Why This Project

In the current environment, there is a lack of non-commercial yet effective PDF translation tools. Many users have PDF files containing sensitive data and prefer not to upload them to public commercial services to protect their privacy. This project aims to solve this problem by providing a solution for users who need to translate their PDF files while maintaining data privacy.

Sample Results

GLM Translator is still in the early stages of development, and I am actively adding more features and improving its performance. Any feedback or contributions are highly welcome!

OHenry Novel

"OHenry Novel selected"

The_Old_Man_of_the_Sea

"The Old Man and the Sea"

Features

  • Use large language models (LLMs) to translate English PDF books into Chinese.
  • Support for the ChatGLM model.
  • Flexible configuration through YAML files or command line parameters.
  • Robust timeout and error handling for translation operations.
  • Modular and object-oriented design, easy to customize and extend.
  • Add support for other languages and translation directions.
  • Implement a graphical user interface (GUI) for easier use.
  • Implement stylized translation.
  • Implement partial page translation (e.g., first x pages).
  • Create a web service or API for use in web applications.
  • Add support for batch processing of multiple PDF files.
  • Add support for retaining the original layout and formatting of the source PDF.
  • Improve translation quality by using custom-trained translation models.

Getting Started

  1. Environment Setup

    1. Clone the repository.

    2. GLM Translator requires Python 3.10 or higher. Install the dependencies using pip install -r requirements.txt.

    Usage

    1. Place your PDF file in the input directory.
    2. Run the translator using python translate.py.
    3. The translated PDF will be saved in the output directory.

    Command Line Options

    • --input: Specify the input PDF file.
    • --output: Specify the output directory.
    • --language: Set the target translation language (default: Chinese).
    • --model: Specify the model to use (default: ChatGLM).
    • --timeout: Set the timeout for translation operations.

    Configuration

    You can configure GLM Translator using a YAML file. Below is an example:

    input: 'input/sample.pdf'
    output: 'output/'
    language: 'Chinese'
    model: 'ChatGLM'
    timeout: 600

    Contributing

    We welcome contributions to the GLM Translator project. If you would like to contribute, please fork the repository, make your changes, and submit a pull request.

    License

    This project is licensed under the MIT License - see the LICENSE file for details.

    Acknowledgments

    • Zhipu's ChatGLM team for developing the model.
    • The open-source community for providing the tools and libraries used in this project.