live-ocr-translator

DISCLAIMER This project is still in very early development and does not work you can refer to the Features section in here to see the things currently implemented / will be implemented. I am committed to getting this in a functional state soon enough hopefully, but right now you can unfortunately not use this. Do feel free to contribute by opening a PR though. As I am a student things might still take a while so I am accepting all the help I can get.

Why

The motivation behind wanting to make this was largey based a conversation with a friend about embedded subtitles in some older movies. The main issue being that nowdays alot of text is generally embedded in ways conventional approaches can't easily access. Joseph Finney's Text-Grab approached this issue very nicely for the static case and I hope to extend that idea to the dynamic case of running live OCR to enable things like live captioning without any embedding in the application. With the addition of having the application be cross-platform.

Future Applications

Live translation of subtitles when preferred option does not exist
Live translation of text content in games without a translation
General translation of any dynamically changing text

Features

Contributing

Windows

Currently I am developing primarily for windows so to test the application on windows first build the dockerfile then each time you want to compile the application for windows run the cross_compile.sh script.

To build image with the required dependencies

docker build . -t gtkrs-crosscomp

Once you have the cross compilation image built you can just run the cross compile script whenever you want to build the application for windows. The zip should then appear in the root directory of the project.

./cross_compile.sh

The packaged application should then appear in your root directory in the folder gtkapp

Linux

Currently for screen capture there is a placeholder image that is run. So that other functionality can still be worked on while not running into errors. The standard cargo run should work, the only dependecies needed are gtk3 and tesseract which you can install by running.

sudo apt install libgtk-3-dev tesseract-ocr

Additionally I am planning on migrating the application to gtk4, so keep that in mind in case I forget to update the readme. I will likely just create a docker image for linux as well to make it easier to build.

Help

This is a somewhat larger undertaking than what I usually do, so I would greatly appreciate any contributions to any aspect of the app.

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
.github/workflows		.github/workflows
.vscode		.vscode
assets		assets
libs		libs
src		src
.dockerignore		.dockerignore
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
PROGRESS.md		PROGRESS.md
README.md		README.md
build.sh		build.sh
cross_compile.sh		cross_compile.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

live-ocr-translator

Why

Future Applications

Features

Contributing

Windows

Linux

Help

Visual feature outline

About

Releases

Packages

Languages

KaiErikNiermann/live-ocr-translator

Folders and files

Latest commit

History

Repository files navigation

live-ocr-translator

Why

Future Applications

Features

Contributing

Windows

Linux

Help

Visual feature outline

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages