Skip to content
/ bg3gpt Public

Python script that looks for new screenshots of game dialog to process and send to GPT and read response out aloud.

License

Notifications You must be signed in to change notification settings

rzhaman/bg3gpt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

This python script executes in Windows 11 environment, looking for screenshot produced by any software (Windows 11 screenshot seems to be lagging, so I don't recommend it, I use Nvidia GeForce Experience and Alt-F1) in a defined folder, once it detects a new screenshot it sends it to GPT/LLM to understand the text and dialog options presented, takes the GPT answer and speaks the recommendation aloud.

Current version uses gpt-4o model by OpenAI, operating multimodally (i.e. it sends text and image prompt) and requires API key to work. I also have version that works with Ollama and local Llama3 model, it needs Google Tesseract OCR engine to provide just text from the screenshot to the Llama3.

About

Python script that looks for new screenshots of game dialog to process and send to GPT and read response out aloud.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages