Skip to content

mau777pirho/extractInfoWithLLM

 
 

Repository files navigation

Extract Info with LLM

Overview

This repository contains the source code and resources used in a tutorial video hosted on YouTube (in Spanish), demonstrating how to extract information from PDF invoices using a Local Language Model (LLM). The tutorial is designed to provide hands-on experience with setting up a local LLM server, processing PDFs, and extracting data efficiently using R.

Cómo Extraer Información de Facturas en PDF con Modelos de Lenguaje Locales y R

Tutorial Overview

In this tutorial, you'll learn the following:

  1. Introduction to LM Studio: Get familiar with the LM Studio, a tool that allows you to run language models locally.
  2. Installing a Local Model: Steps to download and install a local language model suitable for information extraction tasks.
  3. Running a Local Server: Set up and launch a local server that can handle API requests for text processing.
  4. Processing PDF Invoices: Convert PDF invoices into text and use the local language model to extract relevant information.
  5. Storing Extracted Data: Save the extracted data in either CSV or Excel formats for further analysis.

Features

  • Cost-Effective: Running models locally eliminates cloud costs.
  • Privacy: Your data stays on your machine, ensuring confidentiality.
  • Offline Capability: Once set up, the system can operate without an internet connection.

Requirements

  • Hardware: Sufficient computing power to run LLMs locally.
  • Software:
    • R Programming Language
    • PDF processing libraries
    • LM Studio

Troubleshooting

  • Model Loading Issues: Ensure that the model is correctly installed in LM Studio.
  • Server Connectivity: Verify that the local server is running and accessible.
  • Dependencies: Make sure all required R packages are installed

Contributions

Feel free to fork this repository and submit pull requests. Contributions are welcome, especially in improving the extraction accuracy and expanding functionality.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contact

For any questions or feedback, please contact:

Dr. José Manuel Galán Ordax

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • R 100.0%