License: MIT

Linalgo W3C Web Annotation Library

Linalgo is a Python module that helps machine learning teams create and curate datasets for Natural Language Processing. It tries to follow the W3C Web Annotation Data Model and to provide a powerful system for adding metadata to the most commonly used text and image formats: TXT, PDF, HTML, etc.
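For readers unfamiliar with the W3C Web Annotation Data Model, the sketch below builds one annotation as a plain Python dictionary using only the standard library. It does not use linalgo's own API; the helper name `make_text_annotation`, the example URI, and the `ACRONYM` label are hypothetical and chosen purely for illustration. Only the JSON-LD keys (`@context`, `type`, `body`, `target`, `selector`) come from the W3C model.

```python
import json


def make_text_annotation(source_uri, text, start, end, label):
    """Build a W3C-style annotation dict that tags text[start:end] with a label.

    Hypothetical helper for illustration only; not part of linalgo.
    """
    return {
        "@context": "http://www.w3.org/ns/anno.jsonld",
        "type": "Annotation",
        "motivation": "tagging",
        # The body carries the metadata attached to the target span.
        "body": {"type": "TextualBody", "value": label, "purpose": "tagging"},
        "target": {
            "source": source_uri,
            # Two selectors describing the same span: character offsets
            # (start inclusive, end exclusive) and the exact quoted text.
            "selector": [
                {"type": "TextPositionSelector", "start": start, "end": end},
                {"type": "TextQuoteSelector", "exact": text[start:end]},
            ],
        },
    }


doc = "Linalgo helps teams curate NLP datasets."
start = doc.index("NLP")
annotation = make_text_annotation(
    "https://example.org/docs/1.txt",  # placeholder document URI
    doc,
    start,
    start + len("NLP"),
    "ACRONYM",
)
print(json.dumps(annotation, indent=2))
```

The character offsets follow the same convention as Python slicing, so the selected span can be recovered with `doc[start:end]`.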

Documentation

The documentation is available at https://linalgo.github.io/linalgo

Installation

pip install linalgo

Test

pytest

Storing Annotation Data

By default, linalgo stores annotations on a dedicated hub at https://hub.linalgo.com. There are also connectors to retrieve data from Google BigQuery.