Assign controlled labels to different types of documents, mainly in Spanish language.
Supported labels:
- DeCS (Descriptores en Ciencias de la Salud), analog Spanish version of English MeSH terms
- HPO (Human Phenotype Ontology) (future feature)
Supported types of documents:
- Scientific articles from IBECS (Índice Bibliográfico Español en Ciencias de la Salud)
- Scientific articles from LILACS (Literatura Latinoamericana y del Caribe en Ciencias de la Salud)
- Clinical studies from REec (Registro Español de estudios clínicos)
- Clinical research projects from Portal FIS (Fondo de Investigación en Salud)
- Patents in Spanish from Google Patents
The main objective of indexing documents is to obtain a gold-standard set (i.e. annotated by humans) that maps the text within each document to some of those controlled labels.
The technologies that build this tool are:
- MongoDB for database
- Flask for backend
- Angular for frontend
- NGINX for server and proxy
- Docker Compose for development and production deployment
$ git clone https://github.com/TeMU-BSC/indexer.git
$ cd indexer
$ docker-compose up
$ docker-compose -f docker-compose.prod.yml up --build -d