DBpedia Virtuoso Docker Setup Guide

Downloading DBpedia

Manual downloading

To set up your endpoint with custom files:

  • Choose which version of the DBpedia dataset you want to set up from [https://downloads.dbpedia.org/{release-year}/]
  • For the English version of DBpedia there are two variants to choose from: /core/ and /core-i18n/en/
  • If you want to run the standard version of DBpedia (i.e. what you find at the public DBpedia endpoint), use /core/. The i18n directories hold the localized versions for individual languages. For more background, see https://joernhees.de/blog/tag/virtuoso/
  • Unpack the downloaded files (i.e. decompress them to plain .ttl or .nt files), then skip the script-downloading part below.
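As a rough illustration of the manual route, the sketch below builds the download URL from a release and a variant and wraps `wget` and `bunzip2` in two helper functions. The function names, the `2016-10` release label in the usage comment, and the assumption that the dump files are bzip2-compressed (`*.ttl.bz2`) are illustrative and not taken from this guide, so check the directory listing of the release you pick before relying on them.

```shell
#!/bin/sh
# Hypothetical helpers; the names and the *.ttl.bz2 pattern are assumptions.

# Print the base URL for a release (e.g. "2016-10") and a variant
# ("core" or "core-i18n/en").
dbpedia_release_url() {
    printf 'https://downloads.dbpedia.org/%s/%s/\n' "$1" "$2"
}

# Recursively fetch the compressed Turtle files below a release URL
# into the target directory, without recreating the remote tree.
download_dumps() {
    url=$1
    target_dir=$2
    mkdir -p "$target_dir"
    wget -r -np -nd -A '*.ttl.bz2' -P "$target_dir" "$url"
}

# Decompress every .bz2 file in the given directory in place.
unpack_dumps() {
    for f in "$1"/*.bz2; do
        [ -e "$f" ] || continue   # nothing matched the glob
        bunzip2 "$f"
    done
}

# Usage (not run automatically):
#   url=$(dbpedia_release_url 2016-10 core)
#   download_dumps "$url" /path/to/data
#   unpack_dumps /path/to/data
```

Keeping the URL builder separate from the download step makes it easy to print and inspect the URL in a browser before starting a long transfer.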

Script downloading

Get the download commands from the following repository:

git clone https://github.com/AKSW/DBpedia-docker

You will need to edit the Makefile to specify the URLs or the dataset version you wish to download. I provide my adapted version of the Makefile in this repository. To download the dataset files and the ontology, set the year of the DBpedia version you want in the Makefile and run:

make download

Then to unzip the downloaded files, run:

make unpack

Running Virtuoso docker

Pull the Virtuoso Docker image:

docker pull tenforce/virtuoso

Then, run:

docker run --name dbpedia-virtuoso -p 8890:8890 -p 1111:1111 -v /path/to/virtuoso -v /path/to/data -d tenforce/virtuoso

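Note that `-v` needs a host-path:container-path pair to bind-mount a host directory. As far as I know, the tenforce/virtuoso image documentation mounts the host database directory at /data, so adjust the two -v arguments to your own layout.

IMPORTANT: the virtuoso.ini file lives inside the /path/to/virtuoso directory. Make sure that the DirsAllowed parameter in virtuoso.ini includes the directory where you have downloaded the DBpedia core files.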
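To check the DirsAllowed setting without opening the file by hand, something like the following could be used. It is only a sketch: the function name is made up, and it assumes DirsAllowed is written on a single line as a comma-separated list, which is how the stock virtuoso.ini files I have seen are laid out.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the DirsAllowed line of a virtuoso.ini
# file lists the given directory exactly (assumes a one-line,
# comma-separated value).
dirs_allowed_has() {
    ini_file=$1
    wanted=$2
    grep -E '^[[:space:]]*DirsAllowed[[:space:]]*=' "$ini_file" \
        | head -n 1 \
        | cut -d= -f2- \
        | tr ',' '\n' \
        | sed -e 's/^[[:space:]]*//' -e 's/[[:space:]]*$//' \
        | grep -Fxq -- "$wanted"
}

# Usage (not run automatically):
#   if dirs_allowed_has /path/to/virtuoso/virtuoso.ini dumps; then
#       echo "dumps is allowed"
#   else
#       echo "add dumps to DirsAllowed and restart the container"
#   fi
```

The comparison is a literal string match, so `dumps` and `./dumps` count as different entries; use the same spelling that you later pass to the loader functions.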

Move the data to the Virtuoso dumps folder

Move the unpacked .ttl files to the /db/dumps/ folder of the Virtuoso Docker directory that you created before.

mv /path/to/data/ /path/to/virtuoso/db/dumps

If you run into a permissions problem, run the above command with sudo.
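Before starting the load, it can help to confirm that the files really ended up where the loader will look. The helper below is a small sketch with a made-up name; it simply counts the .ttl files below a directory, including subdirectories.

```shell
#!/bin/sh
# Hypothetical helper: print the number of .ttl files found anywhere
# below the given directory.
count_ttl_files() {
    find "$1" -type f -name '*.ttl' | wc -l | tr -d ' '
}

# Usage (not run automatically):
#   count_ttl_files /path/to/virtuoso/db/dumps
```

A count of 0 usually means the files were moved to a different location or are still compressed.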

Loading DBpedia into Virtuoso

To load the data, follow these steps:

docker exec -it dbpedia-virtuoso bash

You will now be inside the Virtuoso container. Then run:

isql-v -U dba -P dba

You will now be inside the ISQL terminal of the Virtuoso container.

First, load the ontology file into the graph [http://dbpedia.org/resource/classes#]. (It can also be loaded later, I tried it myself; what matters is that it is loaded.) This lets you browse and query your local endpoint just like the public one, i.e. [https://dbpedia.org/sparql], so every DBpedia resource can be reached at:

[http://{your-cname}:{your-port}/resource/{resource-name}]

To do this, run:

ld_add('dumps/{ontology-filename}.owl', 'http://dbpedia.org/resource/classes#');

Now, we can start loading the data (RDF triples) from the ttl files. To do so, run:

ld_dir_all('dumps/', '*.ttl*', 'http://dbpedia.org/');

In the above command, the first argument is the path of the dumps/ directory; the second is the pattern of the files to load (*.* for everything in dumps, *.ttl for only the ttl files in dumps, etc.); and the last is the named graph into which all your data is loaded. Please make sure all your .ttl files are in the dumps directory!

Then run the RDF loader to start loading the triples into the database:

rdf_loader_run();
checkpoint;
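The interactive steps above can also be scripted. The sketch below prints the same statements as one script and pipes them into `isql-v` inside the container. The function names and the ontology file name argument are hypothetical. The final query against `DB.DBA.load_list` is an addition of mine: as I understand the Virtuoso bulk loader, `ll_state` is 0 for files still queued, 1 for files in progress and 2 for files that are done, but treat that as an assumption and check it against the Virtuoso documentation.

```shell
#!/bin/sh
# Hypothetical helpers wrapping the bulk-load statements from this guide.

# Print the load script. $1 is the ontology file name inside dumps/.
bulk_load_sql() {
    cat <<EOF
ld_add('dumps/$1', 'http://dbpedia.org/resource/classes#');
ld_dir_all('dumps/', '*.ttl*', 'http://dbpedia.org/');
rdf_loader_run();
checkpoint;
SELECT ll_state, COUNT(*) FROM DB.DBA.load_list GROUP BY ll_state;
EOF
}

# Feed the script to isql-v on stdin inside the running container
# (container name and credentials as used earlier in this guide).
run_bulk_load() {
    bulk_load_sql "$1" | docker exec -i dbpedia-virtuoso isql-v -U dba -P dba
}

# Usage (not run automatically; replace the file name with your own):
#   run_bulk_load your-ontology-file.owl
```

Printing the script separately from running it lets you review the exact statements first, for example with `bulk_load_sql your-ontology-file.owl | less`.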

Querying the virtuoso SPARQL endpoint

The SPARQL endpoint is exposed on the ports declared in the docker run command. Go to: http://localhost:8890/sparql

And you should be able to see the SPARQL interface.
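The endpoint can also be queried from the command line. The following is a minimal sketch using `curl`; the function names are made up, and the triple-count query is just one convenient smoke test for the graph name used in the `ld_dir_all` call above.

```shell
#!/bin/sh
# Hypothetical helpers for a quick smoke test of the local endpoint.

# Print a query that counts the triples in the named graph $1.
count_triples_query() {
    printf 'SELECT (COUNT(*) AS ?n) WHERE { GRAPH <%s> { ?s ?p ?o } }\n' "$1"
}

# Send a SELECT query ($2) to a SPARQL endpoint ($1) and print the
# JSON result. -G turns the url-encoded data into a GET query string.
sparql_select() {
    endpoint=$1
    query=$2
    curl -sS -G "$endpoint" \
        -H 'Accept: application/sparql-results+json' \
        --data-urlencode "query=$query"
}

# Usage (not run automatically):
#   sparql_select http://localhost:8890/sparql \
#       "$(count_triples_query http://dbpedia.org/)"
```

A count that keeps growing between calls suggests the loader is still running; a count of zero suggests the data went into a different graph name than the one queried.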

Enjoy!
