VRE is a Virtual Research Environment including a web application and a cloud-based backend aimed to provide end-users an integrative platform for exploring rellevant data resources and applying to them a collection of analyses and visualizations. It is written in PHP, HTML and Javascript. This document describes how to install and configure VRE platform in your system, as well as how to process on several basic operation for customizing VRE to your specific project needs.
Clone VRE source code in your installation directory, for instance /home/user
.
git clone https://github.com/inab/openvre.git
cd openvre
Install PHP extensions (curl, gd and mongodb).
sudo apt-get install php-curl php-gd php-mongodb
Install PHP library dependencies via composer using the following command inside the openvre directory:
composer update
After installing the extensions and libraries, restart Apache2 Web server or if missing install it and run:
sudo service apache2 restart
The /public/
directory is to be accessible on the Web, and the server should accept '.htaccess' files. So, the Apache2 Web Server is to be accordingly set.
Multiple configurations are possible, for instance, you can set the host files mapping using the Alias directive, or you can prepare a new virtual host by which the root document points to the openVRE public folder. The DocumentRoot needs to be set accordingly in apache.config and the necessary files. See an example:
DocumentRoot /home/user/openvre/public/
<Directory "/home/user/openvre/public">
Options Indexes FollowSymLinks
AllowOverride All
Require all granted
</Directory>
Enabling the rewriting engine is also required:
sudo a2enmod rewrite
sudo systemctl restart apache2
Define where VRE user's data and other data is to be stored, for example, in /shared/user/data
. Make sure that the chosen folder is accessible from both, the VRE web application and the VRE workers (i.e. local or remote virtual machines). Copy the directory structure there, and give Apache2 user permissions to it:
mkdir data
cp -r openvre/install/data/* data/
sudo chown -R www-data:www-data data/
For some visualizers, direct access to this data is required. So, if you latter plan to install one of these visualizers (e.g. NGL), add theses soft links:
cd public
ln -s /data/userdata/ files
Create a new database to MongoDB, here dbname
, and populate it with the structural collections:
for f in install/database/*.json; do mongoimport --host myServer --db dbname -u myAdmin -p myPassword --authenticationDatabase admin $f; done
Most of the VRE settings are parameterized at config/globals.inc.php
. Copy the template and read through it to configure arguments like the data path, the URL, the database, etc.
cp config/globals.inc.php.sample config/globals.inc.php
Each parameter is briefly introduced, yet most of them are self-explanatory. Some of these configurations require extra settings or auxiliary files. These are listed below:
Configure the appropiate path in config/globals.inc.php
, and make sure it exists and is writabble by Apache's user:
mkdir -p /home/user/openvre/logs/
touch /home/user/openvre/logs/application.log
sudo chown -R www-data:www-data /home/user/openvre/logs/application.log
Copy the conf template, set the credentials in there and parameterize config/globals.inc.php
accordingly
cp config/mail.conf.sample config/mail.conf
Copy the conf template, set the credentials in there and parameterize config/globals.inc.php
accordingly. VRE will use the PHP MongoDB driver to connect to the database.
cp config/mongo.conf.sample config/mongo.conf
Copy the conf template, set the credentials in there and parameterize config/globals.inc.php
accordingly, including the openID endpoints. VRE uses a generic OpenID Connect Resource Provider that connects to a external identity manager server (https://www.keycloak.org/)
cp config/oauth2.conf.sample config/oauth2.conf
cp config/oauth2_admin.conf.sample config/oauth2_admin.conf
Replace these files with your project logos
public/assets/layouts/layout/img/logo-big.png
: logo for central frame pagespublic/assets/layouts/layout/img/logo.png
: logo for top navigation barpublic/assets/layouts/layout/img/icon.png
: favico logo
Edit the custom CSS file for customizing the web look
public/assets/layouts/layout/css/custom.min.css
Edit the following pages, mostly HTML, for the Help section
- public/help/general.php : Section 'Help' > 'General Information'
- public/help/starting.php : Section 'Help' > 'Getting Started'
- public/help/upload.php : Section 'Help' > 'Get Data'
- public/help/ws.php : Section 'Help' > 'Workspace'
- public/help/launch.php : Section 'Help' > 'Launch Job'
- public/help/hdesk.php : Section 'Help' > 'Helpdesk'
- public/help/related.php : Section 'Help' > 'Related Links'
- public/help/refs.php : Section 'Help' > 'References'
- public/help/ackn.php : Section 'Help' > 'Acknowledgments'
Edit the following page with your project's terms of use. They are displayed at the VRE footer page.
vim public/applib/getTermsOfUse.php
Edit the content for the README file that appears by default on each workspace in upload/README
.
vim /data/sampleData/basic/uploads/README.md`
Log into VRE and register with the user you want to convert into the admin user. Then, grant it admin privileges by accessing the Mongo database and update Type:0
db.users.update(
{ _id: [email protected] },
{ $set: { "Type": 0 } }
)
Set the current system as a master SGE host able to submit jobs. Configure also a local queue, if you plan to install tools locally (i.e. WGET tool, BAMvalidation).
sudo qconf -as mail.domain.es
Each time a new user access VRE, an anonymous account is created until it registers. Set a cron job for regularly cleanning old temporarily user, from both, database and data directory.
Add execution permission to the cleaning script:
chmod u+x scripts/maintainance/cleanUsersData.php
Set a cron job to daily execute it using:
crontab -e
And setting there something similar to:
# Example of job definition:
# .---------------- minute (0 - 59)
# | .------------- hour (0 - 23)
# | | .---------- day of month (1 - 31)
# | | | .------- month (1 - 12) OR jan,feb,mar,apr ...
# | | | | .---- day of week (0 - 6) (Sunday=0 or 7) OR sun,mon,tue,wed,thu,fri,sat
# | | | | |
# * * * * * command to be executed
0 3 * * * /path/to/openvre/scripts/cleanUsers.php >> /path/to/openvre/logs/cleanUsers.log 2>&1
Register the tool definition into the tools collection
database
-
Prepare JSON schema for your project's tools extending the following definition: -
install/data/tool_schemas/tool_specification/tool_schema.json
-
Tool developer is to prepare the tool definition and submit it to admin. Validate the
newTool.json
against the previous schema. Following, an example: -install/data/tool_schemas/tool_specification/examples/example.json
-
Insert the tool document into the tools collection as a new entry
mongoimport --jsonArray --db dbname --collection tools -u myAdmin -p XXXX --authenticationDatabase admin --file newTool.json
Each tool in VRE requires a new folder in tools/
. Create it for the 'newTool'. Make it sure that 'newTool' is your tool id. You can take the tool_skeleton as a template
cp tools/tool_skeleton/ tools/newTool
sudo chown www-data:www-data tools/newTool/help/img/
Create the tool form where the user is to set the arguments and input files for a run (ie. Launch Tool > Tool Skeleton > Start Tool with 1 fasta). Use the tool_skeleton/input.php as a template, and edit it according the number and type of input files and arguments you need for your newTool
vim tools/newTool/inputs.php
Edit:
$toolId = "tool_skeleton";
: set to your tool id (newTool)$tool['input_files']['fasta1']
: set your input file name/s. Substitute in the following code the tool_skeleton 'fasta1' input by the identificator of your input file - according your tool definition. Concatenate and adapt as manyas input files your tool have.<div class="col-md-12"> <?php $ff = matchFormat_File($tool['input_files']['genes']['file_type'], $inPaths); ?> <?php InputTool_printSelectFile($tool['input_files']['genes'], $rerunParams['fasta1'], $ff[0], false, true); ?> </div>
Register tool help pages to the database. Each section help has a Mark Down editor where the tool developer will be able to set the help content by himself.
For registering the complete list of help sections (Help > Tools > Tool Skeleton > Methods, Inputs, Outputs, Results, References), add the following documents into helps collection
.
{
"tool" : "tool_skeleton",
"help" : "references",
"title" : "References",
"content" : "<!-- Help content -->\r\n# Title 1\r\nEdit the content using the online *Markdown* editor."
}
{
"tool" : "tool_skeleton",
"help" : "results",
"title" : "Results",
"content" : "<!-- Help content -->\r\n# Title 1\r\nEdit the content using the online *Markdown* editor."
}
{
"tool" : "tool_skeleton",
"help" : "method",
"title" : "Method",
"content" : "<!-- Help content -->\r\n# Title 1\r\nEdit the content using the online *Markdown* editor.\r\n"
}
{
"tool" : "tool_skeleton",
"help" : "outputs",
"title" : "Outputs",
"content" : "<!-- Help content -->\r\n# Title 1\r\nEdit the content using the online *Markdown* editor.\r\n"
}
{
"tool" : "tool_skeleton",
"help" : "help",
"title" : "Tool Skeleton",
"content" : "<!-- Example of navigator -->\r\n<a href=\"/tools/tool_skeleton/help/method.php\" class=\"btn green btn-xs\">Method</a> <a href=\"/tools/tool_skeleton/help/inputs.php\" class=\"btn green btn-xs\">Inputs</a> <a href=\"/tools/tool_skeleton/help/outputs.php\" class=\"btn green btn-xs\">Outputs</a> <a href=\"/tools/tool_skeleton/help/results.php\" class=\"btn green btn-xs\">Results</a> <a href=\"/tools/tool_skeleton/help/references.php\" class=\"btn green btn-xs\">References</a>\r\n\r\n<!-- Help content -->\r\n# Title 1\r\nEdit the content using the online *Markdown* editor.\r\n\r\n\r\n"
}
{
"tool" : "tool_skeleton",
"help" : "inputs",
"title" : "Inputs",
"content" : "<!-- Help content -->\r\n# Title 1\r\nEdit the content using the online *Markdown* editor."
}
Apart from generating output files that are to be displayed in the workspace, a tool may also have a custom displayer, a simple HTML template that is going to source a tool output TAR for showing the statistics, logs, etc, associated to each run.
- Make sure the tool has registered an output file that have
data_type: tool_statistics
andfile_type : TAR
. Also,has_custom_viewer : true
. - Edit the basic
output.php
from tool_skeleton, and extend it by preparing the custom display for the newTool.vim tools/newTool/outputs.php
<div class="col-md-12"> <?php $ff = matchFormat_File($tool['input_files']['genes']['file_type'], $inPaths); ?> <?php InputTool_printSelectFile($tool['input_files']['genes'], $rerunParams['fasta1'], $ff[0], false, true); ?> </div>
- Check the result at 'View Results', the option appearing in the violet toolkit dropdown present next to each run folder at the workspace table.
Use any other page as template for creating the new content of myNewPage.php
.
TODO: create a white page to be used as a template
Add headers for including the required CSS and JS scripts. Three files need to be edited:
- public/htmlib/menu.inc.php
- public/htmlib/header.inc.php
- public/htmlib/js.inc.php
VRE has not only tools generated and imported by tool developers. Internal tools are tools that the end user do not select them from the web, but are internally triggered by VRE processes. For instance:
- File uploads from URL triggers 'wget' tool
- Importing BAM file triggers 'BAMval' tool In consequence, the tool executable is distributed together with the VRE code, as well the entry in the tools collection. Following, the instructions on how to configure 'wget' tool
At the database, the 'Tools' collection should already have the 'wget' tool entry. Update the JSON document to adapt it to your instalation requirements: 'excutable' and 'queue':
"executable" : "/ABSOLUTE/PATH/TO/VRE/DATA/apps/internalTools/wget/wget.sh"
"clouds" : {
"mug-bsc" : {
"launcher" : "SGE",
"queue" : "local.q"
}
- 'executable' : Point it to your 'apps' folder. Should have been installed in your VRE data directory as part of the data distributed in
install/data/apps
- 'mug-bsc' : Name of your cloud as set in
install/globals
- 'queue': Name of the queue configured in the host where the executable is found
VRE dataset appear listed in "Get Data > Import example dataset". On select, VRE loads certain data into user's workspace the data.
Add into the sample data directory (GLOBALS['sampleData']
as defined in config/globals.inc.php) a new folder with the following directory tree:
- NewDataSet/
- .sample_metadata.json
- uploads/
- myInput1.txt
- [resultsFolder/]
- [myOutput1.txt]
- [...]
The main directory (here NewDataSet/) can contain n folders, and in turn, each folder can contain n files - subfolders are not supported. You can take as example the Basic sample Data:
install/data/sampleData/basic/
: Basic data preloaded in each new workspace. Currently contains only a simple README file.
For each directory of your dataset, create a .sample_metadata.json
file. The document sets the metadata for each of the elements (files and folder) contained in such directory. You can take those included in the Basic sample data as examples:
-
NewDataSet/
- .sample_metadata.json
- uploads/
- .sample_metadata.json
- myInput1.txt
- [resultsFolder/]
- [.sample_metadata.json]
- [myOutput1.txt]
- [...]
-
install/sampleData/basic/uploads/.sample_metadata.json
: Metadata for 'README' file contained in the 'uploads/' directory
Add a new document in the sampleData
collection defining where the new dataset is to be found
{
"_id" : "NewDataSet_identifier",
"name" : "New Dataset",
"sample_path" : "NewDataSet/",
"status" : 1,
"short_description" : "Short description of the contents for this new dataset"
}
Where "sample_path" is the relative path of the dataset from GLOBALS['sampleData'].