- Java JRE 1.7 (64-bit) or newer
- MySQL Cluster
- libaio1 (64-bit)
- The front node (the node from which the other processes will be started) should be able to access the other nodes via SSH using public-key authentication.
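For instance, a minimal way to set up public-key access from the front node might look like the following (the user and host names below are placeholders):
$ ssh-keygen -t rsa
$ ssh-copy-id user@node1-hostname
$ ssh user@node1-hostname hostname # should print the hostname without prompting for a password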
-
Clone repository:
$ git clone https://github.com/vssousa/d-chiron.git
-
Download MySQL Cluster:
Linux Generic (glibc 2.5) (x86, 64-bit), Compressed TAR Archive
-
Unpack MySQL Cluster:
$ tar -xvvf mysql-cluster-gpl-X.Y.Z-platform.tar.gz
-
Change working directory to d-Chiron:
$ cd d-chiron/d-chiron
-
Setup MySQL Cluster for d-Chiron:
$ ./setupmysql.sh <mysql_path> <mysql_directory> <data_directory> <number_of_nodes>
mysql_path: full path to the unpacked MySQL Cluster directory
mysql_directory: root directory where the MySQL Cluster directories will be placed
data_directory: base directory where data from the data nodes will be placed
number_of_nodes: maximum number of nodes that will run MySQL Cluster. If there are, for example, 5 database nodes (1 mgm_node, 2 sql_nodes, 2 data_nodes), this number should be at least 5.
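For illustration, a hypothetical invocation (all paths are placeholders and X.Y.Z stands for the downloaded version) could be:
$ ./setupmysql.sh /opt/mysql-cluster-gpl-X.Y.Z-linux-glibc2.5-x86_64 /opt/mysqlc /opt/mysqlc/data 5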
-
Edit dbcluster/installation.properties:
$ vi dbcluster/installation.properties
- Follow this link for an example
- Define installationRootDirectory and base_data_dir with the same values you used in Setup step 5 for mysql_directory and data_directory, respectively.
- Choose the number of nodes used by MySQL Cluster:
- number_of_datanodes
- number_of_sqlnodes
- number_of_mgm_node
- All other configurations may be left as in the example file.
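As an illustration only, a hypothetical fragment of installation.properties using the placeholder directories from the setup step and a 5-node cluster (check the linked example file for the exact property names and the remaining settings):
installationRootDirectory=/opt/mysqlc
base_data_dir=/opt/mysqlc/data
number_of_datanodes=2
number_of_sqlnodes=2
number_of_mgm_node=1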
We will use the Risers Fatigue Analysis workflow as an example.
-
Change working directory to rfa-synthetic/rfa-dchiron:
$ cd ../rfa-synthetic/rfa-dchiron
-
Edit the XML file of the RFA workflow:
$ vi rfa-dchiron-wf.xml
- Replace occurrences of full/to/path/repositoryroot/ with the full path to this repository root directory
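If you prefer not to edit the file by hand, a sed one-liner can perform the substitution (the target path /home/user/d-chiron is only an illustration):
$ sed -i 's|full/to/path/repositoryroot/|/home/user/d-chiron/|g' rfa-dchiron-wf.xml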
-
Edit the machines.conf file with nodes hostnames, ports, and ranks for MPI initialization:
$ vi machines.conf
Example:
# Number of processes
2
# Protocol switch limit
131072
# Entry in the form of machinename@port@rank
node1-hostname@20348@0
node2-hostname@20348@1
-
Edit the input.dataset file to adjust the amount of data and parallel tasks your workflow will process. This is a CSV file. Each row in this dataset represents one subgraph of task executions of an RFA run. The meaning of each field is as follows.
ID: identifier of the subgraph
SPLITMAP: factor that controls the average cost of each Data Gathering task execution.
SPLITFACTOR: factor that controls the amount of parallel tasks that will be generated (at the Data Gathering activity).
MAP1 and MAP2: factors that control the average cost of each Preprocessing and Stress Analysis task execution.
FILTER1 and FILTER2: factors that control the average cost of each Stress Critical Case Selection and Curvature Critical Case Selection task execution, respectively.
F1 and F2: factors that control the retention of tuples (in the dataflow) in the Stress Critical Case Selection and Curvature Critical Case Selection activities, respectively.
REDUCE: factor that controls the average cost of each Compress Results task execution.
REDUCEFACTOR: factor that controls the amount of resulting tuples after going through the reduce (Compress Results) activity.
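Purely as an illustration, a hypothetical row might look like the one below, assuming the columns follow the order in which they are described above; the actual delimiter, column order, and sensible values should be taken from the input.dataset file shipped with the repository:
1;1000;32;50000;50000;2000;2000;0.5;0.5;10000;8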
For the theoretical background behind the choices for these factors, please see:
Fernando Chirigati, Vitor Silva, Eduardo Ogasawara, Jonas Dias, Fabio Porto, et al. Evaluating Parameter Sweep Workflows in High Performance Computing. SWEET'12: International Workshop on Scalable Workflow Enactment Engines and Technologies, 2012, Scottsdale, Arizona, United States. pp.10.
The source code for the Risers Fatigue Analysis workflow tasks generator is available here.
Still using the Risers Fatigue Analysis workflow as an example, follow the commands below to run it in d-Chiron. The scripts used in this section are here.
-
Start MySQL Cluster instances:
$ ./invokestartdb.sh <mysql_root_directory>
-
Submit workflow execution:
$ ./invokewf.sh <mysql_root_directory>
-
Shutdown MySQL Cluster instances:
$ ./invokeshutdown.sh <mysql_root_directory>
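For example, a complete run using the hypothetical /opt/mysqlc directory from the setup step would be:
$ ./invokestartdb.sh /opt/mysqlc
$ ./invokewf.sh /opt/mysqlc
$ ./invokeshutdown.sh /opt/mysqlc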
Optionally, you can monitor the execution by specifying periodic queries in the workflow database.
-
Add new monitoring queries before or during workflow execution, specifying the monitoring interval. Example:
insert into dchiron.umonitoring(monitoringinterval, monitoringquery) values ( 10, 'select avg(g.map2) from dchiron.eactivation artask, rfa.ogatherdatafromrisers g, rfa.opreprocessing p, rfa.oanalyzerisers r where r.previoustaskid = p.nexttaskid and p.previoustaskid = g.nexttaskid and r.previoustaskid = artask.taskid and (artask.endtime-artask.starttime) > (select avg(endtime-starttime) from scc2.eactivation where actid=3);' );
-
Start the Monitor module:
$ java -jar Monitor.jar --start
-
For each monitoring query registered in the UMonitoring table, the result set is stored in the UMonitoringResult table during workflow execution, at each time interval defined for the query.
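To inspect what has been collected so far, you can query that table directly; for example (assuming it lives in the dchiron schema, like umonitoring):
select * from dchiron.umonitoringresult;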
If needed, d-Chiron allows you to steer the execution by adapting the workflow data at runtime.
-
First, define the selection predicates that determine the slice to be removed. For example, map1 > 70000.
-
Then, run a steering query to cut off the slice:
$ java -jar Steer.jar --xml=rfa-dchiron-wf.xml --user=peter --type=REMOVE_TUPLES --tasksquery="select nexttaskid from wf1.ogatherdatafromrisers where map1 > 70000"