Graph generator simplexes-based approach

We propose changes to Lemming Repo such that it uses simplexes.

Overview (Röder et al., 2021)

Lemming works in five steps to generate the future output graph. Our approach updates its two steps (Step 1 and Step 3).

Step 1 reads previous versions of input graphs, analyzes simplexes in them, and computes various statistics.

Step 2 evaluates input to determine expressions.

Step 3 uses simplexes to generate the output graph.

Step 4 modifies the output graph using expressions.

Step 5 creates the final version of the output graph.

Execution

Our approach is tested on two datasets Semantic web dog food (SWDF) and LinkedGeoData (LGEO). We have created a class that sets the expressions mentioned in Lemming for these datasets. Thus, Step 2 need not be executed. To generate the graph with the proposed approach, an instance of class GraphGenerationTest should be invoked with the following parameters, and it supports existing parameters of Lemming Repo.

Parameter	Description	Detailed Description
-ds	Input dataset name	This parameter should be set to "swdf" and "lgeo" for SWDF and LGEO datasets.
-nv	Number of vertices in the output graph	For our testing, this parameter was set to "45420" and "591649" for SWDF and LGEO datasets, respectively. We generated the future graph for the year 2015 for both datasets.
-t	Generator to use for creating the future graph	To test the proposed generators, this parameter should be set to "S1" or "S2" or "S3" or "S4". In this proposed thesis, the parameter "S1" corresponds to Generator 1. Similarly, Generator 2 is defined for parameter "S2", and so on.
-mi	Maximum number of retries	When the approach is not able to create a simplex, it retries in multiple iterations. This parameter sets the number of retries until the approach terminates. Its default value is 5000 if not provided as input.

Results

We executed each generator three times for both datasets. Existing generators were also executed for comparison with the proposed generators. The generated result files for the performed execution can be found in the folder "generated_results". This folder also consists of benchmarking results and console logs for the proposed generators.

.result files

The result files are available in result_files.zip.
The zip file has two parent folders: "Lemming" and "Simplex". "Lemming" contains results for existing generators, and the results of the proposed generators are available in "Simplex".
The folders further consist of the sub-folders "SWDF" and "LGD" for the two datasets. The "SWDF" folder contains results about Semantic web dog food, whereas the "LGD" folder contains results for LinkedGeoData.
The files within this folder follow naming conventions such that they end with "_<Generator execution parameter>_r<execution_id>.result". For example, a file name ending with _R_r1.result denotes the result file for the generator invoked with parameter "R" (Existing generator) for the first execution.
Complete Example (Existing generator): result_files > Lemming > LGD > LemmingEx_C_r1.result denotes the result file for the first execution of the generator with parameter "C" for the LinkedGeoData dataset.
Complete Example (Proposed generator): result_files > Simplex > SWDF > LemmingEx_S1_r3.result denotes the result file for the third execution of the generator with parameter "S1" for the Semantic web dog food dataset.
Note: The approach specified within these result files might differ, and the file name indicates the generator. To locate a result file for a specific generator, the file name should be used.

console logs

We have saved console logs for the proposed generators, and they are in console_logs.zip.
They follow the same hierarchy as that defined for .result files.
Example: console_logs > Simplex > LGD > lgeo_S1_r1.txt denotes the console logs for the first execution of the generator with parameter "S1" for the LinkedGeoData dataset.

Benchmarking

Benchmarking was performed using IGUANA, and the generated results are available in benchmarking.zip.
The initial folder hierarchy is same as the previous files. The parent folder name indicates the generators. The dataset-specific folders are defined for them. Then, folders are defined for each generator's execution parameter, and the files are present for every execution run in these folders.
General folder hierarchy: benchmarking > <Generator> > <Dataset name> > <Generator execution parameter> > r<execution_id>
Example: benchmarking > Lemming > LGD > R > r1 - The files in this folder are for the first execution of the existing generator with parameter "R" for the LinkedGeoData dataset.
The files found in a specific folder consists of results for different triple stores evaluated using IGUANA.

Name	Name	Last commit message	Last commit date
Latest commit atulpundir88 Update README.md Dec 30, 2023 7840a9c · Dec 30, 2023 History 753 Commits
config	config	Graph lexicalization	May 28, 2018
generated_results	generated_results	Adding benchmarking results	Dec 23, 2023
iguana	iguana	Updated READ.ME and added scripts.	Apr 22, 2021
lib	lib	Graph lexicalization	May 28, 2018
src	src	Commenting additional developed generator	Dec 20, 2023
.editorconfig	.editorconfig	Added editorconfig file.	Oct 28, 2021
.gitignore	.gitignore	Graph lexicalization	May 28, 2018
LICENSE	LICENSE	Update LICENSE	Nov 23, 2017
README.md	README.md	Update README.md	Dec 30, 2023
pom.xml	pom.xml	Requested changes.	May 10, 2021
run_baseline.sh	run_baseline.sh	Updated READ.ME and added scripts.	Apr 22, 2021
run_dataset.sh	run_dataset.sh	Update run_dataset.sh	Apr 22, 2021
run_lemming.sh	run_lemming.sh	Updated READ.ME and added scripts.	Apr 22, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Graph generator simplexes-based approach

Overview (Röder et al., 2021)

Execution

Results

.result files

console logs

Benchmarking

About

Releases

Packages

Languages

License

atulpundir88/Lemming-Simplexes

Folders and files

Latest commit

History

Repository files navigation

Graph generator simplexes-based approach

Overview (Röder et al., 2021)

Execution

Results

.result files

console logs

Benchmarking

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages