ub-benchmark-infra

Collection of scripts used for benchmarking various systems against UB optimizations.

Useful information

To create a new benchmark for phoronix-test-suite (PTS) you need to use the template found in template/. The template is a modified compress-pbzip2. Modify it as per your use case, put the modified template/ in /var/lib/phoronix-test-suite/test-profiles/local/ (at least that is for Debian 11) and run:

phoronix-test-suite force-install yourtestsuite

Read the content of the files in template/ if you want to understand what they are doing.

If you want to install it with your compiler configuration run:

CC=/path/to/c/compiler CXX=/path/to/cpp/compiler CFLAGS='-f1 -f2'
CXXFLAGS='-f3 -f4' phoronix-test-suite force-install yourtestsuite

Then to run the benchmark run:

phoronix-test-suite benchmark yourtestsuite

To delete a benchmark run:

rm -rf /var/lib/phoronix-test-suite/installed-tests/local/yourtestsuite
rm -rf /var/lib/phoronix-test-suite/test-profiles/local/yourtestsuite

UB benchmarking algorithm

for ts in test-suites
	res = {}
	for ub in undef_behaviors:
		compile_testsuite(ts, {compiler, ub})
		run_testsuite(ts)
		res += fetch_testsuite_results()
	compare_results(res)

The only problem that I see here is that at one moment I will want to combine multiple UBs to test their impact. I don't want to try all combinations of UBs, instead it would be helpful to do an analysis to see what UBs are required to be tested.

Filtered test suites

We don't need to benchmark all the tests provided by phoronix. We need to filter them so that we get only the ones that are compiled with a C compiler. Now the question is: if a test is written in multiple programming languages, do we want to benchmark it too? I would say yes because if we see an impact the only place where it can have effect is in the part compiled with our modified compiler.

I did a simple filtering in the tests provided by phoronix with the following command:

git clone https://github.com/phoronix-test-suite/test-profiles.git && cd test-profiles && grep -orP --include=*.xml '(?<=<ExternalDependencies>).*?(?=</ExternalDependencies>)' | grep "build-utilities"

The build-utilities dependency is translated in PTS into the following two dependencies: build-essential, autoconf. There are a lot of results for the above command.

However there is one more step before benchmarking the test profiles against UB modifications. Some of the test profiles contain projects that are stupid. For example take pts/compress-pbzip2, please take a look at the makefile in http://www.bzip.org/1.0.6/bzip2-1.0.6.tar.gz and notice that it overwrites the CC set in the environment. In this case I need to manually modify the makefile and create a new test profile. I expect this problem with other test profiles too.

Profiles

all.profiles.txt has all test profiles that are dependent on build-utilities. profiles.txt has profiles extracted from all.profiles.txt that respect the CC/CXX variables. rand-profiles.txt has 25 profiles randomly chosen from profiles.txt. Compiling all profiles.txt takes 22 hours which is too much. We reduce the number of profiles to move faster.

Showing results

The plan is to merge the results for the profiles compiled with optimization flags and the ones without optimiztions flags. After the results are generated run:

phoronix-test-suite merge-results r1 r2 <-- will generate merged-r
phoronix-test-suite show-result merged-r

I will use some smart grep to put togheter r1 and r2 that will be then merged. After that I will need to manually check every merged results and interpret it.

Name		Name	Last commit message	Last commit date
Latest commit History 96 Commits
htmls-1ghz.arm		htmls-1ghz.arm
htmls-3ghz.arm		htmls-3ghz.arm
htmls.x86		htmls.x86
plots		plots
results.old		results.old
results		results
text-results		text-results
toolchain		toolchain
README.md		README.md
categorized-profiles.txt		categorized-profiles.txt
csv-to-impact.awk		csv-to-impact.awk
csv-to-impact.sh		csv-to-impact.sh
find-impact.sh		find-impact.sh
generate-text-results.sh		generate-text-results.sh
get-changed-funcs.sh		get-changed-funcs.sh
get-results.sh		get-results.sh
install-profiles.sh		install-profiles.sh
merge-results.sh		merge-results.sh
prepare-benchmark-env.sh		prepare-benchmark-env.sh
record-size.sh		record-size.sh
results-to-csv.sh		results-to-csv.sh
results-to-html.sh		results-to-html.sh
run-profiles.sh		run-profiles.sh
run.sh		run.sh
run2.sh		run2.sh
sort-bms.sh		sort-bms.sh
user-config.xml		user-config.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ub-benchmark-infra

Useful information

UB benchmarking algorithm

Filtered test suites

Profiles

Showing results

About

Uh oh!

Releases

Packages

Languages

ManuelJBrito/ub-benchmark-infra

Folders and files

Latest commit

History

Repository files navigation

ub-benchmark-infra

Useful information

UB benchmarking algorithm

Filtered test suites

Profiles

Showing results

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages