some fixes to benchmark framework including weak scaling of wave equation #577

lslusarczyk · 2023-10-10T23:11:31Z

No description provided.

lslusarczyk · 2023-10-10T23:14:34Z

weak scaling looks OK (not as fast as we want, but at least correct) with this PR and analysis runs fast (~ 1sec each mhp-bench invocation on devcloud)

rscohn2

Thanks for fixing the weak scaling issue.

rscohn2 · 2023-10-11T12:19:38Z

src-python/drbench/drbench/drbench.py

+    "--different-devices",
+    is_flag=True,
+    default=False,
+    help="Ensures there are no multiple ranks on one SYCL device",


Suggested change

help="Ensures there are no multiple ranks on one SYCL device",

help="Ensures there are not multiple ranks on one SYCL device",

rscohn2 · 2023-10-11T12:25:48Z

include/dr/mhp/global.hpp

@@ -129,7 +129,8 @@ inline std::string hostname() {
 inline sycl::queue &sycl_queue() { return __detail::gcontext()->sycl_queue_; }
 inline auto dpl_policy() { return __detail::gcontext()->dpl_policy_; }

-inline sycl::queue select_queue(MPI_Comm comm = MPI_COMM_WORLD) {
+inline sycl::queue select_queue(MPI_Comm comm,


Suggested change

inline sycl::queue select_queue(MPI_Comm comm,

inline sycl::queue select_queue(

It is OK to delete the comm argument so the recommended usage can remain: select_queue(). We don't support or test different comms anyway.

rscohn2 · 2023-10-11T12:35:11Z

benchmarks/gbench/mhp/wave_equation.cpp

@@ -666,7 +673,8 @@ int main(int argc, char *argv[]) {

 static void WaveEquation_DR(benchmark::State &state) {

-  int n = 4000;
+  int n = ::sqrtl(default_vector_size);


It seems strange that the running time is only 1 second because 4000 * 4000 = 16M and default vector size is 2B, but I see you did an actual benchmarking run.

It seems strange that the running time is only 1 second because 4000 * 4000 = 16M and default vector size is 2B, but I see you did an actual benchmarking run.

I need to rerun. I've checked this again and I saw that I had manually specified a size when creating my plot. It was smaller than default. Hence 1sec :( I need to wait with checking this until they devcloud will resolve Invalid account or account/partition combination specified problem.

Ugly hack is to add n/=4 line to change default just in this test. Nice way is to change size of this test in suite in python.

Previous plot was made with problem size 10M, now I've put temporarily an ugly hack and problem size is 500M, it runs in 10sec and gives the following plot

rscohn2 · 2023-10-11T12:40:07Z

src-python/drbench/drbench/drbench.py

@@ -447,6 +461,7 @@ def multi_node(base):
    base.vec_size = [vec_size]
    base.reps = reps
    base.weak_scaling = weak_scaling
+    base.different_devices = different_devices


I am not happy with the code I wrote here. The redundancy in adding arguments is just 1 problem. Now that we understand better what the runner and plotter has to do, I expect something much simpler could work. Something to think about in the future....

rscohn2 · 2023-10-12T13:00:23Z

The appropriate problem size depends on the benchmark. We need a size that will work for performance measurement and a size that is good for a quick test. Using --default-vector-size to control everything will not work. Something like:

most benchmarks rely on default-vector-size as before
default-vector-size is controlled by some switches (--test (short run), --bench (long run), --weak-scaling)
benchmarks like wave equation ignore default-vector-size and look at --test, --bench, --weak-scaling and pick problem size
similar for matrix multiply, fft, ...

lslusarczyk · 2023-10-13T07:40:19Z

The appropriate problem size depends on the benchmark...

I have an idea of writing suites in just flat way - no if-s, more hardcoded params in suites, including problem sizes passed to separate benchmarks. Once I will have ready conception details I will discuss with you.

some fixes to benchmark framework

7236471

fix standalone benches compilation

815fc54

rscohn2 approved these changes Oct 11, 2023

View reviewed changes

lslusarczyk linked an issue Oct 12, 2023 that may be closed by this pull request

fix weak scaling of wave equation #578

Closed

lslusarczyk marked this pull request as ready for review October 12, 2023 07:36

applied Robert's comments, added uglyu hack to wave_equation benchmark

530b3ed

lslusarczyk enabled auto-merge (squash) October 13, 2023 07:59

lslusarczyk added 3 commits October 14, 2023 19:02

compatibility fixes to new select_queue in other benchmarks

c83e721

icpx upload tests logs fails recently, allowing failure

a042496

commented out icpx results publish

2c498ff

lslusarczyk merged commit 3002448 into oneapi-src:main Oct 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

some fixes to benchmark framework including weak scaling of wave equation #577

some fixes to benchmark framework including weak scaling of wave equation #577

Uh oh!

lslusarczyk commented Oct 10, 2023

Uh oh!

lslusarczyk commented Oct 10, 2023 •

edited

Loading

Uh oh!

rscohn2 left a comment

Uh oh!

rscohn2 Oct 11, 2023

Uh oh!

rscohn2 Oct 11, 2023

Uh oh!

rscohn2 Oct 11, 2023

Uh oh!

lslusarczyk Oct 12, 2023

Uh oh!

lslusarczyk Oct 13, 2023

Uh oh!

rscohn2 Oct 11, 2023

Uh oh!

rscohn2 commented Oct 12, 2023

Uh oh!

lslusarczyk commented Oct 13, 2023

Uh oh!

Uh oh!

	help="Ensures there are no multiple ranks on one SYCL device",
	help="Ensures there are not multiple ranks on one SYCL device",

	inline sycl::queue select_queue(MPI_Comm comm,
	inline sycl::queue select_queue(

some fixes to benchmark framework including weak scaling of wave equation #577

some fixes to benchmark framework including weak scaling of wave equation #577

Uh oh!

Conversation

lslusarczyk commented Oct 10, 2023

Uh oh!

lslusarczyk commented Oct 10, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rscohn2 left a comment

Choose a reason for hiding this comment

Uh oh!

rscohn2 Oct 11, 2023

Choose a reason for hiding this comment

Uh oh!

rscohn2 Oct 11, 2023

Choose a reason for hiding this comment

Uh oh!

rscohn2 Oct 11, 2023

Choose a reason for hiding this comment

Uh oh!

lslusarczyk Oct 12, 2023

Choose a reason for hiding this comment

Uh oh!

lslusarczyk Oct 13, 2023

Choose a reason for hiding this comment

Uh oh!

rscohn2 Oct 11, 2023

Choose a reason for hiding this comment

Uh oh!

rscohn2 commented Oct 12, 2023

Uh oh!

lslusarczyk commented Oct 13, 2023

Uh oh!

Uh oh!

lslusarczyk commented Oct 10, 2023 •

edited

Loading