Continuous benchmarking of workload motifs #704
Replies: 4 comments
-
The Hail team has a nice benchmark suite that we may want to borrow from.
-
There are many Dask benchmark suites that may also provide inspiration.
-
And of course the Scalable Linear Algebra Benchmark suite from "A comparative evaluation of systems for scalable linear algebra-based analytics" (2018).
-
(Posted by @jeromekelleher) Great idea. For genetic variation data, I'd suggest using simulations from stdpopsim, so that we can capture expected patterns of variation across a few different species. We can also easily simulate very large datasets, which lets us run benchmarks at scale without having to ship around lots of data. Converting the tskit tree sequence output of stdpopsim to Python genotype arrays is straightforward using the `variants` method. I'm happy to help with setting this up.
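For illustration, here is a minimal sketch of that path. The species, model, contig, and sample choices are arbitrary, and stdpopsim call signatures vary between versions, so treat this as a shape of the workflow rather than exact code:

```python
# A minimal sketch of the stdpopsim -> genotype array path.
# Model/contig/sample choices are illustrative; check current stdpopsim docs.
import stdpopsim

species = stdpopsim.get_species("HomSap")
model = species.get_demographic_model("OutOfAfrica_3G09")
contig = species.get_contig("chr22")
samples = {"YRI": 50, "CEU": 50, "CHB": 50}  # samples per population
engine = stdpopsim.get_engine("msprime")
ts = engine.simulate(model, contig, samples)  # a tskit.TreeSequence

# tskit's variants() yields one Variant per site, carrying a numpy
# genotype array over the sample chromosomes.
for variant in ts.variants():
    genotypes = variant.genotypes  # shape: (num_sample_chromosomes,)
```

Because the data is simulated from a seeded model, a benchmark run only needs to ship the simulation parameters, not the genotypes themselves.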
-
Once we have a good understanding of the workload we'd like our toolkit to handle, it would be useful to extract a small set of synthetic tasks that capture the performance-critical aspects of that workload. We can then run these tasks regularly and track the results over time; a rough sketch of such a harness appears below.
Note that these kinds of tasks are often called "dwarfs", but they are also known as "motifs", and I'd prefer to use that term as it's less offensive.
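For illustration, a minimal sketch of such a harness. All names here are hypothetical and not part of any existing suite; in practice an established tool like airspeed velocity (asv) would likely do this job, but the sketch shows the shape of the loop: time each motif, append a timestamped result, and let a scheduled job (e.g. nightly CI) accumulate a performance history:

```python
# Hypothetical recurring "motif" benchmark harness (illustrative only).
import csv
import time
from datetime import datetime, timezone
from pathlib import Path

import numpy as np


def allele_count_motif() -> None:
    """Toy motif: per-site alternate allele counts on a random genotype matrix."""
    rng = np.random.default_rng(seed=0)  # fixed seed so runs are comparable
    genotypes = rng.integers(0, 2, size=(100_000, 200), dtype=np.int8)
    genotypes.sum(axis=1)


MOTIFS = {"allele_count": allele_count_motif}
RESULTS = Path("benchmark_results.csv")


def run_benchmarks() -> None:
    """Time each motif and append a timestamped row to the results file."""
    is_new = not RESULTS.exists()
    with RESULTS.open("a", newline="") as f:
        writer = csv.writer(f)
        if is_new:
            writer.writerow(["timestamp", "motif", "seconds"])
        for name, task in MOTIFS.items():
            start = time.perf_counter()
            task()
            elapsed = time.perf_counter() - start
            writer.writerow(
                [datetime.now(timezone.utc).isoformat(), name, f"{elapsed:.4f}"]
            )


if __name__ == "__main__":
    run_benchmarks()
```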