Standardize benchmark code of arctic #545

Open
dimosped opened this issue Apr 20, 2018 · 0 comments

dimosped commented Apr 20, 2018

We currently have multiple unrelated benchmarks covering various scenarios:

  • generic Arctic top-level calls
  • a draft Arctic breakdown solution for tracking where time is spent ((de)compression, numpy, serialization, MongoDB IO)
  • draft Arrow serialization benchmarks

The goal is to create a standard API for benchmarks:

  • requirements

    • specify experiment scenarios in an easy way (e.g. a DSL, or just a dict for fixed steps; see the sketch after this list)
    • collection of results
    • plotting
    • break timings down into components (e.g. compression, numpy object creation, serialization, Mongo IO)
    • ensure there is no performance impact when benchmark mode is disabled
    • reproducible benchmarks
  • goals

    • understand our code's bottlenecks
    • have a standard way to perform and repeat benchmarks
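
One way to meet both the scenario-specification and the zero-overhead requirements is a small recorder whose timing context manager becomes a no-op when benchmark mode is disabled. This is only a minimal sketch: `BenchmarkRecorder`, `measure`, and the scenario dict keys are hypothetical names for illustration, not part of Arctic's API.

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class BenchmarkRecorder(object):
    """Collects per-component timings; a near-zero-cost no-op when disabled."""

    def __init__(self, enabled=True):
        self.enabled = enabled
        self.timings = defaultdict(list)  # component name -> list of seconds

    @contextmanager
    def measure(self, component):
        # When benchmark mode is disabled, yield immediately so the
        # instrumented code path pays almost nothing for the hook.
        if not self.enabled:
            yield
            return
        start = time.time()
        try:
            yield
        finally:
            self.timings[component].append(time.time() - start)


# A fixed-steps scenario expressed as a plain dict -- the simplest form
# of the "DSL or just a dict" requirement. All keys are illustrative.
scenario = {
    'name': 'versionstore_write_read',
    'library': 'bench.vs',
    'repeats': 5,
    'components': ['serialize', 'compress', 'mongo_io'],
}

recorder = BenchmarkRecorder(enabled=True)
with recorder.measure('compress'):
    pass  # e.g. the LZ4 compression step of a write would go here
print(dict(recorder.timings))
```

Keeping the enabled/disabled check inside `measure` means the instrumentation can stay in the production code path permanently, which also helps make benchmarks reproducible across runs.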

A skeleton of benchmarks exists in the top-level directory, benchmarks.
There are some very basic examples and a readme (https://github.com/manahl/arctic/blob/master/benchmarks.md), but these should be expanded to cover all the storage engines and some more involved use cases and examples (e.g. chunkstore with numerics only vs. chunkstore with strings, version store with pickled objects, etc.). A sketch of one such benchmark follows.
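
As a concrete starting point, a chunkstore suite contrasting numeric-only and string data might look like the sketch below. It assumes an asv-style layout (setup/teardown plus time_* methods), a MongoDB instance reachable on localhost, and illustrative library names, symbols, and data sizes; the Arctic calls themselves (initialize_library, write, read, delete_library) are the public chunkstore API.

```python
import numpy as np
import pandas as pd
from arctic import Arctic, CHUNK_STORE


class ChunkstoreNumericVsStrings(object):
    """Times chunkstore round trips for numeric-only vs. string DataFrames."""

    def setup(self):
        # Assumes a MongoDB instance reachable on localhost.
        self.store = Arctic('localhost')
        self.store.initialize_library('bench.chunkstore', lib_type=CHUNK_STORE)
        self.lib = self.store['bench.chunkstore']
        # Chunkstore's default chunker keys on a 'date' index.
        idx = pd.date_range('2016-01-01', periods=100000, freq='S', name='date')
        self.numeric_df = pd.DataFrame({'price': np.random.rand(100000)}, index=idx)
        self.string_df = pd.DataFrame(
            {'ticker': ['SYM%d' % (i % 100) for i in range(100000)]}, index=idx)
        self.lib.write('numeric', self.numeric_df)
        self.lib.write('strings', self.string_df)

    def teardown(self):
        self.store.delete_library('bench.chunkstore')

    def time_write_numeric(self):
        self.lib.write('numeric_w', self.numeric_df)

    def time_write_strings(self):
        self.lib.write('strings_w', self.string_df)

    def time_read_numeric(self):
        self.lib.read('numeric')

    def time_read_strings(self):
        self.lib.read('strings')
```

Equivalent suites for version store (including pickled objects) and tickstore would follow the same pattern, which keeps results comparable across storage engines.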
