Missing data: filling in

@nnethercote left an excellent comment on the rustc tracking issue for compiler performance: https://github.com/rust-lang/rust/issues/48750#issuecomment-400148645.

This issue tracks the implementation of the data fill-in described there.

> * The failure on run 5 occurred in the middle of the series. Use average of the values for runs 4 and 6 for run 5 of benchmark B.
> * The failure on run 1 occurred at the start of the series. Use the value from run 2 for run 1 of benchmark B.
> * The failures on runs 9 and 10 occurred at the end of the series. Use the value from run 8 for runs 9 and 10 of benchmark B.

These are the cases we need to support.

The data structure containing all data is currently a `BTreeMap<Commit, CommitData>`. The rustc-perf server is not aware of what commits exist (i.e., in rust-lang/rust master).

I make the contention that if we are missing data *for a commit* then we should not attempt to fill that data in, at least not yet. It will not affect graphs/summaries since no code will see that data. This also simplifies implementation.

In order to determine whether data needs to be "filled in" I think we'll want to be able to find the "previous" and "next" nodes for every commit. I don't believe the API exposed by BTreeMap trivially allows this for an arbitrary node -- at least not for our keys. As such, we'll want to have a separate list of keys (probably `Vec<Commit>`, maybe `Vec<CommitHash>`) and a `HashMap<Commit, usize>` to find ourselves in that vector. We can then find the previous/next commits.

This index should be populated when reloading data, not dynamically created on each lookup.

I believe that mostly settles the implementation concerns here; if a better way exists, I have not thought of it just now, but I'm of course open to it.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Missing data: filling in #249

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Missing data: filling in #249

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions