To submit to the leaderboard, submit a pull request that adds your results to the Markdown table below. The table should be sorted by increasing time in milliseconds.
You should be running on a single H100 with batch size 1, sequence length 16384, d_model=1024 and num_heads=16.
The top 3 submissions will receive a prize at the end of the quarter. To make this fair, we will reorder the top 5 scoring students based on our own tests. Make sure you save a snapshot of your best code so it can be reproduced by us! We will reach out to the top 5 students after results have stabilized.
In your pull request description, you should also include:
- The time you recorded
- A description of what you did
| Name | Forward + Backward Time (ms) | Verification status (leave empty) |
|---|---|---|
| Herman Brunborg | 5.364ms | Verified |
| Matthew Noto | 6.778ms | Verified |
| Varun Desai | 11.07ms | Verified |
| Prateek Varshney | 22.59ms | Verified |
| Stephen Ge | 28.45ms | |
| naive baseline | 80 ms | Verified |