|
1 | 1 | # 🤔 ETA
|
2 | 2 |
|
3 |
| -## Highlight |
4 |
| -We propose "Efficiency through Thinking Ahead" (ETA), an asynchronous dual-system that pre-processes information from past frames using a large model in tandem with processing the current information with a small model to enable real-time decisions with strong performance. |
| 3 | +## Highlight <a name="highlight"></a> |
| 4 | +We propose "**E**fficiency through **T**hinking **A**head" (ETA), an asynchronous dual-system that pre-processes information from past frames using a large model in tandem with processing the current information with a small model to enable real-time decisions with strong performance. |
5 | 5 | <img width="800" alt="CarFormer overview" src="assets/overview.png">
|
6 | 6 |
|
7 |
| -## Abstract |
8 |
| -How can we benefit from large models without sacrificing inference speed, a common dilemma in self-driving systems? A prevalent solution is a dual-system architecture, employing a small model for rapid, reactive decisions and a larger model for slower but more informative analyses. Existing dual-system designs often implement parallel architectures where inference is either directly conducted using the large model at each current frame or retrieved from previously stored inference results. However, these works still struggle to enable large models for a timely response to every online frame. Our key insight is to shift intensive computations of the current frame to previous time steps and perform a batch inference of multiple time steps to make large models respond promptly to each time step. To achieve the shifting, we introduce Efficiency through Thinking Ahead (ETA), an asynchronous system designed to: (1) propagate informative features from the past to the current frame using future predictions from the large model, (2) extract current frame features using a small model for real-time responsiveness, and (3) integrate these dual features via an action mask mechanism that emphasizes action-critical image regions. Evaluated on the Bench2Drive CARLA Leaderboard-v2 benchmark, ETA advances state-of-the-art performance by 8% with a driving score of 69.53 while maintaining a near-real-time inference speed at 50 ms. |
| 7 | +## News <a name="news"></a> |
| 8 | +- **`[2025/06/10]`** [ETA](https://arxiv.org/abs/2506.07725) paper and code release! |
9 | 9 |
|
10 | 10 | ## Results
|
11 | 11 | <img width="800" alt="CarFormer overview" src="assets/results.png">
|
12 | 12 |
|
13 |
| -## Training and Inference |
| 13 | + |
| 14 | +## Table of Contents |
| 15 | +1. [Highlights](#highlight) |
| 16 | +2. [News](#news) |
| 17 | +3. [Results](#results) |
| 18 | +2. [Getting Started](#gettingstarted) |
| 19 | + - [Training](docs/TRAIN_EVAL.md#trainingsetup) |
| 20 | + - [Evaluation](docs/TRAIN_EVAL.md#evalsetup) |
| 21 | +4. [TODO List](#todolist) |
| 22 | +6. [License and Citation](#licenseandcitation) |
| 23 | +7. [Other Resources](#otherresources) |
| 24 | + |
| 25 | +## Getting Started <a name="gettingstarted"></a> |
14 | 26 |
|
15 | 27 | Please refer to [TRAIN_EVAL.md](docs/TRAIN_EVAL.md) for detailed instructions on how to train and evaluate the model.
|
16 | 28 |
|
| 29 | +## TODO List |
| 30 | +- [x] ETA Training code |
| 31 | +- [x] ETA Evaluation |
| 32 | + - [x] Inference Code |
| 33 | + - [ ] Checkpoints |
| 34 | + |
17 | 35 | ## Acknowledgements
|
18 | 36 |
|
19 |
| -This codebase builds on open sourced code from [carla_garage]([email protected]:autonomousvision/carla_garage.git) among others. We thank the authors for their contributions. This project is funded by the European Union (ERC, ENSURE, 101116486) with additional compute support from Leonardo Booster (EuroHPC Joint Undertaking, EHPC-AI-2024A01-060). Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or the European Research Council. Neither the European Union nor the granting authority can be held responsible for them. This study is also supported by National Natural Science Foundation of China (62206172) and Shanghai Committee of Science and Technology (23YF1462000). |
| 37 | +This codebase builds on open sourced code from [CARLA Garage]([email protected]:autonomousvision/carla_garage.git) and [Bench2DriveZoo](https://github.com/Thinklab-SJTU/Bench2DriveZoo/) among others. We thank the authors for their contributions. This project is funded by the European Union (ERC, ENSURE, 101116486) with additional compute support from Leonardo Booster (EuroHPC Joint Undertaking, EHPC-AI-2024A01-060). Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or the European Research Council. Neither the European Union nor the granting authority can be held responsible for them. This study is also supported by National Natural Science Foundation of China (62206172) and Shanghai Committee of Science and Technology (23YF1462000). |
20 | 38 |
|
21 |
| -## Citation |
22 |
| -If you find our project useful for your research, please consider citing our paper with the following BibTeX: |
| 39 | +## License and Citation <a name="licenseandcitation"></a> |
| 40 | +This project is released under the [MIT License](LICENSE). If you find our project useful for your research, please consider citing our paper with the following BibTeX: |
23 | 41 |
|
24 | 42 |
|
25 | 43 | ```bibtex
|
|
0 commit comments