Include lap time in reward function #80

till2 · 2023-08-12T00:41:48Z

Hey there,

I ran some experiments with RL and wondered why the reward function does not directly depend on the lap time. I think it makes a lot of sense to shape the reward to encourage staying in the center and driving with high velocity. However, to get a racing policy that sometimes intentionally breaks these rules (e.g. doesn't stay in the center to take a curve optimally) to really optimize the lap time, I think the reward function should include the lap time in the reward calculation (give a bonus for low lap times).

I've added an idea on how to implement this in the attached commit. Here, an additional reward is given that is inversely correlated to the lap time (high lap time = low reward, low lap time = high reward). It can be scaled with a factor that weights this additional reward versus the other rewards. This factor is currently just eyeballed and probably needs to be tuned for optimal results. But even with the current form, it yielded pretty good experimental results.

Link to a video with a trained PPO (the simulator is set to 4x speed and max. throttle=0.4; PPO is trained without action smoothing and without frame stacking - so just a really simple baseline): https://drive.google.com/file/d/1Ucsrfwqm02PzzJlb76ozMVTX_tqMiIjE/view?usp=sharing

Eager to hear what you think.
Best, Till

Add a bonus reward that scales inversely with the lap time (low lap time = high bonus reward). The scaling factor `bonus_coeff` needs to be tuned for optimal results.

till2 added 2 commits August 12, 2023 02:20

Update the reward function to depend on lap time

0d1ceaa

Add a bonus reward that scales inversely with the lap time (low lap time = high bonus reward). The scaling factor `bonus_coeff` needs to be tuned for optimal results.

small oversight

dfe3d44

till2 changed the title ~~Improve reward function for racing (minimize lap time)~~ Include lap time in reward function Aug 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Include lap time in reward function #80

Include lap time in reward function #80

till2 commented Aug 12, 2023 •

edited

Loading

Include lap time in reward function #80

Are you sure you want to change the base?

Include lap time in reward function #80

Conversation

till2 commented Aug 12, 2023 • edited Loading

till2 commented Aug 12, 2023 •

edited

Loading