[question] HER and prioritized experience replay #751

johannes-dornheim · 2020-03-20T11:11:20Z

Hi

In the stable-baselines implementation, HER does not support a prioritized replay buffer. The HER paper states: "Prioritized experience replay (....) is orthogonal to our work and both approaches can be easily combined". So my question is: are there 'deeper reasons' for the lack of support, or is it just a currently missing feature?

Best Regards,
Johannes

Miffyli · 2020-03-20T22:45:36Z

I believe there is no deeper reason for the lack of support, other than that nobody has implemented it. It would require coming up with priorities for the samples in the buffer, and then updating the replay_buffer.py in HER. I am not familiar enough with HER to know how easy a feat this would be. At first glance it does not sound as straightforward as with DQNs.

RyanRizzo96 · 2020-03-27T06:40:21Z

Actually, PER has been shown not to improve performance over HER, hence there is no real motivation to implement it. Not only does PER not improve performance, it also increases computation time substantially.

PER works by prioritising transitions with higher TD-error, which means that the TD-error must be computed for each transition, hence the expensive computational time.
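As a rough illustration of where that cost comes from, here is a minimal sketch of proportional prioritisation as described in the PER paper: each transition gets a priority from its absolute TD-error, sampling probabilities are the normalised priorities, and importance-sampling weights correct for the non-uniform sampling. The function names and the default values for `alpha`, `beta` and `eps` are assumptions made for illustration; none of this is stable-baselines code, and a real buffer would use a sum-tree rather than recomputing everything over plain lists.

```python
import random


def per_probabilities(td_errors, alpha=0.6, eps=1e-6):
    """Sampling probabilities P(i) = p_i^alpha / sum_k p_k^alpha, with p_i = |delta_i| + eps.

    alpha = 0 recovers uniform sampling; larger alpha favours high TD-error more strongly.
    """
    priorities = [(abs(delta) + eps) ** alpha for delta in td_errors]
    total = sum(priorities)
    return [p / total for p in priorities]


def importance_weights(probs, beta=0.4):
    """Importance-sampling weights w_i = (N * P(i))^(-beta), scaled so the largest is 1."""
    n = len(probs)
    weights = [(n * p) ** (-beta) for p in probs]
    max_weight = max(weights)
    return [w / max_weight for w in weights]


def sample_indices(probs, batch_size, rng=random):
    """Draw transition indices (with replacement) according to the given probabilities."""
    return rng.choices(range(len(probs)), weights=probs, k=batch_size)
```

Note that the TD-errors have to be refreshed after every update for the sampled transitions, which is the extra work on top of a uniform buffer. With HER, the relabelled goals change the reward and therefore the TD-error, so priorities would have to be defined for relabelled transitions as well.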

Prioritised Sequence Experience Replay (PSER) outperforms PER but has not been implemented with HER.

There are other methods which improve the sampling efficiency of HER (such as Energy Based Prioritisation), but PER is not one of them. I have put my name forward to implement this in the new PyTorch version.

Miffyli · 2020-03-27T22:50:34Z

Ok, thanks for the info! Indeed any such new features would be things for the PyTorch version :).

araffin · 2020-05-09T13:27:49Z

I added it to possible features for Stable-Baselines3 1.1+ in DLR-RM/stable-baselines3#1

johannes-dornheim changed the title ~~HER and prioritized experience replay~~ [question] HER and prioritized experience replay Mar 20, 2020

Miffyli added the question Further information is requested label Mar 20, 2020

araffin closed this as completed May 9, 2020

araffin added enhancement New feature or request v3 Discussion about V3 labels May 9, 2020

vwxyzjn mentioned this issue Jan 28, 2022

Refactor value based methods vwxyzjn/cleanrl#102

Merged
