You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello, I was wondering, when using p2e algorithm, why do we pass actor_model parameters into curious_actor_optimizer? Same happens with value_model_optimizer and value_model.
In the way I see it, this is bug because curious_actor_model doesn't learn anything while exploring environment with p2e and the way to fix it is to pass curious_actor_model parameters and curious_value_model parameters into curious_actor_optimizer and value_model_optimizer respectively.
The text was updated successfully, but these errors were encountered:
plan2explore-pytorch/main.py
Line 149 in 13c13bd
Hello, I was wondering, when using p2e algorithm, why do we pass actor_model parameters into curious_actor_optimizer? Same happens with value_model_optimizer and value_model.
In the way I see it, this is bug because curious_actor_model doesn't learn anything while exploring environment with p2e and the way to fix it is to pass curious_actor_model parameters and curious_value_model parameters into curious_actor_optimizer and value_model_optimizer respectively.
The text was updated successfully, but these errors were encountered: