Hello,
I have two datasets of MDP trajectories sampled from the same environment. The datasets share the same observation space, but they differ in their action spaces.
I would like to create an offline RL model that can choose which action to perform from those two action spaces, but I am still unsure about the implementation. Of course, it wouldn't make sense to create a new action space = action_space1 + action_space2.
I think that the first thing to do should be creating a new env like this
but I am still unsure about the changes this would then require, both in multi_action_env.py and in behaviour.py/controller.py.
Thank you so much in advance for your feedback,
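To make the question more concrete, one way to picture such a setup is sketched below. This is only a rough illustration, not the contents of multi_action_env.py: it assumes both action spaces are discrete, and the names `TaggedAction`, `MultiActionEnv` and `merge_datasets` are hypothetical. Each action carries the id of the space it came from, so the two spaces stay separate instead of being concatenated.

```python
from dataclasses import dataclass
from typing import Any, List, Sequence, Tuple


@dataclass(frozen=True)
class TaggedAction:
    """Hypothetical action type: which space was used, and the index inside it."""
    space_id: int  # 0 -> first action space, 1 -> second action space
    action: int    # discrete action index within the selected space (assumption)


class MultiActionEnv:
    """Hypothetical env description: one observation space, several discrete action spaces."""

    def __init__(self, n_actions: Sequence[int] = (3, 5)) -> None:
        # n_actions[i] is the number of actions in space i (sizes are made up)
        self.n_actions = tuple(n_actions)

    def validate(self, a: TaggedAction) -> bool:
        # An action is valid if its space exists and its index is in range
        if not 0 <= a.space_id < len(self.n_actions):
            return False
        return 0 <= a.action < self.n_actions[a.space_id]


Transition = Tuple[Any, int, float]  # (observation, action index, reward)


def merge_datasets(
    ds1: List[Transition], ds2: List[Transition]
) -> List[Tuple[Any, TaggedAction, float]]:
    """Tag every transition with the action space of the dataset it came from."""
    merged = [(obs, TaggedAction(0, a), r) for obs, a, r in ds1]
    merged += [(obs, TaggedAction(1, a), r) for obs, a, r in ds2]
    return merged
```

With a representation like this, the model would first pick `space_id` and then an action within that space, and the merged dataset could be used for training without pretending the two spaces are one.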
Regards