
How to construct an RLDS dataset? #144

Open
Loong1512 opened this issue Nov 25, 2024 · 0 comments

I want to fine-tune the Octo model, but I don't know how to construct my own RLDS dataset. I have already built a reinforcement learning environment using dm_env, performed simulation with Isaac Gym, and generated an RLDS dataset using envlogger.

for i in range(FLAGS.num_episodes):
  timestep = env.reset()
  # dm_env episodes end when timestep.last() is True
  while not timestep.last():
    # TODO: HOW TO GENERATE ACTION
    action = np.random.uniform(-3, 3, size=(9,)).astype(np.float32)
    timestep = env.step(action)
    env.render()
env.close()

However, for the action part, how should I control the robotic arm so that it produces grasping trajectories? Should the actions be learned through a reward function, collected by teleoperation with a keyboard or gamepad, or captured with motion capture? I'm quite confused about this. Could someone describe the general approach?
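For collecting demonstration data in simulation, one common approach (alongside teleoperation and motion capture) is a scripted policy: hand-code a sequence of waypoints (approach, descend, grasp, lift) and interpolate between them to produce the per-step actions that get logged. The sketch below is only an illustration, not Octo's or envlogger's API: the waypoint values and the 9-dimensional joint-target action are hypothetical placeholders for whatever your Isaac Gym arm expects.

```python
import numpy as np

def scripted_grasp_actions(start, above, grasp, steps_per_segment=20):
    """Linearly interpolate joint targets through approach -> descend -> lift.

    Each segment contributes `steps_per_segment` actions, so the returned
    trajectory has len(waypoints) - 1 segments of that length.
    """
    waypoints = [start, above, grasp, above]  # approach, descend to grasp, lift
    traj = []
    for a, b in zip(waypoints[:-1], waypoints[1:]):
        for t in np.linspace(0.0, 1.0, steps_per_segment, endpoint=False):
            traj.append((1.0 - t) * a + t * b)
    return np.stack(traj).astype(np.float32)

# Hypothetical 9-DoF joint targets for a single grasp attempt.
start = np.zeros(9, dtype=np.float32)
above = np.full(9, 0.5, dtype=np.float32)
grasp = np.full(9, 1.0, dtype=np.float32)
actions = scripted_grasp_actions(start, above, grasp)  # shape (60, 9)
```

Inside the episode loop, each row of `actions` would replace the random action before calling `env.step(action)`, so envlogger records a coherent grasping trajectory instead of noise.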

evelynmitchell pushed a commit to evelynmitchell/octo that referenced this issue Dec 31, 2024