This is the simulation environment that will be used for the LeRobot Hackathon 2024.
- Train policies in the simulation environment of the Koch arms to perform basic manipulation tasks.
- Use the TD-MPC implementation in this branch of LeRobot to train your policies to achieve different tasks.
- If you have a leader arm, you can use the `control_sim_robot.py` script in my branch to teleoperate the sim robot and collect datasets that you can couple with the RL part.
We will be using the sim environment in this branch of my fork of the original gym-lowcostrobot. The environment is based on MuJoCo and defines several manipulation tasks like pushing a cube to a target and pick-and-place.
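As a quick sanity check, you can instantiate one of the tasks directly through gymnasium. Below is a minimal sketch, assuming the envs are registered on import as in the upstream gym-lowcostrobot README; the task name `PushCube-v0` matches the `handle` used in the config example further down.

```python
import gymnasium as gym
import gym_lowcostrobot  # noqa: F401 -- importing registers the envs

# Create the push task and run a short rollout with random actions.
env = gym.make("PushCube-v0", render_mode="rgb_array", max_episode_steps=200)
obs, info = env.reset(seed=0)
for _ in range(200):
    action = env.action_space.sample()  # random action, just to exercise the loop
    obs, reward, terminated, truncated, info = env.step(action)
    if terminated or truncated:
        obs, info = env.reset()
env.close()
```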
- The main task of the hackathon is to mimic the real-world task that the participants with the arm will try to solve, and to find ideas that improve the learning performance in sim, which can then benefit learning in the real world. What can you change?
- The hyperparameters of TD-MPC. You can tune different parameters of the algorithm to improve the overall learning performance.
- The elements of the sim environment: reward shaping, action space, observations...
- You are free to design and solve your own task, or to work on the much harder tasks in the environment (pick-and-place, stacking two cubes).
- It is important to note that the rewards for many of these environments are not well tuned, and it will be up to you to improve their design and find ideas to make them work better; one way to start experimenting is sketched below.
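For example, reward shaping can be prototyped without touching the env source by wrapping the environment. The observation keys `cube_pos` and `target_pos` below are assumptions for illustration; check `gym_lowcostrobot/envs/*` for the actual observation dictionary of each task.

```python
import gymnasium as gym
import numpy as np
import gym_lowcostrobot  # noqa: F401 -- importing registers the envs

class DistanceShapedReward(gym.Wrapper):
    """Add a dense term proportional to the negative cube-to-target distance."""

    def step(self, action):
        obs, reward, terminated, truncated, info = self.env.step(action)
        # Hypothetical keys -- replace with the task's real observation keys.
        shaping = -np.linalg.norm(obs["cube_pos"] - obs["target_pos"])
        return obs, reward + 0.1 * shaping, terminated, truncated, info

env = DistanceShapedReward(gym.make("PushCube-v0"))
```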
Get familiar with the following code:
- `lerobot/scripts/control_sim_robot.py`: code to run control in sim if you have a leader arm. Choose the record option to record a dataset in sim.
- `lerobot/scripts/train.py`: updated in my branch to work with the sim environment and with different platforms.
- `gym_lowcostrobot/envs/*`: the gym environment for each task; check how the observations, control, and rewards are computed.
- `gym_lowcostrobot/assets/*`: the environment description that defines the scene, objects, and physics (a minimal loading sketch follows this list).
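To poke at the scene definitions directly, the MuJoCo Python bindings can load an MJCF file on their own. The asset path below is illustrative, so point it at whichever XML file you find under `gym_lowcostrobot/assets/`.

```python
import mujoco

# Load an MJCF scene description (illustrative path -- adjust to the repo layout).
model = mujoco.MjModel.from_xml_path("gym_lowcostrobot/assets/low_cost_robot_6dof/push_cube.xml")
data = mujoco.MjData(model)

mujoco.mj_step(model, data)  # advance the physics by one timestep
print(model.nq, model.nu)    # generalized coordinates and actuator count
```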
- MuJoCo documentation.
- TD-MPC and FOWM papers.
- Highly Recommended: the videos by Alexander Soare that explain TD-MPC (part 1 and part 2) and the implementation of TD-MPC in LeRobot.
For the sim env, you need to define a config file in `lerobot/configs/env` that takes into account the proper gym arguments to set up the simulation. Example for the push task:
```yaml
# @package _global_

fps: 50

env:
  name: lowcostrobot
  fps: ${fps}
  handle: PushCube-v0
  state_dim: 6
  action_dim: 6
  task: PushCube
  gym:
    render_mode: rgb_array
    max_episode_steps: 200
    actions_in_degrees: true
    reward_type: 'dense'
  calibration:
    axis_directions: [-1, -1, 1, -1, -1, -1]
    offsets: [0, -0.5, -0.5, 0, -0.5, 0] # factor of pi
  eval:
    use_async_envs: false
  state_keys:
    observation.state: 'arm_qpos'
    observation.velocity: 'arm_qvel'
```
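With this file saved under `lerobot/configs/env/` (e.g. as `lowcostrobot.yaml`), training should be launchable through the usual Hydra overrides, along the lines of `python lerobot/scripts/train.py env=lowcostrobot policy=tdmpc`. The exact policy config name and the available overrides (e.g. TD-MPC hyperparameters under `policy.*`) depend on the branch, so check `lerobot/configs/` first.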