Goal: make a meaningful manipulation task that can be trained in under 10 mins on a large GPU
Lerobot pushT:
Original keypoints pushT:
batch size: 256, n_obs: 2, 0.916% SR in ~17 mins, 1.7 GB VRAM