Goal: make a meaningful manipulation task that can be trained in under 10 mins on a large GPU

LeRobot pushT:

Original keypoints pushT:

batch size: 256, n_obs: 2, SR 0.916 (91.6%) in ~17 mins, 1.7 GB VRAM
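
Quick sanity check on the numbers above, as a rough sketch: SR is assumed here to mean the fraction of evaluation rollouts that count as a success, and the gap to the goal is just measured time over target time. The helper names and the 500-episode outcome list are hypothetical, made up for illustration; they are not the actual eval logs or any library API.

```python
def success_rate(outcomes):
    """Fraction of evaluation episodes flagged as successful (assumed meaning of SR)."""
    outcomes = list(outcomes)
    if not outcomes:
        raise ValueError("need at least one evaluation episode")
    return sum(bool(o) for o in outcomes) / len(outcomes)


def required_speedup(measured_minutes, target_minutes):
    """How many times faster training must get to fit the time budget."""
    if target_minutes <= 0:
        raise ValueError("target_minutes must be positive")
    return measured_minutes / target_minutes


# Hypothetical outcomes chosen only to reproduce SR = 0.916; not real rollouts.
outcomes = [True] * 458 + [False] * 42

sr = success_rate(outcomes)
speedup = required_speedup(measured_minutes=17, target_minutes=10)

print(f"SR: {sr:.3f}")
print(f"speedup needed for the 10 min goal: {speedup:.2f}x")
```

With ~17 mins measured against the 10 min goal, this run would need to get roughly 1.7x faster, assuming the success rate holds.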