Project start

Work Done

Watched the YouTube video of the Diffusion Policy presentation. My takeaways:
- Human visual reaction time is ~300 ms. If the training data is collected from a human operator, each action sequence/chunk should span a duration of the same order of magnitude¹.
- Diffusion policy works well in both joint space and action space. However, working in action space requires good inverse kinematics (IK).
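To make the first takeaway concrete, here is a small back-of-the-envelope sketch that converts between chunk length (in control steps) and chunk duration. The 10 Hz control rate and the 8-step chunk are illustrative assumptions, not values taken from the talk; the helper names are hypothetical.

```python
def chunk_duration_ms(n_action_steps: int, control_hz: float) -> float:
    """Wall-clock time covered by one action chunk, in milliseconds."""
    return 1000.0 * n_action_steps / control_hz


def steps_for_duration(target_ms: float, control_hz: float) -> int:
    """Number of control steps whose total duration is closest to target_ms."""
    return max(1, round(target_ms * control_hz / 1000.0))


if __name__ == "__main__":
    control_hz = 10.0  # assumed control rate, for illustration only
    # Steps needed to cover one ~300 ms human reaction time.
    print(steps_for_duration(300.0, control_hz))
    # Duration of an assumed 8-step chunk at the same rate.
    print(chunk_duration_ms(8, control_hz))
```

Under these assumptions a chunk of a few steps already spans hundreds of milliseconds, which is the same order of magnitude as the reaction time.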
Created repository.
Draft plan:
- huggingface/lerobot: start from the training and evaluation scripts there. Maybe reproduce lerobot/diffusion_pusht if feasible.
- Per request, use huggingface/gym-pusht for simulation environment.
- Maybe Material for MkDocs for documentation and report.
- Or maybe just the paper-style, good-old \(\LaTeX\).
- uv for package management? Not sure this would work, since most of the environment requires conda/mamba for non-Python dependencies.
- marimo or Jupyter notebooks for interactive sessions? Or use the Jupyter notebook extension for MkDocs.

Understand the difference between DDPM and DDIM².
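As a starting point, a minimal scalar sketch of the two samplers, assuming a linear beta schedule and replacing the trained noise-prediction network with an oracle that knows the clean sample `x0`. All function names and schedule values are hypothetical and chosen for illustration. The DDPM loop adds fresh noise at every step and visits every training timestep; the DDIM loop (with eta = 0) is deterministic and can skip timesteps.

```python
import math
import random


def make_schedule(num_train_steps=100, beta_start=1e-4, beta_end=0.02):
    """Linear beta schedule and the cumulative products alpha_bar_t."""
    betas = [
        beta_start + (beta_end - beta_start) * t / (num_train_steps - 1)
        for t in range(num_train_steps)
    ]
    alpha_bars, prod = [], 1.0
    for beta in betas:
        prod *= 1.0 - beta
        alpha_bars.append(prod)
    return betas, alpha_bars


def oracle_eps(x_t, x0, alpha_bar_t):
    """Stand-in for the trained network: exact noise for a known x0 (assumption)."""
    return (x_t - math.sqrt(alpha_bar_t) * x0) / math.sqrt(1.0 - alpha_bar_t)


def ddpm_sample(x_T, x0, betas, alpha_bars, rng):
    """Ancestral sampling: one stochastic step per training timestep."""
    x = x_T
    for t in reversed(range(len(betas))):
        eps = oracle_eps(x, x0, alpha_bars[t])
        mean = (x - betas[t] / math.sqrt(1.0 - alpha_bars[t]) * eps) / math.sqrt(1.0 - betas[t])
        noise = rng.gauss(0.0, 1.0) if t > 0 else 0.0  # no noise on the final step
        x = mean + math.sqrt(betas[t]) * noise
    return x


def ddim_sample(x_T, x0, alpha_bars, timesteps):
    """Deterministic DDIM (eta = 0) over a possibly shortened list of timesteps."""
    x = x_T
    for i, t in enumerate(timesteps):
        eps = oracle_eps(x, x0, alpha_bars[t])
        x0_pred = (x - math.sqrt(1.0 - alpha_bars[t]) * eps) / math.sqrt(alpha_bars[t])
        # alpha_bar of the next (less noisy) timestep; 1.0 means fully denoised.
        prev_ab = alpha_bars[timesteps[i + 1]] if i + 1 < len(timesteps) else 1.0
        x = math.sqrt(prev_ab) * x0_pred + math.sqrt(1.0 - prev_ab) * eps
    return x


if __name__ == "__main__":
    betas, alpha_bars = make_schedule()
    rng = random.Random(0)
    x_T = rng.gauss(0.0, 1.0)
    print(ddpm_sample(x_T, 0.5, betas, alpha_bars, rng))            # 100 stochastic steps
    print(ddim_sample(x_T, 0.5, alpha_bars, list(range(99, -1, -10))))  # 10 deterministic steps
```

With a perfect oracle both samplers land on `x0`, so the sketch only illustrates the mechanics (stochastic vs. deterministic updates, full vs. shortened step schedule), not the quality trade-off seen with a real learned model.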
Fiddle with lerobot/diffusion_pusht.
- Understand the workflow.
- Get a feeling for how resource-hungry the training & evaluation scripts are.
Discover the custom pusht dataset.
Perhaps read the paper?