Day 4
Work Done
- Evaluated policies from full training runs.
- Completed dataset analysis notebook.
WIP
- Training CNN-based policy model on state-based dataset.
TODO
- Training
  - Compare results between CNN-based and transformer-based policy models.
  - Plot training curves from the JSON log.
  - Analyze training results and compare them with the original paper.
- Documentation
  - Methodology section: diffusion model training and evaluation.
  - Data preprocessing: how image and low-dimensional tasks use the data.