GitHunt
ZH

This is the official implement of MP5

MP5

SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

[Paper]
[Project Page]
[Demo]

We are currently organizing the code for MP5.
If you are interested in our work, please star ⭐ our project.

The process of finishing the task ''kill a pig with a stone sward during the daytime near the water with grass next to it.''

MP5 Framework

Active Perception

Citation

@article{qin2023mp5,
  title={MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception},
  author={Yiran Qin and Enshen Zhou and Qichang Liu and Zhenfei Yin and Lu Sheng and Ruimao Zhang and Yu Qiao and Jing Shao},
  booktitle={arXiv preprint arxiv:2312.07472},
  year={2023}
}