MP5
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception
[Paper]
[Project Page]
[Demo]
We are currently organizing the code for MP5.
If you are interested in our work, please star ⭐ our project.
The process of finishing the task "kill a pig with a stone sword during the daytime near the water with grass next to it".
MP5 Framework
Active Perception
Citation
@article{qin2023mp5,
  title={MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception},
  author={Yiran Qin and Enshen Zhou and Qichang Liu and Zhenfei Yin and Lu Sheng and Ruimao Zhang and Yu Qiao and Jing Shao},
  journal={arXiv preprint arXiv:2312.07472},
  year={2023}
}
Created December 14, 2023
Updated December 14, 2023



