Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images
* Equal Contribution, ✉ Corresponding Authors
1 The University of Hong Kong, 2 Shanghai AI Laboratory, 3 The Chinese University of Hong Kong
Note that the front and back images have already been edited at this point. The final 3D object is then obtained in three steps.
Step 1: We use the same image encoder (DINO-v2) to get the front-view and back-view image features.
Step 2: The two image features are processed separately by the LoRA Triplane Transformer, but both use the same front-view camera extrinsics.
Step 3: After obtaining two triplanes, we 'tailor' the two triplane features through rotation and Viewpoint Cross-Attention to obtain the 3D object.
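The 'tailoring' in Step 3 can be illustrated with a small, dependency-free sketch. It is not the authors' implementation: the triplane layout (a dict of `xy`, `xz`, `yz` grids), the choice of z as the vertical axis, the 180-degree rotation, the projection-free single-head attention, and the residual fusion are all assumptions made for illustration, and every function name is hypothetical. Steps 1 and 2 (DINO-v2 encoding and the LoRA Triplane Transformer) are not modelled; the sketch starts from two already-computed triplanes.

```python
import math

def rotate_triplane_180(triplane):
    """Rotate a triplane 180 degrees about the z axis: (x, y, z) -> (-x, -y, z).

    Assumed layout: each plane is a nested list indexed [first_axis][second_axis],
    holding one feature vector per cell.
    """
    flip_rows = lambda plane: plane[::-1]                   # negate first axis
    flip_cols = lambda plane: [row[::-1] for row in plane]  # negate second axis
    return {
        "xy": flip_cols(flip_rows(triplane["xy"])),  # x and y are both negated
        "xz": flip_rows(triplane["xz"]),             # only x is negated
        "yz": flip_rows(triplane["yz"]),             # only y is negated
    }

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def cross_attention(queries, keys_values):
    """Single-head scaled dot-product attention with no learned projections
    (a simplification; a real layer would learn Q, K and V projections)."""
    dim = len(queries[0])
    attended = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim)
                  for k in keys_values]
        weights = softmax(scores)
        attended.append([sum(w * v[j] for w, v in zip(weights, keys_values))
                         for j in range(dim)])
    return attended

def flatten(plane):
    """Turn a 2D grid of feature vectors into a flat token list."""
    return [feat for row in plane for feat in row]

def fuse_triplanes(front, back):
    """Align the back triplane with the front frame, then let front tokens
    attend to back tokens. Residual addition is an assumed fusion rule."""
    back_aligned = rotate_triplane_180(back)
    fused = {}
    for name in ("xy", "xz", "yz"):
        front_tokens = flatten(front[name])
        back_tokens = flatten(back_aligned[name])
        attended = cross_attention(front_tokens, back_tokens)
        fused[name] = [[f + a for f, a in zip(ft, at)]
                       for ft, at in zip(front_tokens, attended)]
    return fused
```

The rotation is needed in this sketch because both triplanes are predicted under the front-view camera extrinsics (Step 2), so the back-view triplane has to be turned around before its features line up spatially with the front-view ones.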
@misc{qi2024tailor3dcustomized3dassets,
title={Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images},
author={Zhangyang Qi and Yunhan Yang and Mengchen Zhang and Long Xing and Xiaoyang Wu and Tong Wu and Dahua Lin and Xihui Liu and Jiaqi Wang and Hengshuang Zhao},
year={2024},
eprint={2407.06191},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2407.06191},
}