Project Page | Paper
This is the official implementation of Instant3dit. We provide the weights and inference code for the multiview 3D inpainting network, which enables fast editing of 3D objects by reconstructing the inpainted views into various representations using corresponding LRMs (Large Reconstruction Models).
The code has been tested on Python 3.8 and 3.10 with PyTorch 2.1.2 and 2.7.0, both with CUDA 11.8, but should work for all versions in between.
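A quick way to confirm your environment matches these versions (a standalone check, not part of the repo):

```python
# Environment sanity check: print the PyTorch and CUDA versions in use.
import torch

print("torch:", torch.__version__)   # tested: 2.1.2 and 2.7.0
print("cuda:", torch.version.cuda)   # tested: 11.8
print("gpu available:", torch.cuda.is_available())
```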
- run
pip install -r requirements.txt
to install dependencies
- download the multiview inpainting SDXL weights
- replace Path/to/Instant3dit_model in the default argument with the path to the SDXL multiview inpainting checkpoint folder downloaded in the previous step.

To test, run demo_mv_images.sh.
Note: We use the diffusers library, so you must have a Huggingface access token in a file called TOKEN at the root of the project.
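For orientation, here is a minimal sketch of how the checkpoint folder and the TOKEN file fit together when loading with diffusers. It assumes the checkpoint is in diffusers format and that an SDXL inpainting pipeline class applies; the repo's actual inference code may load things differently.

```python
# Minimal loading sketch (assumptions: diffusers-format checkpoint and SDXL
# inpainting pipeline class; the repo's inference script may differ).
from pathlib import Path

import torch
from diffusers import StableDiffusionXLInpaintPipeline

hf_token = Path("TOKEN").read_text().strip()  # Huggingface access token (see note above)

pipe = StableDiffusionXLInpaintPipeline.from_pretrained(
    "Path/to/Instant3dit_model",  # the checkpoint folder downloaded above
    torch_dtype=torch.float16,
    token=hf_token,               # used when diffusers fetches gated components
).to("cuda")
```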
Disclaimer: The results in the paper were obtained using internal Adobe LRMs for reconstruction to various 3D representations (NeRF, meshes, and 3DGS). We substitute these with the best open-source offerings we could find; currently, they are not on par with the Adobe models. Newer and more powerful open-source LRMs can be integrated in the future (PRs welcome).
Our inference code supports using these LRMs seamlessly.
We use InstantMesh for mesh reconstruction; all the required dependencies are already in requirements.txt.
Locally clone InstantMesh:
git clone git@github.com:TencentARC/InstantMesh.git
and replace Path/to/InstantMesh in the default argument for instantmesh_path with the path to the InstantMesh folder.
To test, run demo_mesh.sh.
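To make the "replace the default argument" steps concrete, this is the kind of argparse default being edited. The sketch below is hypothetical; only the argument names instantmesh_path and, further down, geoLRM_path come from this README, and the real definitions live in the repo's inference scripts.

```python
# Hypothetical sketch of the default argument to edit; the real definition is
# in the repo's inference script. geoLRM_path (below) works the same way.
import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    "--instantmesh_path",
    type=str,
    default="/absolute/path/to/InstantMesh",  # point at your local InstantMesh clone
)
args = parser.parse_args()
```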
We use geoLRM for 3DGS reconstruction. To install it, after installing all the dependencies in requirements.txt, run:
pip install flash-attn --no-build-isolation
pip install git+https://github.com/ashawkey/diff-gaussian-rasterization.git
pip install git+https://github.com/Stability-AI/generative-models.git
(Note: installing flash-attn may take a while)
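After these installs, a quick import check can confirm the extra 3DGS dependencies resolved. The module names below are the ones these packages conventionally install, so adjust if your environment reports differently:

```python
# Sanity check for the extra 3DGS dependencies (module names are assumptions
# based on what these packages conventionally install).
import flash_attn
import diff_gaussian_rasterization
import sgm

print("flash-attn:", flash_attn.__version__)
```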
Locally clone geoLRM:
git clone git@github.com:alibaba-yuanjing-aigclab/GeoLRM.git
and replace Path/to/geoLRM in the default argument for geoLRM_path with the path to the geoLRM folder.
To test, run demo_3dgs.sh.
The mask renderings used to train the network are provided here. Each Objaverse model used has 16 renders, with renders 0, 4, 8, and 12 corresponding to the camera positions given in cameras/opencv_cameras.json.
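As a small usage sketch, those four renders can be paired with their cameras like this; the internal schema of opencv_cameras.json is an assumption here:

```python
# Sketch: pair renders 0, 4, 8, 12 with the provided camera positions.
# Assumption: cameras/opencv_cameras.json decodes to a list of four camera
# entries, ordered to match these view indices.
import json

with open("cameras/opencv_cameras.json") as f:
    cameras = json.load(f)

view_ids = [0, 4, 8, 12]
for view_id, cam in zip(view_ids, cameras):
    print(f"render {view_id} uses camera: {cam}")
```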
Planned future releases:
- adaptive remeshing pipeline
- texturing pipeline
- training code + mask creation code
- training dataset
If you find this work useful, please cite as:
@inproceedings{barda2024instant3ditmultiviewinpaintingfast,
  title     = {Instant3dit: Multiview Inpainting for Fast Editing of 3D Objects},
  author    = {Amir Barda and Matheus Gadelha and Vladimir G. Kim and Noam Aigerman and Amit H. Bermano and Thibault Groueix},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year      = {2025},
}