EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Prior

Zhipeng Hu1* Minda Zhao1* Chaoyi Zhao1 Xinyue Liang1
Lincheng Li1 Zeng Zhao1 Changjie Fan1 Xiaowei Zhou2 Xin Yu3
1 NetEase Fuxi AI Lab 2 State Key Lab of CAD&CG, Zhejiang University 3 University of Queensland



Abstract

While image diffusion models have made significant progress in text-driven 3D content creation, they often fail to accurately capture the intended meaning of text prompts, especially for view information. This limitation leads to the Janus problem, where multi-faced 3D models are generated under the guidance of such diffusion models. In this paper, we propose a robust high-quality 3D content generation pipeline by exploiting orthogonal-view image guidance. First, we introduce a novel 2D diffusion model that generates an image consisting of four orthogonal-view sub-images based on the given text prompt. Then, the 3D content is created using this diffusion model. Notably, the generated orthogonal-view image provides strong geometric structure priors and thus improves 3D consistency. As a result, it effectively resolves the Janus problem and significantly enhances the quality of 3D content creation. Additionally, we present a 3D synthesis fusion network that can further improve the details of the generated 3D contents. Both quantitative and qualitative evaluations demonstrate that our method surpasses previous text-to-3D techniques.



Example generated objects

EfficientDreamer generates objects and scenes from diverse captions.

A squirrel, animated movie character, high detail 3D model.
A DSLR photo of a yellow duck.
A pig wearing a backpack, high quality.
Viking axe fantasy, weapon, blender, 8k, HD.
Mr Bean Cartoon.
A motorcycle, scifi, high detail, high quality.

More Results


DreamFusion
Magic3D
TextMesh
Ours  

A squirrel, animated movie character, high detail 3D model.

A pig wearing a backpack, high quality.

Mr Bean Cartoon.

A motorcycle, scifi.

A squirrel playing guitar.

A ghost eating a hamburger.

A crab, low poly.

Katana, high detail, high quality.

Darth Vader helmet.

A DSLR photo of a yellow duck.

A 3D model of a road bike.

A 3D model of a corgi taking a selfie, high detail.

A 3D model of a fox holding a videogame controller.

An astronaut is riding a horse, high detail 3d model.

A peacock on a surfboard.

A 3D model of a German Shepherd.

A 3D model of an adorable cottage with a thatched roof.

A 3D model of a toy robot.

A 3D model of an exercise bike.

A 3D model of a white rabbit.

A blue poison-dart frog sitting on a water lily, high detail 3D model.

A ladybug, high detail, high quality.

A DSLR photo of a chow chow puppy, high detail, high quality.

A lion reading the newspaper.

A panda rowing a boat in a pond, high detail, high quality.

A product photo of a toy tank, high detail 3D model.

A recliner chair.

Viking axe fantasy, weapon, blender.

Dragon armor, 3D asset.

A photo of a horse walking.

TRUMP figure.

Pikachu, high quality.

Army Jacket.

A statue of angel.

A bulldog wearing a black pirate hat.

A 3D scan of AK47, weapon.

Project page template is borrowed from DreamFusion.