-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathindex.html
executable file
·581 lines (540 loc) · 36.1 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
<!DOCTYPE html>
<html>
<head>
<!-- Google tag (gtag.js) -->
<script async src="https://www.googletagmanager.com/gtag/js?id=G-XB3PR2Y1TQ"></script>
<script>
window.dataLayer = window.dataLayer || [];
function gtag(){dataLayer.push(arguments);}
gtag('js', new Date());
gtag('config', 'G-XB3PR2Y1TQ');
</script>
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0, shrink-to-fit=no">
<title>RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D</title>
<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/twitter-bootstrap/4.5.0/css/bootstrap.min.css">
<link href='https://fonts.googleapis.com/css?family=Source+Sans+Pro:300,400,500,600' rel='stylesheet' type='text/css'>
<link rel="stylesheet" href="assets/css/Highlight-Clean.css">
<link rel="stylesheet" href="assets/css/styles.css">
<link rel="apple-touch-icon" sizes="180x180" href="apple-touch-icon.png">
<link rel="icon" type="image/png" sizes="32x32" href="favicon-32x32.png">
<link rel="icon" type="image/png" sizes="16x16" href="favicon-16x16.png">
<link rel="manifest" href="site.webmanifest">
<meta name="robots" content="noindex">
<meta property="og:site_name" content="RichDreamer" />
<meta property="og:type" content="video.other" />
<meta property="og:title" content="RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D" />
<meta property="og:description" content="RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D, 2023" />
<meta property="og:url" content="" />
<script src="assets/js/video_comparison.js"></script>
<script type="module" src="https://unpkg.com/@google/model-viewer@2.0.1/dist/model-viewer.min.js"></script>
</head>
<body>
<div class="highlight-clean" style="padding-bottom: 10px;">
<div class="container" style="max-width: 1024px; margin-bottom: 20px;">
<h1 class="text-center"><b>RichDreamer</b>: A Generalizable Normal-Depth Diffusion Model for
Detail Richness in Text-to-3D</h1>
</div>
<div class="container" style="max-width: 990px; margin-bottom: 20px;">
<div class="row authors">
<div class="col">
<h1 class="text-center"><a class="text-center"><small>CVPR2024</a></small><a class="text-center"><small> <p style="color: red;"> Highlight</p> </a></small></h1>
</div>
</div>
<div class="row authors">
<div class="col">
<h5 class="text-center"></h5>
</div>
<div class="col">
<h5 class="text-center"><a class="text-center" href="https://lingtengqiu.github.io/", target="_blank">Lingteng Qiu<sup>1,3*</sup> </a></h5>
</div>
<div class="col">
<h5 class="text-center"><a href="https://guanyingc.github.io/", target="_blank">Guanying Chen<sup>2,1*</sup></a></h5>
</div>
<div class="col">
<h5 class="text-center"><a class="text-center", href="https://scholar.google.com.hk/citations?user=aJPO514AAAAJ&hl=zh-CN&oi=ao", target="_blank">Xiaodong Gu<sup>3*</sup></a></h5>
</div>
<div class="col">
<h5 class="text-center"></h5>
</div>
</div>
<div class="row authors">
<div class="col">
<h5 class="text-center"><a class="text-center">Qi Zuo<sup>3</sup> </a></h5>
</div>
<div class="col">
<h5 class="text-center"><a href="https://mutianxu.github.io/", target="_blank">Mutian Xu<sup>1</sup></a></h5>
</div>
<div class="col">
<h5 class="text-center"><a class="text-center">Yushuang Wu<sup>2,1,3</sup></a></h5>
</div>
<div class="col">
<h5 class="text-center"><a class="text-center" href="https://weihao-yuan.com/", target="_blank">Weihao Yuan<sup>3</sup></a></h5>
</div>
<div class="col">
<h5 class="text-center"><a class="text-center", href="https://scholar.google.com/citations?user=GHOQKCwAAAAJ&hl=zh-CN&oi=ao", target="_blank">Zilong Dong<sup>3</sup></a></h5>
</div>
</div>
<div class="row authors">
<div class="col">
<h5 class="text-center"></h5>
</div>
<div class="col">
<h5 class="text-center"><a class="text-center", href="https://research.cs.washington.edu/istc/lfb/", target="_blank">Liefeng Bo<sup>3</sup> </a></h5>
</div>
<div class="col">
<h5 class="text-center"><a href="https://mypage.cuhk.edu.cn/academics/hanxiaoguang/", target="_blank">Xiaoguang Han<sup>1,2</sup></a></h5>
</div>
<div class="col">
<h5 class="text-center"></h5>
</div>
</div>
<div class="row affiliations">
<div class="col">
<h6 class="text-center"></h6>
</div>
<div class="col">
<h6 class="text-center"><a class="text-center"><sup>1</sup>SSE, CUHKSZ</a></h6>
</div>
<div class="col">
<h6 class="text-center"><a class="text-center"><sup>2</sup>FNii, CUHKSZ</a></h6>
</div>
<div class="col">
<h6 class="text-center"><a class="text-center"><sup>3</sup>Alibaba Group</a></h6>
</div>
<div class="col">
<h6 class="text-center"><a class="text-center"><sup>*</sup>Equal Contribution</a></h6>
</div>
<div class="col">
<h6 class="text-center"></h6>
</div>
</div>
</div>
<div class="buttons" style="margin-top: 8px; margin-bottom: 8px;">
<a class="btn btn-light" role="button" href="https://arxiv.org/abs/2311.16918", target="_blank">
<svg style="width:24px;height:24px;margin-left:-12px;margin-right:12px" viewBox="0 0 24 24">
<path fill="currentColor" d="M16 0H8C6.9 0 6 .9 6 2V18C6 19.1 6.9 20 8 20H20C21.1 20 22 19.1 22 18V6L16 0M20 18H8V2H15V7H20V18M4 4V22H20V24H4C2.9 24 2 23.1 2 22V4H4M10 10V12H18V10H10M10 14V16H15V14H10Z"></path>
</svg>Paper
</a>
<a class="btn btn-light disabled border border-dark" aria-disabled="true" role="button" href="#">
<svg style="visibility:hidden;width:0px;height:24px;margin-left:-12px;margin-right:12px" width="0px" height="24px" viewBox="0 0 375 531">
<polygon stroke="#000000" points="0.5,0.866 459.5,265.87 0.5,530.874 "></polygon>
</svg>
Project
</a>
<a class="btn btn-light" role="button" href="https://github.com/modelscope/RichDreamer", target="_blank">
<svg xmlns="http://www.w3.org/2000/svg" style="width:24px;height:24px;margin-left:-12px;margin-right:12px" viewBox="0 0 24 24">
<path d="M12 0c-6.626 0-12 5.373-12 12 0 5.302 3.438 9.8 8.207 11.387.599.111.793-.261.793-.577v-2.234c-3.338.726-4.033-1.416-4.033-1.416-.546-1.387-1.333-1.756-1.333-1.756-1.089-.745.083-.729.083-.729
1.205.084 1.839 1.237 1.839 1.237 1.07 1.834 2.807 1.304 3.492.997.107-.775.418-1.305.762-1.604-2.665-.305-5.467-1.334-5.467-5.931 0-1.311.469-2.381 1.236-3.221-.124-.303-.535-1.524.117-3.176 0 0 1.008-.322
3.301 1.23.957-.266 1.983-.399 3.003-.404 1.02.005 2.047.138 3.006.404 2.291-1.552 3.297-1.23 3.297-1.23.653 1.653.242 2.874.118 3.176.77.84 1.235 1.911 1.235 3.221 0 4.609-2.807 5.624-5.479 5.921.43.372.823 1.102.823
2.222v3.293c0 .319.192.694.801.576 4.765-1.589 8.199-6.086 8.199-11.386 0-6.627-5.373-12-12-12z"></path>
</svg>
Code
</a>
<a class="btn btn-light" role="button" href="https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC", target="_blank">
<svg xmlns="http://www.w3.org/2000/svg" width="24" height="24" viewBox="0 0 24 24" fill="none" stroke="#000000" stroke-width="2" stroke-linecap="round" stroke-linejoin="round">
<rect x="2" y="3" width="20" height="14" rx="2" ry="2"></rect><line x1="8" y1="21" x2="16" y2="21"></line><line x1="12" y1="17" x2="12" y2="21"></line>
</svg>
Live Demo
</a>
<a class="btn btn-light" role="button" href="https://aigc3d.github.io/gobjaverse/", target="_blank">
<svg t="1705641830216" class="icon" viewBox="0 0 1024 1024" version="1.1" xmlns="http://www.w3.org/2000/svg" p-id="2212" width="24" height="24"><path d="M958.144 619.584c0-3.072 0-6.144-0.512-9.216V191.616h-0.512v-1.024c0-39.648-24.128-69.056-44.768-87.04-24.224-21.152-57.696-40.224-99.904-55.584C730.528 17.536 622.432 0.512 507.52 0.512c-114.848 0-221.92 16.448-304.352 46.816-42.208 15.584-75.712 34.592-99.904 55.648-20.128 18.08-44.832 47.968-44.832 87.04 0 3.136 0 6.208 0.512 9.376v633.92c0 39.2 24.224 69.056 44.832 87.04 24.128 21.216 57.696 40.224 99.904 55.584 81.92 30.432 190.016 47.456 304.864 47.456s222.496-17.056 304.928-47.392c42.208-15.424 75.712-34.528 99.904-55.584 20.032-18.08 44.256-47.392 44.768-87.04V619.52z m-73.152-214.752c0 22.688-36.576 51.488-92.672 73.664-73.664 28.832-177.152 45.344-284.32 45.344-107.072 0-210.656-16.544-284.224-45.344-64.352-25.248-89.664-52.512-92.16-70.592v-109.12c18.496 11.84 40.608 22.688 65.824 32.416 82.944 31.968 193.184 49.44 310.56 49.44 117.984 0 228.192-17.568 310.56-49.952a369.888 369.888 0 0 0 66.496-32.928v107.072zM151.744 157.632c16.96-14.912 43.744-29.856 77.248-42.208 74.176-27.808 173.056-42.784 279.104-42.784s205.536 15.52 279.2 42.784c32.928 12.352 59.744 27.296 77.248 42.208 9.28 8.256 20.128 20.128 20.128 32.48v0.512c-0.512 22.688-36.576 51.488-92.16 73.152-73.728 28.32-177.216 44.768-284.384 44.768-107.072 0-210.656-16.448-284.224-44.768-62.912-24.224-88.576-51.488-92.16-69.056v-4.544c0-11.84 10.752-24.224 20.032-32.48zM132.16 513.024c18.592 11.84 40.736 23.2 65.984 32.928 82.88 32.48 193.088 49.952 310.496 49.952 117.472 0 228.192-17.472 310.56-49.952a376.416 376.416 0 0 0 66.496-32.928v111.168c-3.648 17.568-29.408 44.832-92.736 69.056-73.664 28.32-177.152 44.832-284.832 44.832-106.56 0-210.08-16.544-283.2-44.832-56.16-21.664-92.736-50.976-92.736-73.664v-106.56z m732.864 353.792c-16.96 14.912-43.808 29.856-77.248 42.208-74.176 27.808-173.056 42.784-279.2 42.784-106.048 0-205.472-15.424-279.104-42.784-32.928-12.352-59.744-27.296-77.248-42.208-9.28-8.256-20.128-20.64-20.128-32.928v-106.72c18.592 11.84 41.248 23.2 66.496 32.928 82.368 31.968 192.576 49.504 309.472 49.504 117.472 0 227.68-17.568 310.56-49.44a369.856 369.856 0 0 0 66.496-32.992v106.048c0 12.864-10.816 25.248-20.128 33.504z m-102.528-455.744v0.16c0 19.808 16.032 35.84 35.84 35.84h0.448a35.84 35.84 0 0 0 35.84-35.84v-0.16-0.224a35.84 35.84 0 0 0-35.84-35.84h-0.448a35.84 35.84 0 0 0-35.84 35.84v0.224z m0.512 210.656v0.16c0 19.808 16.032 35.84 35.84 35.84h0.448a35.84 35.84 0 0 0 35.84-35.84v-0.224-0.16a35.84 35.84 0 0 0-35.84-35.84h-0.448a35.84 35.84 0 0 0-35.84 35.84v0.16z m0 217.216v0.288c0 19.808 16.032 35.84 35.84 35.84h0.448a35.84 35.84 0 0 0 35.84-35.84v-0.224-0.224a35.84 35.84 0 0 0-35.84-35.84h-0.448a35.84 35.84 0 0 0-35.84 35.84v0.224z" p-id="2213"></path></svg>
G-Objaverse
</a>
<a class="btn btn-light" role="button" href="https://github.com/aigc3d/", target="_blank">
<svg t="1705643356653" class="icon" viewBox="0 0 1024 1024" version="1.1" xmlns="http://www.w3.org/2000/svg" p-id="4475" width="24" height="24"><path d="M946.3 418.5L588.8 95.7c-42.9-42-111.8-42-154.7 0L77.7 418.6c-16.4 14.8-17.6 40.1-2.8 56.5s40.1 17.6 56.5 2.8l357.1-323.5 1.4-1.4c11.9-11.9 31.2-11.9 43.1 0l0.7 0.7 358.9 324.2c7.7 6.9 17.2 10.3 26.8 10.3 10.9 0 21.8-4.4 29.7-13.2 14.9-16.4 13.6-41.6-2.8-56.5z" fill="#333333" p-id="4476"></path><path d="M809.6 439.3c-22.1 0-40 17.9-40 40v375.3c0 13.7-9.6 25.2-20.9 25.2h-64.9c-11.3 0-20.9-11.6-20.9-25.2V715.5c0-58-45.2-105.2-100.9-105.2H461c-55.6 0-100.9 47.2-100.9 105.2v10.9c-0.1 1.3-0.2 2.6-0.2 3.9v124.3c0 13.7-9.6 25.2-20.9 25.2h-64.9c-11.3 0-20.9-11.6-20.9-25.2V479.3c0-22.1-17.9-40-40-40s-40 17.9-40 40v375.3c0 58 45.2 105.2 100.9 105.2H339c54.1 0 98.3-44.6 100.8-100.4 0.2-1.6 0.3-3.2 0.3-4.8V715.5c0-13.7 9.6-25.2 20.9-25.2h101.2c11.3 0 20.9 11.6 20.9 25.2v139.1c0 58 45.2 105.2 100.9 105.2h64.9c55.6 0 100.9-47.2 100.9-105.2V479.3c-0.2-22.1-18.1-40-40.2-40z" fill="#333333" p-id="4477"></path></svg>
Home
</a>
</div>
</div>
<hr class="divider" />
<div class="container" style="max-width: 768px;">
<div class="row">
<div class="col-md-12">
<img src="static/teaser.jpg" alt="architecture" style="width: 100%">
<h2>Abstract</h2>
<p>
Lifting 2D diffusion for 3D generation is a challenging
problem due to the lack of geometric prior and the com-
plex entanglement of materials and lighting in natural im-
ages. Existing methods have shown promise by first creat-
ing the geometry through score-distillation sampling (SDS)
applied to rendered surface normals, followed by appear-
ance modeling. However, relying on a 2D RGB diffusion
model to optimize surface normals is suboptimal due to the
distribution discrepancy between natural images and nor-
mals maps, leading to instability in optimization. In this
paper, recognizing that the normal and depth information
effectively describe scene geometry and be automatically
estimated from images, we propose to learn a generaliz-
able Normal-Depth diffusion model for 3D generation. We
achieve this by training on the large-scale LAION dataset
together with the generalizable image-to-depth and normal
prior models. In an attempt to alleviate the mixed illumi-
nation effects in the generated materials, we introduce an
albedo diffusion model to impose data-driven constraints on
the albedo component. Our experiments show that when in-
tegrated into existing text-to-3D pipelines, our models sig-
nificantly enhance the detail richness, achieving state-of-
the-art results.
</p>
</div>
</div>
</div>
<div class="container" style="max-width: 768px;">
<div class="row captioned_videos">
<div class="col-sm-12">
<h2>Methodology</h2>
<p>
We introduce a generalizable Normal-Depth diffusion model that is trained on the
LAION-2B dataset with normal and depth predicted by Midas, followed by fine-tuning on the synthetic dataset. Our model can be
incorporated with the DMTet and NeRF representations to enhance the geometry generation. To alleviate the ambiguity in appearance
modeling, we propose an albedo diffusion model to impose data-drive prior on the albedo component.
</p>
</div>
<div class="col-md-12">
<img src="static/architecture.png" alt="architecture" style="width: 100%">
</div>
</div>
</div>
<hr class="divider" />
<div class="container" style="max-width: 768px;">
<div class="row">
<div class="col-md-12">
<h2>Video</h2>
<div class="compositional captioned_videos">
<iframe width="768" height="432" src="https://www.youtube.com/embed/6gQ1VWiKoc0?si=Q7yL4mWmBN3cUaYi" frameborder="no" framespacing="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen> </iframe>
</div>
<strong>For netizens in China, considering the problem of Internet restrictions, we provide a video link to <a href="https://www.bilibili.com/video/BV1Qb4y1K7Sb/?spm_id_from=888.80997.embed_other.whitelist", target="_blank">bilibili</a>.</strong>
</div>
</div>
</div>
<hr class="divider" />
<div class="container" style="max-width: 768px;">
<div class="row">
<div class="col-sm-12">
<h2>Gallery Results of <i>Ours (Sphere)</i></h2>
</div>
</div>
<div class="row captioned_videos">
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/examples/a DSLR photo of a cake covered in colorful frosting with a slice being taken out, high resolution.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a DSLR photo of a cake covered in colorful frosting with a slice being taken out, high resolution</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="static/examples/A crocheted doll wearing a crown, 4K, HD.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">A crocheted doll wearing a crown, 4K, HD</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="static/examples/A statue of angel, 3d asset.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">A statue of angel, 3d asset</h6>
</div>
</div>
<div class="row captioned_videos">
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="static/examples/Fire-breathing Phoenix, mythical bird, engulfed in flames, rebirth and renewal, 3d asset.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">Fire-breathing Phoenix, mythical bird, engulfed in flames, rebirth and renewal, 3d asset.mp4</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/examples/an origami motorcycle.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a DSLR photo of an origami motorcycle</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="static/examples/mini China town, highly detailed, 3d asset.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">mini China town, highly detailed, 3d asset.mp4</h6>
</div>
</div>
<div class="row captioned_videos">
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/examples/a DSLR photo of a knight chopping wood.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a DSLR photo of a knight chopping wood</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/examples/A punk rock squirrel in a studded leather jacket shouting into a microphone while standing on a stump and holding a beer.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">A punk rock squirrel in a studded leather jacket shouting into a microphone while standing on a stump and holding a beer</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/examples/a squirrel dressed up like a Victorian woman.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a squirrel dressed up like a Victorian woman</h6>
</div>
</div>
<div class="row captioned_videos">
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/examples/a turtle standing on its hind legs, wearing a top hat.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a turtle standing on its hind legs, wearing a top hat</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/examples/a DSLR photo of an astronaut standing on the surface of mars.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a DSLR photo of an astronaut standing on the surface of mars</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/examples/a DSLR photo of edible typewriter made out of vegetables.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a DSLR photo of edible typewriter made out of vegetables</h6>
</div>
</div>
<div class="row captioned_videos">
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/examples/flying Dragon, highly detailed, breathing fire, 3d asset.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">flying Dragon, highly detailed, breathing fire, 3d asset</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/examples/a tiger wearing sunglasses and a leather jacket, riding a motorcycle.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a tiger wearing sunglasses and a leather jacket, riding a motorcycle</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/examples/Panda samurai, anthropomorphic panda in samurai armour, soldier, game asset.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">Panda samurai, anthropomorphic panda in samurai armour, soldier, game asset</h6>
</div>
</div>
<div class="row captioned_videos">
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/examples/Ninja Assassin, stealthy operative, high-tech weaponry.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">Ninja Assassin, stealthy operative, high-tech weaponry</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/examples/the leaning tower of Pisa, aerial view.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">the leaning tower of Pisa, aerial view</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/examples/a DSLR photo of the Statue of Liberty, aerial view.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a DSLR photo of the Statue of Liberty, aerial view</h6>
</div>
</div>
<div class="col-sm-8 my-auto center" style="margin-left: auto; margin-right: auto;">
<a href="./dmtet_dreamfusion_gallery_0.html" class="btn btn-primary btn-lg btn-search", target='_blank'>
Results from DreamFusion Prompts
</a>
<a href="#" class="btn btn-primary btn-lg btn-search">
Additional Results (Coming Soon)
</a>
</div>
</div>
<hr class="divider" />
<div class="container" style="max-width: 768px;">
<div class="row">
<div class="col-sm-12">
<h2>Gallery Results of <i>Ours (NeRF)</i></h2>
</div>
</div>
<div class="row captioned_videos">
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/a Christmas tree with donuts as decorations.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a Christmas tree with donuts as decorations</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/a confused beagle sitting at a desk working on homework.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a confused beagle sitting at a desk working on homework</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/a fox holding a videogame controller.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a fox holding a videogame controller</h6>
</div>
</div>
<div class="row captioned_videos">
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/a group of dogs playing poker, 3d asset.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a group of dogs playing poker, 3d asset</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/a group of squirrels rowing crew.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a group of squirrels rowing crew</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/a gummy bear driving a convertible.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a gummy bear driving a convertible</h6>
</div>
</div>
<div class="row captioned_videos">
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/a human skeleton relaxing in a lounge chair.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a human skeleton relaxing in a lounge chair</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/a humanoid robot sitting on a chair drinking a cup of coffee.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a humanoid robot sitting on a chair drinking a cup of coffee</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/a panda wearing a chefs hat and kneading bread dough on a countertop.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a panda wearing a chefs hat and kneading bread dough on a countertop</h6>
</div>
</div>
<div class="row captioned_videos">
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/a squirrel dressed like Henry VIII king of England.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a squirrel dressed like Henry VIII king of England</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/a tiger waiter at a fancy restaurant.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a tiger waiter at a fancy restaurant</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/a wide angle zoomed out DSLR photo of a skiing penguin wearing a puffy jacket.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">a wide angle zoomed out DSLR photo of a skiing penguin wearing a puffy jacket</h6>
</div>
</div>
<div class="row captioned_videos">
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/an origami hippo in a river.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">an origami hippo in a river</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/Majestic Peacock Throne, golden opulence, feathers adorned with jewels, royal symbolism, 3D asset.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">Majestic Peacock Throne, golden opulence, feathers adorned with jewels, royal symbolism, 3D asset</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/two raccoons playing poker.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">two raccoons playing poker</h6>
</div>
</div>
<div class="row captioned_videos">
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/Army Jacket, 3D scan.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">Army Jacket, 3D scan</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/Humoristic san goku body mixed with wild boar head running, 4K, HD.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">Humoristic san goku body mixed with wild boar head running, 4K, HD</h6>
</div>
<div class="col-4">
<video class="video lazy" loop playsinline autoplay muted onplay="resizeAndPlay(this)">
<source data-src="./static/NeRF-examples/An intricate complex with steam-powered machinery, twisting pipes, and brick warehouses, shrouded in a foggy, industrial atmosphere, 8K, blender 3d.mp4" type="video/mp4"></source>
</video>
<h6 class="caption">An intricate complex with steam-powered machinery, twisting pipes, and brick warehouses, shrouded in a foggy, industrial atmosphere, 8K, blender 3d</h6>
</div>
</div>
<div class="col-sm-8 my-auto center" style="margin-left: auto; margin-right: auto;">
<a href="./nerf_dreamfusion_gallery_0.html" class="btn btn-primary btn-lg btn-search", target='_blank'>
Results from DreamFusion Prompts
</a>
<a href="#" class="btn btn-primary btn-lg btn-search">
Additional Results (Coming Soon)
</a>
</div>
</div>
<hr class="divider" />
<div class="container" style="max-width: 768px;">
<div class="row">
<div class="col">
<h2>Text to Normal-Depth</h2>
<p>Sampling results of our Normal-Depth diffusion model trained on the Laion-2B dataset</p>
</div>
</div>
<div class="row captioned_videos">
<div class="col-md-12">
<img src="static/text-to-nd.png" alt="text-to-nd" style="width: 100%">
</div>
</div>
</div>
<hr class="divider" />
<div class="container" style="max-width: 768px;">
<div class="row">
<div class="col">
<h2>BibTex</h2>
<p>If you find our approach helpful, you may consider citing our work.</p>
</div>
</div>
<div class="row">
<div class="col">
<pre><code>@inproceedings{qiu2024richdreamer,
title={Richdreamer: A generalizable normal-depth diffusion model for detail richness in text-to-3d},
author={Qiu, Lingteng and Chen, Guanying and Gu, Xiaodong and Zuo, Qi and Xu, Mutian and Wu, Yushuang and Yuan, Weihao and Dong, Zilong and Bo, Liefeng and Han, Xiaoguang},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={9914--9925},
year={2024}
}
</code></pre>
</div>
</div>
</div>
<hr class="divider" />
<div class="container" style="max-width: 768px;">
<footer>
<p> Website template from <a href="https://dreamfusion3d.github.io/">DreamFusion</a>. We thank the authors for the open-source code.</p>
</footer>
</div>
<script src="https://polyfill.io/v3/polyfill.js?features=IntersectionObserver"></script>
<script src="assets/js/yall.js"></script>
<script>
yall(
{
observeChanges: true
}
);
</script>
<script src="assets/js/scripts.js"></script>
<script src="https://cdnjs.cloudflare.com/ajax/libs/jquery/3.5.1/jquery.min.js"></script>
<script src="https://cdnjs.cloudflare.com/ajax/libs/twitter-bootstrap/4.5.0/js/bootstrap.bundle.min.js"></script>
<script src="https://uploads-ssl.webflow.com/51e0d73d83d06baa7a00000f/js/webflow.fd002feec.js"></script>
<!-- Import the component -->
</body>
</html>