1/1
![Police car light :rotating_light: 🚨](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f6a8.png)
Paper Alert
![Right arrow :arrow_right: ➡️](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/27a1.png)
Paper Title: Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention
![Glowing star :star2: 🌟](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f31f.png)
Few pointers from the paper
![Direct hit :dart: 🎯](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f3af.png)
In this paper, authors have introduced “Era3D”, a novel multiview diffusion method that generates high-resolution multiview images from a single-view image.
![Direct hit :dart: 🎯](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f3af.png)
Despite significant advancements in multiview generation, existing methods still suffer from camera prior mismatch, inefficacy, and low resolution, resulting in poor-quality multiview images.
![Direct hit :dart: 🎯](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f3af.png)
Specifically, these methods assume that the input images should comply with a predefined camera type, e.g. a perspective camera with a fixed focal length, leading to distorted shapes when the assumption fails.
![Direct hit :dart: 🎯](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f3af.png)
Moreover, the full-image or dense multiview attention they employ leads to an exponential
explosion of computational complexity as image resolution increases, resulting in prohibitively expensive training costs.
![Direct hit :dart: 🎯](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f3af.png)
To bridge the gap between assumption and reality, Era3D first proposes a diffusion-based camera prediction module to estimate the focal length and elevation of the input image, which allows their method
to generate images without shape distortions.
![Direct hit :dart: 🎯](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f3af.png)
Furthermore, a simple but efficient attention layer, named row-wise attention, is used to enforce epipolar priors in the multiview diffusion, facilitating efficient cross-view information fusion.
![Direct hit :dart: 🎯](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f3af.png)
Consequently, compared with state-of-the-art methods, Era3D generates high-quality multiview images with up to a 512×512 resolution while reducing computation complexity by 12x times.
![Office building :office: 🏢](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f3e2.png)
Organization: @hkust , @HKUniversity , DreamTech, PKU, LightIllusion
![Mage :mage: 🧙](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f9d9.png)
Paper Authors: Peng Li, @YuanLiu41955461 , @xxlong0 , Feihu Zhang, @_cheng_lin , Mengfei Li, Xingqun Qi, Shanghang Zhang, Wenhan Luo, Ping Tan, Wenping Wang, Qifeng Liu, Yike Guo
![Keycap: 1 :one: 1️⃣](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/0031-20e3.png)
Read the Full Paper here:
[2405.11616] Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention
![Keycap: 2 :two: 2️⃣](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/0032-20e3.png)
Project Page:
Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention
![Keycap: 3 :three: 3️⃣](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/0033-20e3.png)
Code:
GitHub - pengHTYX/Era3D
![Keycap: 4 :four: 4️⃣](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/0034-20e3.png)
Demo:
Era3D MV Demo - a Hugging Face Space by pengHTYX
![Movie camera :movie_camera: 🎥](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f3a5.png)
Be sure to watch the attached Demo Video-Sound on
![Speaker high volume :loud_sound: 🔊](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f50a.png)
![Musical note :musical_note: 🎵](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f3b5.png)
Music by Oleg Fedak from @pixabay
Find this Valuable
![Gem stone :gem: 💎](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f48e.png)
?
![Recycling symbol :recycle: ♻️](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/267b.png)
QT and teach your network something new
Follow me
![Footprints :footprints: 👣](https://cdn.jsdelivr.net/joypixels/assets/6.6/png/unicode/64/1f463.png)
, @NaveenManwani17 , for the latest updates on Tech and AI-related news, insightful research papers, and exciting announcements.
To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196