1/1
Paper Alert
Paper Title: Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention
Few pointers from the paper
In this paper, authors have introduced “Era3D”, a novel multiview diffusion method that generates high-resolution multiview images from a single-view image.
Despite significant advancements in multiview generation, existing methods still suffer from camera prior mismatch, inefficacy, and low resolution, resulting in poor-quality multiview images.
Specifically, these methods assume that the input images should comply with a predefined camera type, e.g. a perspective camera with a fixed focal length, leading to distorted shapes when the assumption fails.
Moreover, the full-image or dense multiview attention they employ leads to an exponential
explosion of computational complexity as image resolution increases, resulting in prohibitively expensive training costs.
To bridge the gap between assumption and reality, Era3D first proposes a diffusion-based camera prediction module to estimate the focal length and elevation of the input image, which allows their method
to generate images without shape distortions.
Furthermore, a simple but efficient attention layer, named row-wise attention, is used to enforce epipolar priors in the multiview diffusion, facilitating efficient cross-view information fusion.
Consequently, compared with state-of-the-art methods, Era3D generates high-quality multiview images with up to a 512×512 resolution while reducing computation complexity by 12x times.
Organization: @hkust , @HKUniversity , DreamTech, PKU, LightIllusion
Paper Authors: Peng Li, @YuanLiu41955461 , @xxlong0 , Feihu Zhang, @_cheng_lin , Mengfei Li, Xingqun Qi, Shanghang Zhang, Wenhan Luo, Ping Tan, Wenping Wang, Qifeng Liu, Yike Guo
Read the Full Paper here:
[2405.11616] Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention
Project Page:
Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention
Code:
GitHub - pengHTYX/Era3D
Demo:
Era3D MV Demo - a Hugging Face Space by pengHTYX
Be sure to watch the attached Demo Video-Sound on
Music by Oleg Fedak from @pixabay
Find this Valuable
?
QT and teach your network something new
Follow me
, @NaveenManwani17 , for the latest updates on Tech and AI-related news, insightful research papers, and exciting announcements.
To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196