Y'all heard about ChatGPT yet? AI instantly generates question answers, entire essays etc.

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,112
Reputation
8,239
Daps
157,807
@bnew is there a a.i tax examiner?

I haven't realy looked into it but someone did make a gpt for taxes. I didn't try it.

ChatGPT - TaxGPT



 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,112
Reputation
8,239
Daps
157,807
the competition is heating up.


jEasUME.png







GFeXRKsW0AAmERI.jpg

GFecHADXUAA00Vt.jpg

GFecHAAWgAAfmFW.jpg

GFecHADWcAAaVM_.jpg

GFedUHgWgAA6x7v.jpg

GFedUHhXAAA0TaO.jpg

GFedUHeXgAEbUmK.jpg

GFedUHgW4AAu0Ua.jpg


GFas2thbMAAEpmh.jpg

GFas2tfboAARoe0.jpg

GFas2tjasAAC7uV.jpg

GFas2tiaMAAhRUf.jpg



 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,112
Reputation
8,239
Daps
157,807







 

Matt504

YSL as a gang must end
Joined
Sep 7, 2013
Messages
45,222
Reputation
14,777
Daps
274,020









it's over

:wow:
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
56,112
Reputation
8,239
Daps
157,807


I made a post about it but the thread got buried.



EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions​


Linrui Tian, Qi Wang, Bang Zhang, Liefeng Bo

Institute for Intelligent Computing, Alibaba Group

GitHub arXiv


Abstract​


MY ALT TEXT
We proposed EMO, an expressive audio-driven portrait-video generation framework. Input a single reference image and the vocal audio, e.g. talking and singing, our method can generate vocal avatar videos with expressive facial expressions, and various head poses, meanwhile, we can generate videos with any duration depending on the length of input video.​


Method​

MY ALT TEXT

Overview of the proposed method. Our framework is mainly constituted with two stages. In the initial stage, termed Frames Encoding, the ReferenceNet is deployed to extract features from the reference image and motion frames. Subsequently, during the Diffusion Process stage, a pretrained audio encoder processes the audio embedding. The facial region mask is integrated with multi-frame noise to govern the generation of facial imagery. This is followed by the employment of the Backbone Network to facilitate the denoising operation. Within the Backbone Network, two forms of attention mechanisms are applied: Reference-Attention and Audio-Attention. These mechanisms are essential for preserving the character's identity and modulating the character's movements, respectively. Additionally, Temporal Modules are utilized to manipulate the temporal dimension, and adjust the velocity of motion.

Various Generated Videos​

Singing​



Make Portrait Sing​


Input a single character image and a vocal audio, such as singing, our method can generate vocal avatar videos with expressive facial expressions, and various head poses, meanwhile, we can generate videos with any duration depending on the length of input audio. Our method can also persist the characters' identifies in a long duration.

Character: AI Mona Lisa generated by dreamshaper XL
Vocal Source: Miley Cyrus - Flowers. Covered by YUQI


Character: AI Lady from SORA
Vocal Source: Dua Lipa - Don't Start Now




Different Language & Portrait Style​


Our method supports songs in various languages and brings diverse portrait styles to life. It intuitively recognizes tonal variations in the audio, enabling the generation of dynamic, expression-rich avatars.

Character: AI Girl generated by ChilloutMix
Vocal Source: David Tao - Melody. Covered by NINGNING (mandarin)



Character: AI Ymir from AnyLora & Ymir Fritz Adult
Vocal Source: 『衝撃』Music Video【TVアニメ「進撃の巨人」The Final Season エンディングテーマ曲】 (Japanese)


Character: Leslie Cheung Kwok Wing
Vocal Source: Eason Chan - Unconditional. Covered by AI (Cantonese)



Character: AI girl generated by WildCardX-XL-Fusion
Vocal Source: JENNIE - SOLO. Cover by Aiana (Korean)




Rapid Rhythm​


The driven avatar can keep up with fast-paced rhythms, guaranteeing that even the swiftest lyrics are synchronized with expressive and dynamic character animations.

Character: Leonardo Wilhelm DiCaprio

Vocal Source: EMINEM - GODZILLA (FT. JUICE WRLD) COVER


Character: KUN KUN

Vocal Source: Eminem - Rap God

Talking​


Talking With Different Characters​


Our approach is not limited to processing audio inputs from singing, it can also accommodate spoken audio in various languages. Additionally, our method has the capability to animate portraits from bygone eras, paintings, and both 3D models and AI generated content, infusing them with lifelike motion and realism.


Character: Audrey Kathleen Hepburn-Ruston
Vocal Source: Interview Clip



Character: AI Chloe: Detroit Become Human
Vocal Source: Interview Clip



Character: Mona Lisa
Vocal Source: Shakespeare's Monologue II As You Like It: Rosalind "Yes, one; and in this manner."



Character: AI Ymir from AnyLora & Ymir Fritz Adult
Vocal Source: NieR: Automata


Cross-Actor Performance​


Explore the potential applications of our method, which enables the portraits of movie characters delivering monologues or performances in different languages and styles. we can expanding the possibilities of character portrayal in multilingual and multicultural contexts.


Character: Joaquin Rafael Phoenix - The Jocker - 《Jocker 2019》
Vocal Source: 《The Dark Knight》 2008


Character: SongWen Zhang - QiQiang Gao - 《The Knockout》
Vocal Source: Online courses for legal exams



Character: AI girl generated by xxmix_9realisticSDXL
Vocal Source: Videos published by itsjuli4.
 
Top