The Coli Makes Music Thread (Suno AI)

bnew

Veteran
Joined
Nov 1, 2015
Messages
58,673
Reputation
8,661
Daps
162,664



1/11
Pretty amazing stuff from the Udio/Suno lawsuits. Record labels were able to basically recreate versions of very famous songs with highly specific prompts, then linked to them in the lawsuits. I made a short compilation here: Listen to the AI-Generated Ripoff Songs That Got Udio and Suno Sued

2/11
also notable: record industry is not just suing Udio and Suno. It is also seeking to sue & unmask 10 John Does who allegedly helped them scrape the music and make the models

3/11
A full list of AI generated songs that record industry claims are ripped off with links and prompts are available here: https://s3.documentcloud.org/documents/24776032/1-3.pdf and here https://s3.documentcloud.org/documents/24776029/1-2.pdf

4/11
songs they wee able to reproduce:

5/11
It will get nasty after going to discovery and they are forced to disclose exactly how the AI was trained.

6/11
called it in April

7/11
ScarJo vs Openai. Same thing everyone runs with the meme but the audio is not even close. In this case, of couse it has to resemble the original as they are probably using the mimic option to promp the audio with the original song, will they sue every other cover band too? silly.

8/11
A few months ago you were able to get Beatles covers by just asking for a "Bea_tles" song

9/11
The theft from songwriters and musicians is a century old and continuing

10/11
This is biased by lifting the lyrics from the actual songs. Without that I don’t think anyone would say the tune is the same or even similar.

11/11
lol cc: .@joshtpm is there any past example of a hot new industry being sued as rapidly as the AI “creators”? No.


To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
GK0jtD6aIAEho3u.jpg
 

Artificial Intelligence

Not Allen Iverson
Joined
May 17, 2012
Messages
54,833
Reputation
7,085
Daps
126,835



1/11
Pretty amazing stuff from the Udio/Suno lawsuits. Record labels were able to basically recreate versions of very famous songs with highly specific prompts, then linked to them in the lawsuits. I made a short compilation here: Listen to the AI-Generated Ripoff Songs That Got Udio and Suno Sued

2/11
also notable: record industry is not just suing Udio and Suno. It is also seeking to sue & unmask 10 John Does who allegedly helped them scrape the music and make the models

3/11
A full list of AI generated songs that record industry claims are ripped off with links and prompts are available here: https://s3.documentcloud.org/documents/24776032/1-3.pdf and here https://s3.documentcloud.org/documents/24776029/1-2.pdf

4/11
songs they wee able to reproduce:

5/11
It will get nasty after going to discovery and they are forced to disclose exactly how the AI was trained.

6/11
called it in April

7/11
ScarJo vs Openai. Same thing everyone runs with the meme but the audio is not even close. In this case, of couse it has to resemble the original as they are probably using the mimic option to promp the audio with the original song, will they sue every other cover band too? silly.

8/11
A few months ago you were able to get Beatles covers by just asking for a "Bea_tles" song

9/11
The theft from songwriters and musicians is a century old and continuing

10/11
This is biased by lifting the lyrics from the actual songs. Without that I don’t think anyone would say the tune is the same or even similar.

11/11
lol cc: .@joshtpm is there any past example of a hot new industry being sued as rapidly as the AI “creators”? No.


To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
GK0jtD6aIAEho3u.jpg

It’s over :beli: why couldn’t they train royalty fee music
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
58,673
Reputation
8,661
Daps
162,664






1/11
@sunomusic
v4 is coming soon 🎧



https://video.twimg.com/ext_tw_video/1854955202001907712/pu/vid/avc1/720x1280/JFFDhLm1PGV4YWjU.mp4

2/11
@ThenAMug
I hate how corny most of the Hip-Hop/Rap generations are. Hopefully, this update brings some justice to the genre. @sunomusic



3/11
@sunomusic
we posted a hip hop song made with v4 this morning!



4/11
@Jesseclinton_
People who don’t understand creativity will knock this. True creators don’t just use a prompt; they see AI/Suno as part of a process. Bach with a strings VST and a chord generator—he’d innovate, not dismiss. Creativity evolves with tools, not despite them.



5/11
@sunomusic
said beautifully. thanks!



6/11
@StevenDarlow
@nickfloats wasn’t lying!

Wow, that is a game changer!! (again)



7/11
@sunomusic
thanks steve! excited for you to try it



8/11
@NahPlaya1212
Take everything I own



9/11
@sunomusic
😂❤️



10/11
@cooljellyx
Add in painting, and the game is f*cking over. ❤️‍🔥



11/11
@sunomusic
we have inpainting!




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196







1/11
@sunomusic
Burkinabe funk, but make it v4 🎧
Thank you to all our alpha testers for your feedback on v4 this past week. With your help, we’re now working hard at adding some finishing touches. Thank you everyone for your excitement and patience as we 🧑‍🍳



https://video.twimg.com/ext_tw_video/1857478323317706752/pu/vid/avc1/720x1280/mOnbevzYhNIAIjDQ.mp4

2/11
@AIandDesign
It's amazing!



3/11
@sunomusic
thanks Marco! really glad to hear that. and appreciate your feedback the last few days!



4/11
@JoeProAI
I did a song in anticipation but felt cliche to post. I do have to say whomever is running this update coming soon a campaign is doing a great job. I'm super stoked for this.



5/11
@sunomusic
thank you 🙏 ❤️



6/11
@SwisherYard
Will it have the option to enhance existing songs?



7/11
@sunomusic
Yes - we will have remastering as an option on your existing songs! 🔥



8/11
@lxe
This is absolutely bonkers. I've been addicted to suno.



9/11
@sunomusic
appreciate your support and glad you’ve been able to make your own music with suno!



10/11
@WorldEverett
Can't wait, it sounds great🔥



11/11
@RichSilverX
I want it so bad!!




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196



1/11
@sunomusic
v4 is coming soon 🎧



https://video.twimg.com/ext_tw_video/1856106054909632512/pu/vid/avc1/720x1280/tEGw361SAgwBxdzT.mp4

2/11
@Mitchnavarro7
I'm so pumped for this!



3/11
@gopro_audio
I got 500 songs to re do

lets go



4/11
@jasonjdxb
Can’t wait for SUNO V4! Can we get an upgrade where we can upload our original songs and SUNO AI creates vocals on the entire song using time stamps/prompts for guidance?



5/11
@Emily_Escapor
I hope soon means next week, and I hope you guys start the initial preparation for V5 and start training before the end of the year on the B200 cluster; we need everyone to make music.
I hope we also significantly improve how to control the instruments and advanced control over vocals.



6/11
@mckaywrigley
Can’t come soon enough



7/11
@blizaine
Wow. Tomorrow then? 🤞😬



8/11
@ibrvndy
How soon we talkin? 🤔 🙌



9/11
@Faedriel
hypeee



10/11
@david_vipernz
It's already best value for anything ever. And it keeps getting better!



11/11
@Liinad_De_Varge
Already forgot you existed 😅
But going to try out the new version for sure!




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196



1/11
@nickfloats
something comin' @sunomusic



https://video.twimg.com/ext_tw_video/1854964192417038336/pu/vid/avc1/720x1278/9RQop-MC3yfc638K.mp4

2/11
@dustinhollywood
🔥



3/11
@DMctendo
Put it on hi fi speakers in a club and see why it’s not gonna be doing much of anything… yet



4/11
@dkardonsky_
bullish



5/11
@jaredeasley
Fantastic



6/11
@gpt_biz
Excited to see what’s coming next from @sunomusic looking forward to it



7/11
@edh_wow
Honestly, I love Suno. Has the rapping gotten better?



8/11
@ikamanu
Love it. Can you share the prompt for this one?



9/11
@NoodleNakamoto
I'm NGL, this is an absolute banger. Please @nickfloats - if you can extend this, I'd love to hear more.



10/11
@opensaysmani
V4 on its way - can’t wait



11/11
@be_high
API? Word level timestamps ?




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196







1/11
@sunomusic
Make a song from any image or video with Suno Scenes. 🎥 : Disco Sculptures by ClaraCo



https://video.twimg.com/ext_tw_video/1852821409162207232/pu/vid/avc1/720x1402/WY2XWQmD_XMSALDE.mp4

2/11
@frownsOfficial




https://video.twimg.com/ext_tw_video/1852822140372062210/pu/vid/avc1/720x720/2ysRvDKZDqQ1MC4A.mp4

3/11
@sunomusic
🤔



4/11
@UriGil3
Youre too focused on mainstream music. Wish there was more creativity



5/11
@sunomusic
happy to take style suggestions 🙏



6/11
@luckycreative_o
And for Android users?



7/11
@sunomusic
Good news - Suno on Android is now open for pre-registration at Suno - AI Music - Apps on Google Play! Have you signed up?



8/11
@BromfieldDuane
when is this coming for android?



9/11
@sunomusic
Good news - Suno on Android is now open for pre-registration at Suno - AI Music - Apps on Google Play! Have you signed up?



10/11
@Mitchnavarro7
I have an android 🥲



11/11
@jessyseonoob




https://video.twimg.com/ext_tw_video/1852943354407100417/pu/vid/avc1/720x948/0kx3YGDa890zieDc.mp4


To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
58,673
Reputation
8,661
Daps
162,664


Now Hear This: World’s Most Flexible Sound Machine Debuts​


Using text and audio as inputs, a new generative AI model from NVIDIA can create any combination of music, voices and sounds.

November 25, 2024 by Richard Kerris

Fugatto


A team of generative AI researchers created a Swiss Army knife for sound, one that allows users to control the audio output simply using text.

While some AI models can compose a song or modify a voice, none have the dexterity of the new offering.

Called Fugatto (short for Foundational Generative Audio Transformer Opus 1), it generates or transforms any mix of music, voices and sounds described with prompts using any combination of text and audio files.

For example, it can create a music snippet based on a text prompt, remove or add instruments from an existing song, change the accent or emotion in a voice — even let people produce sounds never heard before.

“This thing is wild,” said Ido Zmishlany, a multi-platinum producer and songwriter — and cofounder of One Take Audio, a member of the NVIDIA Inception program for cutting-edge startups. “Sound is my inspiration. It’s what moves me to create music. The idea that I can create entirely new sounds on the fly in the studio is incredible.”


A Sound Grasp of Audio

“We wanted to create a model that understands and generates sound like humans do,” said Rafael Valle, a manager of applied audio research at NVIDIA and one of the dozen-plus people behind Fugatto, as well as an orchestral conductor and composer.

Supporting numerous audio generation and transformation tasks, Fugatto is the first foundational generative AI model that showcases emergent properties — capabilities that arise from the interaction of its various trained abilities — and the ability to combine free-form instructions.

“Fugatto is our first step toward a future where unsupervised multitask learning in audio synthesis and transformation emerges from data and model scale,” Valle said.


A Sample Playlist of Use Cases


For example, music producers could use Fugatto to quickly prototype or edit an idea for a song, trying out different styles, voices and instruments. They could also add effects and enhance the overall audio quality of an existing track.

“The history of music is also a history of technology. The electric guitar gave the world rock and roll. When the sampler showed up, hip-hop was born,” said Zmishlany. “With AI, we’re writing the next chapter of music. We have a new instrument, a new tool for making music — and that’s super exciting.”

An ad agency could apply Fugatto to quickly target an existing campaign for multiple regions or situations, applying different accents and emotions to voiceovers.

Language learning tools could be personalized to use any voice a speaker chooses. Imagine an online course spoken in the voice of any family member or friend.

Video game developers could use the model to modify prerecorded assets in their title to fit the changing action as users play the game. Or, they could create new assets on the fly from text instructions and optional audio inputs.


Making a Joyful Noise

“One of the model’s capabilities we’re especially proud of is what we call the avocado chair,” said Valle, referring to a novel visual created by a generative AI model for imaging.

For instance, Fugatto can make a trumpet bark or a saxophone meow. Whatever users can describe, the model can create.

With fine-tuning and small amounts of singing data, researchers found it could handle tasks it was not pretrained on, like generating a high-quality singing voice from a text prompt.


Users Get Artistic Controls


Several capabilities add to Fugatto’s novelty.

During inference, the model uses a technique called ComposableART to combine instructions that were only seen separately during training. For example, a combination of prompts could ask for text spoken with a sad feeling in a French accent.

The model’s ability to interpolate between instructions gives users fine-grained control over text instructions, in this case the heaviness of the accent or the degree of sorrow.

“I wanted to let users combine attributes in a subjective or artistic way, selecting how much emphasis they put on each one,” said Rohan Badlani, an AI researcher who designed these aspects of the model.

“In my tests, the results were often surprising and made me feel a little bit like an artist, even though I’m a computer scientist,” said Badlani, who holds a master’s degree in computer science with a focus on AI from Stanford.

The model also generates sounds that change over time, a feature he calls temporal interpolation. It can, for instance, create the sounds of a rainstorm moving through an area with crescendos of thunder that slowly fade into the distance. It also gives users fine-grained control over how the soundscape evolves.

Plus, unlike most models, which can only recreate the training data they’ve been exposed to, Fugatto allows users to create soundscapes it’s never seen before, such as a thunderstorm easing into a dawn with the sound of birds singing.


A Look Under the Hood


Fugatto is a foundational generative transformer model that builds on the team’s prior work in areas such as speech modeling, audio vocoding and audio understanding.

The full version uses 2.5 billion parameters and was trained on a bank of NVIDIA DGX systems packing 32 NVIDIA H100 Tensor Core GPUs.

Fugatto was made by a diverse group of people from around the world, including India, Brazil, China, Jordan and South Korea. Their collaboration made Fugatto’s multi-accent and multilingual capabilities stronger.

One of the hardest parts of the effort was generating a blended dataset that contains millions of audio samples used for training. The team employed a multifaceted strategy to generate data and instructions that considerably expanded the range of tasks the model could perform, while achieving more accurate performance and enabling new tasks without requiring additional data.

They also scrutinized existing datasets to reveal new relationships among the data. The overall work spanned more than a year.

Valle remembers two moments when the team knew it was on to something. “The first time it generated music from a prompt, it blew our minds,” he said.

Later, the team demoed Fugatto responding to a prompt to create electronic music with dogs barking in time to the beat.

“When the group broke up with laughter, it really warmed my heart.”

Hear what Fugatto can do:

 
Top