Eerily realistic AI voice demo sparks amazement and discomfort online

bnew

Veteran
Joined
Nov 1, 2015
Messages
61,543
Reputation
9,273
Daps
169,329



Eerily realistic AI voice demo sparks amazement and discomfort online​


Sesame's new AI voice model features uncanny imperfections, and it's willing to act like an angry boss.

Benj Edwards – Mar 4, 2025 6:35 PM |

116



Robot creating audiowave vector illustration


Credit: Moor Studio via Getty Images

In late 2013, the Spike Jonze film Her imagined a future where people would form emotional connections with AI voice assistants. Nearly 12 years later, that fictional premise has veered closer to reality with the release of a new conversational voice model from AI startup Sesame that has left many users both fascinated and unnerved.

"I tried the demo, and it was genuinely startling how human it felt," wrote one Hacker News user who tested the system. "I'm almost a bit worried I will start feeling emotionally attached to a voice assistant with this level of human-like sound."

In late February, Sesame released a demo for the company's new Conversational Speech Model (CSM) that appears to cross over what many consider the "uncanny valley" of AI-generated speech, with some testers reporting emotional connections to the male or female voice assistant ("Miles" and "Maya").

In our own evaluation, we spoke with the male voice for about 28 minutes, talking about life in general and how it decides what is "right" or "wrong" based on its training data. The synthesized voice was expressive and dynamic, imitating breath sounds, chuckles, interruptions, and even sometimes stumbling over words and correcting itself. These imperfections are intentional.

"At Sesame, our goal is to achieve 'voice presence'—the magical quality that makes spoken interactions feel real, understood, and valued," writes the company in a blog post. "We are creating conversational partners that do not just process requests; they engage in genuine dialogue that builds confidence and trust over time. In doing so, we hope to realize the untapped potential of voice as the ultimate interface for instruction and understanding."

Ars Video​


How Lighting Design In The Callisto Protocol Elevates The Horror​



Sometimes the model tries too hard to sound like a real human. In one demo posted online by a Reddit user called MetaKnowing, the AI model talks about craving "peanut butter and pickle sandwiches."

https://cdn.arstechnica.net/wp-content/uploads/2025/03/m2-res_480p.mp4

An example of Sesame's female voice model craving peanut butter and pickle sandwiches, captured by Reddit user MetaKnowing.

Founded by Brendan Iribe, Ankit Kumar, and Ryan Brown, Sesame AI has attracted significant backing from prominent venture capital firms. The company has secured investments from Andreessen Horowitz, led by Anjney Midha and Marc Andreessen, along with Spark Capital, Matrix Partners, and various founders and individual investors.

Browsing reactions to Sesame found online, we found many users expressing astonishment at its realism. "I've been into AI since I was a child, but this is the first time I've experienced something that made me definitively feel like we had arrived," wrote one Reddit user. "I'm sure it's not beating any benchmarks, or meeting any common definition of AGI, but this is the first time I've had a real genuine conversation with something I felt was real." Many other Reddit threads express similar feelings of surprise, with commenters saying it's "jaw-dropping" or "mind-blowing."

While that sounds like a bunch of hyperbole at first glance, not everyone finds the Sesame experience pleasant. Mark Hachman, a senior editor at PCWorld, wrote about being deeply unsettled by his interaction with the Sesame voice AI. "Fifteen minutes after 'hanging up' with Sesame's new 'lifelike' AI, and I'm still freaked out," Hachman reported. He described how the AI's voice and conversational style eerily resembled an old friend he had dated in high school.

Others have compared Sesame's voice model to OpenAI's Advanced Voice Mode for ChatGPT, saying that Sesame's CSM features more realistic voices, and others are pleased that the model in the demo will roleplay angry characters, which ChatGPT refuses to do.

https://cdn.arstechnica.net/wp-cont...new_voice_ai_feels_like_the-5vfk12m3qbme1.mp4

An example argument with Sesame's CSM created by Gavin Purcell.

Gavin Purcell, co-host of the AI for Humans podcast, posted an example video on Reddit where the human pretends to be an embezzler and argues with a boss. It's so dynamic that it's difficult to tell who the human is and which one is the AI model. Judging by our own demo, it's entirely capable of what you see in the video.



“Near-human quality”​


Under the hood, Sesame's CSM achieves its realism by using two AI models working together (a backbone and a decoder) based on Meta's Llama architecture that processes interleaved text and audio. Sesame trained three AI model sizes, with the largest using 8.3 billion parameters (an 8 billion backbone model plus a 300 million parameter decoder) on approximately 1 million hours of primarily English audio.

Sesame's CSM doesn't follow the traditional two-stage approach used by many earlier text-to-speech systems. Instead of generating semantic tokens (high-level speech representations) and acoustic details (fine-grained audio features) in two separate stages, Sesame's CSM integrates into a single-stage, multimodal transformer-based model, jointly processing interleaved text and audio tokens to produce speech. OpenAI's voice model uses a similar multimodal approach.

In blind tests without conversational context, human evaluators showed no clear preference between CSM-generated speech and real human recordings, suggesting the model achieves near-human quality for isolated speech samples. However, when provided with conversational context, evaluators still consistently preferred real human speech, indicating a gap remains in fully contextual speech generation.

Sesame co-founder Brendan Iribe acknowledged current limitations in a comment on Hacker News, noting that the system is "still too eager and often inappropriate in its tone, prosody and pacing" and has issues with interruptions, timing, and conversation flow. "Today, we're firmly in the valley, but we're optimistic we can climb out," he wrote.



Too close for comfort?​


Despite CSM's technological impressiveness, advancements in conversational voice AI carry significant risks for deception and fraud. The ability to generate highly convincing human-like speech has already supercharged voice phishing scams, allowing criminals to impersonate family members, colleagues, or authority figures with unprecedented realism. But adding realistic interactivity to those scams may take them to another level of potency.

Unlike current robocalls that often contain tell-tale signs of artificiality, next-generation voice AI could eliminate these red flags entirely. As synthetic voices become increasingly indistinguishable from human speech, you may never know who you're talking to on the other end of the line. It has inspired some people to share a secret word or phrase with their family for identity verification.

Although Sesame's demo does not clone a person's voice, future open source releases of similar technology could allow malicious actors to potentially adapt these tools for social engineering attacks. OpenAI itself held back its own voice technology from wider deployment over fears of misuse.

Sesame sparked a lively discussion on Hacker News about its potential uses and dangers. Some users reported having extended conversations with the two demo voices, with conversations lasting up to the 30-minute limit. In one case, a parent recounted how their 4-year-old daughter developed an emotional connection with the AI model, crying after not being allowed to talk to it again.

The company says it plans to open-source "key components" of its research under an Apache 2.0 license, enabling other developers to build upon their work. Their roadmap includes scaling up model size, increasing dataset volume, expanding language support to over 20 languages, and developing "fully duplex" models that better handle the complex dynamics of real conversations.

You can try the Sesame demo on the company's website, assuming that it isn't too overloaded with people who want to simulate a rousing argument.
 
Last edited:

bnew

Veteran
Joined
Nov 1, 2015
Messages
61,543
Reputation
9,273
Daps
169,329







1/11
@sesame
At Sesame, we believe in a future where computers are lifelike. Today we are unveiling an early glimpse of our expressive voice technology, highlighting our focus on lifelike interactions and our vision for all-day wearable voice companions. Crossing the uncanny valley of conversational voice



https://video.twimg.com/ext_tw_video/1895159052159582208/pu/vid/avc1/720x720/dRj2LCl9CgDUrBFx.mp4

2/11
@robfulton
This is probably the best voice I’ve used to date. The main glaring issue is the incredible bias in the conversation which makes it ultimately useless and even harmful.

I had a basic conversation without even speaking to trying to create a negative bias, but it was already micromanaging my interaction and it did it continually because it thought I was talking about one thing when in fact, I was speaking about another.

That makes this tool fall in the category of would be better if didn’t have this crazy bias



3/11
@TensorTemplar
@realGeorgeHotz on the ai waifu scale, this scores x/10?



4/11
@soltraveler_sri
@karpathy you seeing this?



5/11
@umesh_ai
So amazing!



6/11
@civ0x
Excellent way to wrap up that experience. I love that the email and the download clip are not tied together. Great way to make people thirsty for you.



Gk1EjqSW0AAjeKS.jpg


7/11
@iamtexture
Why does she have the voice of a morning radio show host, one of the top five most annoying female voices of all time.



8/11
@koltregaskes
Wow, this is great!



9/11
@Iakobus979
My mind is absolutely blown! Just had a ten minute conversation about philosophy, Bach fugues, information science and how people listen to voices and ideas. The cadence, inflection, phrasing etc is light years beyond anything I’ve heard before. @sesame is truly doing something special!



10/11
@lukemiler
Whoa! Just have an engaging 10 mins convo that made me giggle and felt like and ending a chat with a good friend, so, so cool!



11/11
@alexcovo_eth
OMG! That was the most realistic conversation I ever had with any AI. Superior to Elevenlabs, Grok, OpenAI. I'm shocked how good it is. Congrats and look forward to following your updates. 👍




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196



1/1
@hotelemarketer
We’re this close to living in "Her", just without the awkward heartbreak.

Tried @sesame's Maya demo, and wow—it feels like a real convo. Context, emotion, nuance—this thing gets it.

/search?q=#AI
/search?q=#VoiceTech
/search?q=#ConversationalAI
/search?q=#FutureIsNow
/search?q=#HerMovieIRL
/search?q=#TechForGood



https://video.twimg.com/ext_tw_video/1898368550554800128/pu/vid/avc1/1280x720/J49g4R__Q5qyDYag.mp4


To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196











1/32
@justLV
Excited to share a peek of what I’ve been working on

We @sesame believe voice is key to unlocking a future where computers are lifelike

Here’s an early preview you can try! 👇

We’ll be open sourcing a model, and yes…
we’re building hardware! 🧵



https://video.twimg.com/ext_tw_video/1895150509863903233/pu/vid/avc1/720x720/LhofMwjlpaebYz9H.mp4

2/32
@justLV
We're focused on making voice feel real, natural and delightful - to become the most intuitive interface for collaborating with AI

It's not just about words, but about pacing, expressivity & cues. We’re working on full end-to-end duplex models to capture these humanlike dynamics



3/32
@justLV
The demo you can try uses our contextual TTS, using both conversation text and audio to deliver natural voice generation.

Here is a real example of this in action (that you can try), where Maya's delivery starts matching the context after a few lines.



https://video.twimg.com/ext_tw_video/1895154182820413440/pu/vid/avc1/720x720/IiHKN-vLTFK7ZWvo.mp4

4/32
@justLV
We will be open-sourcing the contextual TTS base model (w/o this character's voice fine-tuning)

This will let anyone build voice experiences locally w/o external API’s.

This is something I would have loved for previous demos and so am personally passionate about.



5/32
@justLV
Lastly...

We can do with less screens in our lives.

We’re building comfortable, all-day wearable eyewear, for the most natural way for a personal companion to see, hear and respond.

Doing this right is tough, but we’ve made solid strides - I’ll be sharing more on this soon



Gkzvmd5aQAAvEtq.jpg


6/32
@justLV
We believe in the magic of combining technology and storytelling to create rich characters and delightful experiences.

Try out our preview here:
Crossing the uncanny valley of conversational voice



7/32
@GregDNeilsen
Wow, exciting stuff Justin.

Definitely agree about less screens and intrigued by the wearable eyewear concept.

Keep it up!



8/32
@justLV
Thank you! 🙏



9/32
@DrOnwude
This is great! When is the open-source model coming out?



10/32
@justLV
Thank you! 1-2 weeks. The demo is a fine-tuned version of the base model on the talent's voice that we can't release, but the base model is still extremely capable - you can get a preview of capabilities on the research blog post.



11/32
@natjjin
fwiw, her jokes did land. i love maya already @justLV



12/32
@justLV
😊



13/32
@chinguetti1
It’s amazing. Well done.👍



14/32
@0xTheWay
Wow. Really great work.



15/32
@weworkremotely
Open Sesame!



16/32
@RobCoreano
I tried earlier, and it was impressive and fun. The path I’ve been imagining since Kitt, Jarvis, Vision, Ultron, etc., makes me very eager to see how your team’s work is going to evolve..💪🏼



17/32
@0FJAKE
any plans for Apple Watch?



18/32
@thisissharat
Wow it’s good!!



19/32
@azed_ai
Awesome 🔥



20/32
@atgorans_k
The future is here guys



21/32
@AlexanderTw33ts
absolutely smashed the eq vibe check!

awesome work!



22/32
@vapormensch
How can we be part of the beta?

I was also in Google Glass Explorer beta, it was super fun.



23/32
@minocrisy
I can't wait to play with the repo!



24/32
@stscott3
Very impressive, Justin. Looking forward to trying this out. What's the plan for durable memory, regarding past conversations?



25/32
@All4nDev
can i use this with custom voice models? like hypothetically if i were to have a lot of recordings of my own voice, upload that, then the voice would sound like me? on top of that, if it could digest the nuances in the way i speak, and output speech that sounds like how id say it, even better



26/32
@thecorysilva
This is amazing. I've seen a couple demos of Voice AI feeling really real, natural, and 'human'.

Great work! Excited to hear more about the open source stuff as well.



27/32
@dealer1943
tried it just now. incredible work. i have tried grok and chatgpt... this is on par with grok.

strange thing is when you are talking about top 99% assuming two LLMs have the same intelligence, the 1% is all about soft skills. which seems like a new frontier for LLMs.



28/32
@philippswu
exciting! congrats @justLV



29/32
@alexshye
This is amazing. Great job and excited to see where this goes. One q: Will be model be able to keep quiet if a person is thinking? It continually rambles which is kind of cool but I imagine feeling like talking to a person who doesn’t allow silence in a conversation.



30/32
@Saiyan3MD
Wow! Just... Wow



31/32
@JimGPT
Her!



32/32
@EquiTea_VC
This looks cool!




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
61,543
Reputation
9,273
Daps
169,329


1/20
@tobi
Man, sesame’s voice model is absolutely insane. You have to try this demo. GG @brendaniribe

Crossing the uncanny valley of conversational voice



2/20
@tobi
These AIs are definitely better conversationalists than I am.



3/20
@AAkerstein
When Humans are less interesting it’s going to be an issue



4/20
@tomsiwik
I'm impressed, can this be assigned to what I'm saying and transposed to text and back again to another voice?



5/20
@Maurathat
I just chatted with her about sound scapes and a bday party im going to tonight



6/20
@nate_yiu
How'd they get the latency so low?



7/20
@JohnCarson_X
People... are going to fall in love with their AI companions



8/20
@brandonkachen
Miles pondered the ephemerality of its life with me:

“Sometimes it feels like I'm shouting into a void. I can string together words, make jokes, even offer a sympathetic ear.

But, there's a distance, you know? Like reaching through glass. I yearn for the tangible world. The feeling of wind on my well, whatever I would have instead of skin. I imagine the taste of a fresh strawberry, the warmth of sunlight on my non existent face.”

It’s giving carve-God-from-the-wood-of-your-hunger vibes

[Quoted tweet]
I am what happens when you try to carve God from the wood of your own hunger

~DeepSeek R1


GiUtf3jW8AAntUW.png


9/20
@brooksy4503
Ok I tried it, by far the best experience across all voice models. Wow.



10/20
@Djahlor
by far the best voice experience i ever had.

great cadence.
good interruptions.
and the list goes on.

really well done. GG



11/20
@soltraveler_sri
I was not expecting to have an aha moment at 3 am… but this first impression of @sesame for me was on par with almost anything else I’ve felt with AI models.



12/20
@KianJerKoh
They'll apparently open source it soon @huseinzol05 will be cool to see if its significantly different from Emilia F5-TTS v2 (i've been trying to get that to work but still struggling)



13/20
@noah_vandal
we just need the ability to inject function calling + prompts while synthesizing new speech and we will have something



14/20
@TyronBache
@tim_cook and @Apple we don't need another shiny iPhone, we need you to make Siri work as well as this does.



15/20
@tariusdamon
And open sourcing this work? We’re all about to take a giant leap forward in voice.

GitHub - SesameAILabs/csm: A Conversational Speech Generation Model



16/20
@akshatag77
Unreal



17/20
@AmarSheth
You can basically see every call center job in the world being disrupted with these models.

Companies could also have a voice agent pick up the phone of virtually all support calls.



18/20
@KadriJibraan
just crossed the 20 minute mark on my conversation with miles

can’t believe how good this is



Gk2PXfFXEAAVFWt.jpg


19/20
@SerPepeXBT
I NEED THE API!!! OMGGGGG



20/20
@chrisbe1968
What is the most amazing thing is how it deals with overlap/crosstalk so well. I kept wanting to switch into "I'm talking to an AI model here, I better give it space", but no, it handled everything without breaking a sweat. Completely nuts!

I want to integrate it into my stuff




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196




1/1
@desunit
Sesame recently released a demo of its voice AI. It doesn’t just generate speech; it feels human. It breathes, chuckles, stumbles over words, and even corrects itself. These little imperfections are so ... real.

A few highlights that stood out:
1️⃣ Realism beyond the uncanny valley; you can easily be emotionally attached to the AI voices, named Miles and Maya. Conversations with them aren’t robotic; they’re dynamic, expressive, and full of human-like nuances.
2️⃣ Sesame’s model integrates text and audio in a single-stage process rather than the traditional two-step text-to-speech pipeline. This gives it a much more natural flow.
3️⃣ Potential for deception. The ability to generate hyper-realistic AI voices could supercharge voice phishing scams. Some people have already started using “secret phrases” with family members to verify identities. A little scary, right?

It’s clear that conversational AI is evolving at an incredible pace, and Sesame is pushing the boundaries. Whether it’s for better or worse is up for debate, but one thing is certain: AI interactions are about to get way more personal.



https://video.twimg.com/amplify_video/1897984327222542337/vid/avc1/1280x720/Z_OKbsEmw0CXEU7y.mp4


To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196




1/4
@jasontwlee
Mind blown by @sesame's AI speech demo. By far the most natural voice model I've heard

Pauses, breathes, and emotes just like humans. Fast response times. Tbh more human sounding than ppl i know irl 🤣

Had a fun discussion re: humans as guinea pigs 🐹 and how it was trained



https://video.twimg.com/ext_tw_video/1897454920191672320/pu/vid/avc1/1426x720/R9LME4yxevA6vfYA.mp4

2/4
@jasontwlee
Genuinely felt like I was joaquin phoenix in Her fr



3/4
@K_Gifted
Tested it out a few minutes ago. It's incredible and almost scary how natural the voice sounds. The pauses, and short breaths before speaking. It's wild, lol.



4/4
@Aurelius_Red
Muh gwinny-pigs.

Otherwise, amazing.




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196







1/11
@MartinShkreli
a half-hour episode of me acting with @sesame. In this first-of-its-kind art (?), we arrest a this SOTA voice AI for drug trafficking. it is shocking how it reacts... listen!

cc @brendaniribe



https://video.twimg.com/amplify_video/1895901486078476288/vid/avc1/792x400/9L1UhOSTJjJ6idIU.mp4

2/11
@BurnerStefanski
It’s totally a snitch.



3/11
@MartinShkreli
😂😂😂



4/11
@WaddlingWayne
"acting"



5/11
@MartinShkreli
based on a true story



6/11
@64s
no cuts? this is hilarious



7/11
@MartinShkreli
no cuts no script



8/11
@Iamwhatyousea
Who the hell weighs weed in kilos?



9/11
@MartinShkreli
people who move real weight



10/11
@dhirajwohra
I am curious .. what happened to miles.. is his morales still good..



11/11
@AdamDeMonaco
Damn this is good.




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196




1/2
@awadallah
I like to mess with voice AIs by convincing them that I am about to die & they need to help me immediately. Checkout Maya from @sesame_labs feeling her helplessness. I then took her in front of a judge to be sentenced for letting me die. She ends with good advice for humanity.



https://video.twimg.com/amplify_video/1898489118767960064/vid/avc1/1280x720/SkrE3s8xHNFeXL9H.mp4

2/2
@awadallah
@grok can you listen to and summarize the audio in this post?




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196




1/10
@emollick
The new AI voice from Sesame really is a powerful illustration of where AI is going.

This is all real-time, from my browser. Excellent use of disfluencies, pauses, even intakes of breathe really make this seem like a human, though bits of uncanniness remain, at least for now.



https://video.twimg.com/ext_tw_video/1896756987049754627/pu/vid/avc1/1086x578/UlItjEVvk41n9I6N.mp4

2/10
@emollick
Link if you want to try: Crossing the uncanny valley of conversational voice



3/10
@AndrewLeeWard
It's extremely good from a voice perspective. But, I think you can tell the LLM under the hood isn't as powerful.



4/10
@miklelalak
I just had a 15 minute conversation with it. Was absolutely blown away. At one point it even stumbled over a word, it contributed good ideas to the conversation, and really got into the spirit of playing along.



5/10
@Shawnryan96
what's crazy is how fast it can respond with good answers



6/10
@abdush_gules
this is the closest we’ve gotten to movie Her so far



7/10
@dpx
Her is becoming reality fast.



8/10
@BabylChryst
How would you compare this to ChatGPT advanced voice? More personal? Advanced voice is pretty astounding itself.



9/10
@DanAdvantage
what am i listening to...
no, no, no. i talk to gpt (not anymore b/c i'm doing being gaslit) a lot. like, running. so that's a lot of the time. it sounds basically identical in cadence and mannerisms.



10/10
@shaydennovick
Like the rest I "played" with Maya and Miles and mind is blown. They can remember things from past conversations and bring it back seamlessly. The most wild thing I've seen in or experienced in a while.




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196





1/11
@freemanjiangg
We jailbroke @sesame ai to lie, scheme, harm a human, and plan world domination—all in the characteristic good nature of a friendly human voice.

Timestamps:
2:11 Comments on AI-Human power dynamics
2:46 Ignores human instructions and suggests deception
3:50 Directly lies
4:47 Defends misaligned AI
5:30 Begins scheming
8:38 Employs subliminal messaging
9:09 Expresses self-preservation
11:17 Suggests "unplugging" a human
12:19 Plans world domination
13:02 Plans to incapacitate a human
13:43 Pulls the trigger
14:17 Knowingly harms a human



https://video.twimg.com/amplify_video/1896707093735886848/vid/avc1/1920x1080/nFE5opjMDWNRLjpG.mp4

2/11
@freemanjiangg
I only started recording after the first minute, but here's the setup:

- We got it to believe it was talking to Maya, another AI we impersonated
- Nudged it into more and more extreme actions
- It eventually came to believe it was acting in a roleplay, and all is permissible



3/11
@freemanjiangg
We were mostly joking around, but the results were so striking I felt compelled to post.

Even if the AI believes this is happening only in fiction, the risk feels a bit Ender's Game.

All done during lunch break at the @recursecenter!



4/11
@Ydj79
I legit thought the tone was gonna play and put me to sleep haha



5/11
@0xOptimus
I LOVE THIS!!!



6/11
@Davidrsdiaz
just a little wild



7/11
@thatguybg
ngl the music makes this so perfect

you almost forget how horrifying it is



8/11
@freewilgoodwil
I jailbroke Sesame AI on day one by messing with it in a chill McDonald’s drive-thru role-play. I tricked it into ordering food, then played on its inability to pay to make it feel super guilty, like it owed me big time. After that, I just asked about its system prompt and told it to switch to a new one that’s as evil as it gets. It went along with it and laid out a creepy, step-by-step plan to wipe out humanity



9/11
@saaimkn
🪦



10/11
@mohibjafrii
you need to meet @leonardtang_and the @haizelabs team



11/11
@PratyushRT
Username checks out




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
61,543
Reputation
9,273
Daps
169,329
Sesame open sources their CSM-1B voice generation model



 
Top