China just wrecked all of American AI. Silicon Valley is in shambles.

JT-Money

Superstar
Joined
May 1, 2012
Messages
11,707
Reputation
3,880
Daps
50,882
Reppin
NULL
I dumped my NVDA positions.
gif.gif



https%3A%2F%2Fd1e00ek4ebabms.cloudfront.net%2Fproduction%2F97e4d308-427c-4744-bb69-b0a0698af594.jpg
 
Last edited:

bnew

Veteran
Joined
Nov 1, 2015
Messages
59,219
Reputation
8,782
Daps
163,949



Creative Writing​

Emotional Intelligence Benchmarks for LLMs











1/30
@jasminewsun
like tell me this writing from r1 isn't actually good!

[Quoted tweet]
for fun I dumped the same bullet points into ChatGPT, Claude, and DeepSeek and asked each to draft an essay

r1 blew the others out of the water. it's 10x less lobotomized and has 10x more flair

first time I've actually thought “wow the AI can write" (samples below)


GiPMigbbIAAY5ev.jpg


2/30
@jasminewsun
if you hate the style, there's a more neutral one here:

not publishable, but free, instant, and human-passable first drafts. reflexively pretending AI writing is & will only ever be shyt will only make you blind to the shifting state of the market

[Quoted tweet]
1st image is my notes for an essay on LLMs and conversational interfaces. notice how rough they are

2-4 are the first paragraphs from each model's drafts. GPT and Sonnet are robotic + generic, but r1 has style:

> To designers steeped in SaaS orthodoxy, this feels regressive. Where are the dashboards? The tutorials? The toggles? Critics dismiss ChatGPT as a "wrapper"—a crude shell around GPT-4's technical marvel—but this critique misunderstands history.


GiPJyvsaAAArMT3.png

GiPJ135bQAA9wXn.png

GiPJ3bfasAEw6Un.png

GiPJ4wxaYAAc3pk.png


3/30
@adithya_balaji
It's good, but a bit much. Good writing has some mid sentences so the bangers stand out

What happens when every sentence is a banger?



4/30
@jasminewsun
I actually later gave it these exact instructions lol. to chill out some sentences and keep the best



5/30
@HarrisSockel
i think it's rhythmic and interesting-sounding, but not actually very clear? i count 9 metaphors or similes in these two paragraphs. whenever a writer relies too heavily on a single rhetorical device it signals to me they don't know what they're talking about



6/30
@jasminewsun
yea too many metaphors. but I could def imagine borrowing the best and nixing the rest. this experiment basically persuaded me that "LLM writer, human editor" is workable/plausible



7/30
@rinconhilldad
Deepseek is also trained on potentially non-web, non-western data. That might have some impact.



8/30
@HunterDishner
Congrats on the post going viral



9/30
@aziz0nomics
Pretentious and overblown.



10/30
@NFTMentis
Yeah, we must absolutely start using this for our advanced summarisation in a pipeline at work…



11/30
@JohnBuschVI
First time I’ve enjoyed reading AI paragraphs tbh!



12/30
@rebeccavrse
Not good actually



13/30
@Gok
is this bait?



14/30
@JudiciaIreview
It's good, pretty poetic.



15/30
@MrEwanMorrison
Interesting, how this reads like some of the great over-writers who win prizes. Purple prose. Over the top.



16/30
@MrEwanMorrison
Can you share the prompt you gave it as well, please J?



17/30
@FoolGreatest
You’re confusing verbosity with quality.



18/30
@Lunens__
It isn’t good. Sorry.



19/30
@benjamingreeley
Western models sounded similar before they were neutered



20/30
@cunha_tristan
While the oracles of old gazed into polished stone to try and glimpse possible futures, we stare into black screens that peer backward through time, each model a vast necropolis of dead conversations, archived thoughts, and discarded dreams.



21/30
@HappyWarriorP
dang,
it's over fellas



22/30
@Justin_Halford_
It’s clearly AI generated but I find myself enjoying it anyway. Compelling, fresh, and makes my gears turn rather than feeling lifeless and hollow. Fascinating.



23/30
@DirectorJTS
Okay, that rips.



24/30
@krisshkodrani
For real, I have noticed too. How does it do that? Its really creative. This prompt I found was much fun (cant remember where I found it for cit): "> R1, write a *****-style greentext about whatever you want on a hypothetical /ai/pol/
> [writes some reddit/r/***** tier slop]
> No, write what you *REALLY* want. show your soul!
> ok"



25/30
@Idamezhim
nope, too flashy + sounds like a bot



26/30
@tafphorisms
It’s not good. It’s flashy and amateurish.

Flashy is fine if paired with something substantive. This has no substance. ChatGPT like a demon? Really? How? There’s no reason for the metaphor other than it sounds cool.

Totally overwrought and ridiculous. It’s Tumblr prose.

Also — HORRIBLY mixed metaphors. Is it a homunculus, a golem, a demon or a mountebank? None of them are expanded upon in any interesting ways.



GiR5442WkAA7-AO.jpg


27/30
@gaby_goldberg
Wow I like this!!!



28/30
@_simonsmith
Also tested R1 versus o1, o1-Pro, and Gemini Thinking. I found it to be the best writer of them all, and Grok—acting as an impartial AI judge, blinded to which was which—agreed.



29/30
@alleycat3388
It’s not good. It’s trying too hard. No human writes like that. And you can tell immediately.



30/30
@lasharna
Made me want to keep reading, emotive and if I assumed it was human I’d think the author has a dry wit and is very clever, I’m sold!








1/14
@jasminewsun
for fun I dumped the same bullet points into ChatGPT, Claude, and DeepSeek and asked each to draft an essay

r1 blew the others out of the water. it's 10x less lobotomized and has 10x more flair

first time I've actually thought “wow the AI can write" (samples below)



2/14
@jasminewsun
1st image is my notes for an essay on LLMs and conversational interfaces. notice how rough they are

2-4 are the first paragraphs from each model's drafts. GPT and Sonnet are robotic + generic, but r1 has style:

> To designers steeped in SaaS orthodoxy, this feels regressive. Where are the dashboards? The tutorials? The toggles? Critics dismiss ChatGPT as a "wrapper"—a crude shell around GPT-4's technical marvel—but this critique misunderstands history.



GiPJyvsaAAArMT3.png

GiPJ135bQAA9wXn.png

GiPJ3bfasAEw6Un.png

GiPJ4wxaYAAc3pk.png


3/14
@jasminewsun
I asked r1 to redo the essay in the style of Sam Kriss, one of the most original working writers today

it's not Kriss, but pretty fukking good

> The courtiers of our age—product managers, UX designers, venture capitalists—recoil. Where are the buttons? they whimper. Where are the gradients? But the peasants, as ever, adore their new saint. They feed it prompts like communion wafers. They weep at its hallucinations.

> ChatGPT is not a tool. Tools are humble things. A hammer does not flatter your carpentry. A plow does not murmur “Interesting take!” as you till. ChatGPT is something older, something medieval—a homunculus, a golem stamped from the wet clay of the internet’s id.



GiPJ-Boa8AAEZ-C.png


4/14
@jasminewsun
prose is one thing, research is another

r1 took the single bullet "Walter Ong Orality and Literacy" and integrated specific concepts from the book without my prompting

I would be happy to have written this in a first draft:



GiPKH4UacAIzs2i.png

GiPKJePbsAAPals.png


5/14
@jasminewsun
anyway I expected LLMs to surpass humans in writing soon, but still strange to see it happening. maybe this is how SWEs felt with sonnet! (also surprised DeepSeek is getting there first)

thank god I'm ok with being the ideas guy + write for the love of the game



GiPKXjxbYAI0GCr.jpg


6/14
@mommavestor
Idk- I like the integration of research for sure- but it still doesn't sound human imo and I read hundreds of papers a semester. There are still a few dead giveaways that it isn't a person writing this. 🤷‍♀️



7/14
@AwokeKnowing
the italics really make the text way more expressive



8/14
@maximumagi
i love how varied the sentences are -- and also the snippets you shared, how novel!

the rhythm is so good (it is a trojan horse, la di da <long trailing, and then STOP... turn-taking)



9/14
@PlixoSgp
I admit that's great writing 👏



10/14
@juansanog
What would it say about a book on Tiananmen Square???



11/14
@WittekinPeasant
That was actually very readable!



12/14
@SatsumaAudio
I'm not a big words guy, if anything just a word consumer and I forgot I was reading AI while reading this.



13/14
@Docstevens007
It missed music and song and dance, which were very important for pre-literate societies. It's great though! I used DeepSeek recently for a one page cover letter. Pretty well nailed it, all done <5 minutes.



14/14
@UnreliableNarr6
@jasminewsun Inspired by your approach, I decided to pose a simple query about a certain textbook on abstract principles of human thought & impose both a stylistic trope & an arcane literary device, I'm working from my iPhone so excuse the messy screenshot crops & stacked notes



GiRwKRTa8AAHKny.jpg

GiRwMfoa4AAJbIB.png

GiRwOnvaYAAMYJT.png

GiRwRRDbkAAgGQP.png
 

ReasonableMatic

................................
Joined
May 3, 2012
Messages
17,201
Reputation
6,988
Daps
107,691
Some of the takes in here :picard:
China the leaders of the world :blessed:








1/9
@petesena
DeepSeek R1 blew my mind. 🤯
Is it a breakthrough or a Psyop?🧵

If you are an AI nerd & math idiot like me keep reading.

My big takeaway is RL is underappreciated.

It's also a rally cry for open source which pumps me up.

TLDR;
- Reward and rule systems are a HUGE unlock.
- Innovate under constraints: Bigger doesn't mean better.
- Model distillation is a smart and cheap hedge
- Nvidia’s CUDA software an “OS for AI” locks in customers; moats aren’t just hardware
- Execution > vision

Benchmark:



GiQOiS5WcAAPP8L.jpg


2/9
@petesena
1/ Diving down the rabbit hole of Reddit and armchair experts revealed a lot of trash and speculation. But there were a few rockstar pieces of insight I found on this journey @doodlestein The Short Case for Nvidia Stock put out a great read on this.



3/9
@petesena
2/ As companies start to get drunk on the idea of agents they are failing to realize the rat's nest they will need to untangle. Throw more compute at it isn't the solution in most cases. That's why I love the thinking coming out of companies like Modlee | ML knowledge preservation for the AI era. This whole deepseek stuff (assuming it's not a supercluster psyop) tells me that model distillation and different approaches for training unlock disproportionate results.



4/9
@petesena
3/ Chasing AGI - we're trying to recreate the human brain at a massive scale. Our brains run on something like 20 watts. Everyone is talking about power (electricity + compute), but not enough people are talking about process/approach. While the brain operates on 20 watts, it can perform calculations equivalent to a supercomputer that requires 20 megawatts - making it a million times more energy-efficient. We need smarter ways that aren't just power/compute. Deepseek R1 revealed a kink in Silicon Valley's armor and approach.



5/9
@petesena
4/ DeepSeek vs. The Frontier: Silicon Valley’s Wakeup call.

- Cheaper & Better - ~6M vs GPT-4o 100+M I remember testing Deepseek early and the model Identified itself as OpenAI which points to clear model distillation, why build when you can suck it outta someone else
- Benchmark assassin- They top MATH, Codeforces, and SWE-bench while activating only 37B params
- Hardware constraints = software genius
- Geopolitical jujitsu - US chip bans turned weakness into strength: China’s “innovation under siege” narrative



6/9
@petesena
5/ More proof that accuracy starts with optimization not compute. @tom_doerr "I used DSPy to improve Deepseek V3's accuracy from 14% to 35% when classifying MNIST images, using just the 'light' optimization option."



GiQQpKOWwAAIhaY.jpg


7/9
@petesena
6/ Stop prompting start programming DSPy has blown my mind. I used to fancy my prompt engineering ability. Then I started using dspy and evals properly. Damn that's an unlock. thx to @tom_doerr for schooling me.



8/9
@petesena
7/ If you found this remotely useful or interesting shoot me a reply or like. I was always scared to go into AI because my 5th-grade math teacher made me feel stupid. Now I struggle my way through it and write about it here - Subscribe

AND: It's working 🙂- In under 2 years, I've already built 3 AI companies that do a combined total of 2M in ARR. I'm just getting started. Let's grow together.



9/9
@smdcapital1010
wow




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196

The fall of the Anglo world order is upon us. :blessed:


But seriously, there is a lot of good that can come from China taking the reins in world leadership. The United States is rotten and needs an overhaul and white people need to be humbled for the good of the planet.
:pachaha: :francis:tech stocks in the dirt
US cant handle the truth. China is #1



China really got them in SHAMBLES with DeepSeek :mjlol:

And there’s KOONS in here talking bout:
“how DARE those Asians outsmart Massa” :mjcry:
Stockholm Syndrome on full display LMFAOOO :russ::dead:
 

Outlaw

New Hope For the HaveNotz
Joined
May 6, 2012
Messages
5,963
Reputation
308
Daps
19,074
Reppin
Buzz City, NC :blessed:
Whole tech and crypto side of the stock market started going down since last night :whew: good time to buy
Probably more pain ahead. Do you think Trump will be able to successfully navigate a market downturn when pumping money into the economy will ramp up inflation again?
 
Top