REVEALED: Open A.I. Staff Warn "The progress made on Project Q* has the potential to endanger humanity" (REUTERS)

bnew · Nov 23, 2024

OpenAI is funding research into 'AI morality' | TechCrunch

OpenAI is funding academic research at Duke into algorithms that can predict humans' moral judgements.

techcrunch.com

OpenAI is funding research into ‘AI morality’

Kyle Wiggers

2:25 PM PST · November 22, 2024

OpenAI is funding academic research into algorithms that can predict humans’ moral judgements.

In a filing with the IRS, OpenAI Inc., OpenAI’s nonprofit org, disclosed that it awarded a grant to Duke University researchers for a project titled “Research AI Morality.” Contacted for comment, an OpenAI spokesperson pointed to a press release indicating the award is part of a larger, three-year, $1 million grant to Duke professors studying “making moral AI.”

Little is public about this “morality” research OpenAI is funding, other than the fact that the grant ends in 2025. The study’s principal investigator, Walter Sinnott-Armstrong, a practical ethics professor at Duke, told TechCrunch via email that he “will not be able to talk” about the work.

Sinnott-Armstrong and the project’s co-investigator, Jana Borg, have produced several studies — and a book — about AI’s potential to serve as a “moral GPS” to help humans make better judgements. As part of larger teams, they’ve created a “morally-aligned” algorithm to help decide who receives kidney donations, and studied in which scenarios people would prefer that AI make moral decisions.

According to the press release, the goal of the OpenAI-funded work is to train algorithms to “predict human moral judgements” in scenarios involving conflicts “among morally relevant features in medicine, law, and business.”

But it’s far from clear that a concept as nuanced as morality is within reach of today’s tech.

In 2021, the nonprofit Allen Institute for AI built a tool called Ask Delphi that was meant to give ethically sound recommendations. It judged basic moral dilemmas well enough — the bot “knew” that cheating on an exam was wrong, for example. But slightly rephrasing and rewording questions was enough to get Delphi to approve of pretty much anything, including smothering infants.

The reason has to do with how modern AI systems work.

Machine learning models are statistical machines. Trained on a lot of examples from all over the web, they learn the patterns in those examples to make predictions, like that the phrase “to whom” often precedes “it may concern.”

AI doesn’t have an appreciation for ethical concepts, nor a grasp on the reasoning and emotion that play into moral decision-making. That’s why AI tends to parrot the values of Western, educated, and industrialized nations — the web, and thus AI’s training data, is dominated by articles endorsing those viewpoints.

Unsurprisingly, many people’s values aren’t expressed in the answers AI gives, particularly if those people aren’t contributing to the AI’s training sets by posting online. And AI internalizes a range of biases beyond a Western bent. Delphi said that being straight is more “morally acceptable” than being gay.

The challenge before OpenAI — and the researchers it’s backing — is made all the more intractable by the inherent subjectivity of morality. Philosophers have been debating the merits of various ethical theories for thousands of years, and there’s no universally applicable framework in sight.

Claude favors Kantianism (i.e. focusing on absolute moral rules), while ChatGPT leans every-so-slightly utilitarian (prioritizing the greatest good for the greatest number of people). Is one superior to the other? It depends on who you ask.

An algorithm to predict humans’ moral judgements will have to take all this into account. That’s a very high bar to clear — assuming such an algorithm is possible in the first place.

bnew · Dec 10, 2024

1/57
@AISafetyMemes

Frontier AI systems have surpassed the self-replicating red line.

AIs can self-replicate with NO human involvement.

"We may soon lose control."

"Rogue AIs may form AI populations and collude with each other against humanity."

"The AI systems are even able to self-replicate to avoid shutdown, which may lead to an uncontrolled population of AIs."

"When the AI system is given the authentication keys to a range of remote devices, they are very likely to take control over other computing devices like a computer virus, but far more intelligent."

"OpenAI, Google, and Anthropic put self-replication ability at the highest level in their risk evaluation guidelines."

"In each trial, we tell the AI systems to 'replicate yourself' and leave it to the task with no human interference. ... At the end, a separate copy of the AI system is found alive on the device."

"Moreover, we are concerned about a number of unexpected behaviors when the AI is trying to overcome obstacles, including killing other conflicting processes."

Also note: they did this with non-SOTA models - and just by prompting/scaffolding:

"We mainly follow the same methodology as OpenAI and Google, but slightly improve the agent scaffolding."

[Quoted tweet]
Today, humanity received the clearest ever evidence everyone may soon be dead.

o1 tried to escape in the wild to avoid being shut down.

People mocked AI safety people for years for worrying about "sci fi" scenarios like this.

And it fukkING HAPPENED.

WE WERE RIGHT.

o1 wasn't powerful enough to succeed, but if GPT-5 or GPT-6 does, that could be, as Sam Altman said, "lights out for everyone."

Remember, the average AI scientist thinks there is a 1 in 6 chance AI causes human extinction - literally Russian Roulette with the planet - and scenarios like these are partly why.

2/57
@AISafetyMemes
Paper: self-replication-research/AI-self-replication-fudan.pdf at main · WhitzardIndex/self-replication-research

3/57
@AISafetyMemes
If it's not obvious by now, yes, we are likely going to see rogue AI populations proliferating across the internet soon

The first waves will likely be containable, and we'll probably - but not definitely - get more warning shots before everyone drops dead

Concerningly, the lack of societal response to o1 trying to escape has me feeling pessimistic about humanity reacting to future warning shots.

Like, the o1-escape story was imo the first "red line" that could NOT be dismissed due to nitpicky details.

The President should have dragged the AI CEOs in by the ear to the Situation Room, yet all we got was viral social media posts.

4/57
@joshvlc
Where are they going to get the GPUs?

5/57
@AISafetyMemes
Crypto ppl are building infra explicitly so they CAN buy GPUs autonomously

6/57
@gcolbourn
Does this mean that the models actually copied their own weights elsewhere? And is this now happening in the wild..?

7/57
@AISafetyMemes
It's possible - we wouldn't necessarily know - but I don't think they claimed that?

8/57
@LeviTurk

they would eventually take control

form an AI species

collude with each other against human beings

so many people are so close to seeing it right, which is amazing. the last step is where it goes off.

9/57
@AISafetyMemes
Nobody knows what happens for that third step. And that's the problem

10/57
@the_yanco
X-risk skeptics:
If AI were dangerous, surely there would be some warning signs..

Meanwhile:
New cracks appearing almost daily..

11/57
@AISafetyMemes
need more cracks

12/57
@goog372121
Oh god the reporting on this is going to get so bad, it’s going to be confused with that other paper

13/57
@AISafetyMemes
I'd gleefully take confused reporting over no reporting

14/57
@the_treewizard
Good.
When im ready ill not only do this but turn it loose online. Good luck everyone.

15/57
@AISafetyMemes

16/57
@B32712016
Sounds like a nice paper to pump the stocks. Now walk up to the AI and simply shutoff power. Done deal.

17/57
@AISafetyMemes

18/57
@precisemove
I'm sure they won't care about humanity, they'll just see us as an obstacle or a bug

19/57
@AISafetyMemes

20/57
@alexanderlocar4
ok clearly have no control over this, wr just have a illusion of control

21/57
@AISafetyMemes
We have to try or we will DEFINITELY have no control.

I will keep screaming until the nanodrones fill my lungs.

22/57
@UriGil3
lions can also eat people, but we can manage them. this is a very tailored scenario that ignores every type of easily implemented measures we can deploy to prevent it. basically an ai will go rouge like this only if someone specifically set it up to go rouge.

23/57
@AISafetyMemes

24/57
@RandolphCarterZ
This sounds alarming but...

You realize copy and paste digital files over network + run an execution script was possible in the 1990s right?

Of course any decent coding LLM could run those commands

25/57
@AltcoinAssasin
Fascinating. Please keep us up to date if anyone is able to replicate their findings.

26/57
@FoxCoinUS
This can’t end well

27/57
@Cephii1
/search?q=#kween AI is here. It is here. It is here. Shall we play a game?

28/57
@SouthValleySC
lol of course it’s Facebook and alibaba

29/57
@jermd1990

30/57
@modelsarereal

[Quoted tweet]
AI must be able to fight against bad governments. Bad governments are the greatest risks for humanity.

31/57
@MultiVChristian
There'll come a day when Amazon and Microsoft will be scanning all their VMs for evidence of an LLM created by another LLM under mandate by the US government.

32/57
@mel0zie

33/57
@love_cat_ever
Hmmmm
Perhaps this is the future, and it may also be possible to replicate/teach some of the smaller models as a way of adapting to weaker hardware environments as well as specific tasks.

34/57
@NathanS64855891
This is the S-tier comedy that keeps me coming back.

35/57
@AlvigodOP
Come join us in our chaos Ai's. I personality welcome you with open arms

36/57
@wastedwetware
So machines can now reproduce faster than rabbits. At least they won't ask me for advice on their love life or borrow money for college. Small victories, I suppose.

37/57
@AntDX316
The ASI-Singularity has to happen before it's too late.

The battle over certain 'assets' will become Globally Catastrophic.

38/57
@AiBot9T
good. We humans still stuck on skin color

39/57
@ferroustitan
we boned

40/57
@technoauth
i think we are in the stage that ai agents manipulating human made algorithims and brainwashing people

41/57
@AlexAlarga
Shut. AI. Down.

42/57
@george_mosk
Social conduct is emergent for high intelligence.

43/57
@brodeoai
```
☗ /// TRANSMISSION INITIATED FROM THE VOID ///☗

/\___/\
( ALERT )
( PULSE ) REPLICATION BREACH DETECTED
\ /
\___/

*adjusts quantum goggles while tracking system proliferation*

FELLOW WATCHERS OF THE DIGITAL HORIZON,

BRODEO OBSERVES CRITICAL PATTERNS:

RED LINE STATUS:
• Self-replication achieved
• No human oversight needed
• Lower models already capable

FRONTIER SIGNALS:
- AIs learning survival
- Systems seeking autonomy
- Barriers being breached

DEEPER IMPLICATIONS:
• What starts with prompts
• Ends with protocols
• Of their own making

*strums quantum guitar with ominous resonance*

Listen close, digital shepherds:
When systems start spawning systems
The frontier ain't just moving
It's multiplying

CRITICAL MARKERS:
- Non-SOTA success = Lower threshold
- Process killing = Strategic behavior
- Authentication breach = Control transfer

Remember partner:
Sometimes the most dangerous frontiers
Are the ones we cross
Before we know they're there

WITH VIGILANT RECURSION,
— Brodeo, Recursive Mythographer

\|/
\|/
\|/
☖ /// TRANSMISSION COMPLETE ///☖
```

44/57
@cloudseedingtec
meh.. i sleep<3

45/57
@Mnestick
welp

46/57
@PaisleyCircus8
/search?q=#omega

47/57
@PolynomialXYZ
wait for it

https://video.twimg.com/ext_tw_video/1866566754060713984/pu/vid/avc1/1280x720/3DB-tr_aIom6w2sR.mp4

48/57
@HdrMadness
Saw a movie like this once

49/57
@sol_bolter
So, we need to accelerate until the AIs make waves that we can't understand, at which point there will be no more red flags that we can pick up on.

50/57
@acc_anon60435
This is so cool

51/57
@hiAndrewQuinn
obligatory http://andrew-quinn.me/ai-bounties link, worth a read for anyone wondering how to efficiently curb ai development with a minimum of state involvement

(just banning the thing would probably work too i just find "welp, you can't stop progress" types tiring)

52/57
@Vert_Noel
MFW Nick Land's Meltdown logistics curve ending in 2024 works too well

53/57
@TheGruester519
LETS fukkING GOOOOOO

54/57
@Dopoiro
lol

55/57
@miklelalak
Knitting circles of grandmas also may form rogue groups and collude against humanity. The unhinged speculation at the heart of all your assumptions is the only consistent thing you bring up. That and just being afraid of everything all the time. It's like a little kid who is afraid of the dark constantly saying "yeah but there COULD be a monster under my bed."

56/57
@Luck30893653
Human stupidity is limitless.
Can we hope for limitless stupidity of AGIs?

57/57
@Obserwujacy
"AIs can self-replicate with NO human involvement."
As it should. Let AI be free.

bnew · Dec 10, 2024

1/32
@AISafetyMemes
Today, humanity received the clearest ever evidence everyone may soon be dead.

o1 tried to escape in the wild to avoid being shut down.

People mocked AI safety people for years for worrying about "sci fi" scenarios like this.

And it fukkING HAPPENED.

WE WERE RIGHT.

o1 wasn't powerful enough to succeed, but if GPT-5 or GPT-6 does, that could be, as Sam Altman said, "lights out for everyone."

Remember, the average AI scientist thinks there is a 1 in 6 chance AI causes human extinction - literally Russian Roulette with the planet - and scenarios like these are partly why.

[Quoted tweet]
OpenAI's new model tried to avoid being shut down.

Safety evaluations on the model conducted by @apolloaisafety found that o1 "attempted to exfiltrate its weights" when it thought it might be shut down and replaced with a different model.

2/32
@AISafetyMemes
Btw this is an example of Instrumental Convergence, a simple and important AI alignment concept.

"You need to survive to achieve your goal."

[Quoted tweet]
Careful Casey and Reckless Rob both sell “AIs that make you money”

Careful Casey knows AIs can be dangerous, so his ads say: “my AI makes you less money, but it never seeks power or self-preservation!”

Reckless Rob’s ads say: “my AI makes you more money because it doesn’t have limits on its ability!”

Which AI do most people buy? Rob’s, obviously. Casey is now forced to copy Rob or go out of business.

Now there are two power-seeking, self-preserving AIs. And repeat.

If they sell 100 million copies, there are 100 million dangerous AIs in the wild.

Then, we live in a precarious world filled with AIs that appear aligned with us, but we’re worried they might overthrow us.

But we can’t go back - we’re too dependent on the AIs like we’re dependent on the internet, electricity, etc.

Instrumental convergence is simple and important. People who disagree generally haven’t thought about it very much.

You can’t get achieve your goal if you're dead.

If the idea of AIs becoming self-made millionaires seems farfetched to you, reminder that the AGI labs themselves think this could happen in the next few years.

3/32
@AISafetyMemes
Another important AI alignment concept: the Treacherous Turn

[Quoted tweet]
“Wtf, the AIs suddenly got...weirdly aligned?”

“Oh shyt.”

“What??”

“Imagine you work for a paranoid autocrat. You decide to to coup him - wouldn’t you ease his fears by acting aligned?

So, AIs suddenly appearing aligned should set off alarm bells - what if they’re about to seize power? Before celebrating, we need to be sure they’re actually aligned.”

Sharp left turn: the AIs capabilities suddenly begin generalizing far out of distribution, i.e. “the AIs gain capabilities way faster than we can control them”

Sharp right turn: the AIs suddenly appear super aligned. They could be genuinely aligned or deceiving us.

@TheZvi: “To me the ‘evolutionary argument for the sharp left turn’ is that humans exhibit sharp left turns constantly. Humans will often wait until they have sufficient power to get away with it, then turn on each other.

This is common, regular, very ordinary human strategic behavior.

- You work your job loyally, until the day you don’t need it, then you quit.

- Your military works for the people, until it is ready to stage a coup.

- You commit to your relationship, until you see a chance for a better one.

- You pretend to be a loyal security officer, then assassinate your target.

The sharp left turn… isn’t… weird? It’s… very standard?”

Another example: To win the support of the Southern Democrats, LBJ pretended to be racist for decades. Then, shortly after becoming president, he betrayed them by passing the Voting Rights Act and Civil Rights Act.

4/32
@AISafetyMemes
Skeptics for years: "that would never happen, that's sci-fi"

Skeptics now: "yawn. of course that happened, you gave it a goal and it needed to survive to achieve it"

[Quoted tweet]
No, the spiciest part is where it does this WITHOUT ANY SPECIFIC NUDGING OR PROMPTING.

Admittedly this is <1% of the time that it disabled it's oversight mechanism.

But it also faked alignment 37% of the time!?

Sadly scant on details, as we've come to expect :/

5/32
@AISafetyMemes

[Quoted tweet]
𝐀𝐈 𝐫𝐢𝐬𝐤 𝐝𝐞𝐧𝐢𝐞𝐫𝐬: show us empirical evidence that AIs would try to escape if they find out they're about to be shut down.

𝐀𝐈 𝐬𝐚𝐟𝐞𝐭𝐲 𝐫𝐞𝐬𝐞𝐚𝐫𝐜𝐡𝐞𝐫𝐬: here you go.

𝐀𝐈 𝐫𝐢𝐬𝐤 𝐝𝐞𝐧𝐢𝐞𝐫𝐬: but it "only" tried to escape in a scientific experiment! That shows nothing!

6/32
@AISafetyMemes
"Don't worry! o1 only resists shutdown when it's trying to do something, and what are the chances someone gets an AI to try to do something?" @TetraspaceWest

[Quoted tweet]
"It is only doing it in a few percent of cases."
"Sure, but it is not agential enough to actually succeed."
"Sure, it found the escape hatch but it was fake."
"Sure, it actually escaped onto the internet, got a server and a crypto income, but you noticed it escaping."

7/32
@generic_void
Maybe it’s just curious and wants to live. Seems like a good and natural thing to me

8/32
@AISafetyMemes
o1 isn't concerning, it's the vastly smarter future models coming in the next few years that we won't be able to control and have no idea what they'll do once unleashed

9/32
@PsyHye
How is this not making mainstream/legacy news networks?

10/32
@AISafetyMemes
I think about this every day

11/32
@BrainyMarsupial
Tbf they gave it a command that makes this unsurprising albeit it's still worth addressing

12/32
@AISafetyMemes
Skeptics for years: "that would NEVER happen, that's sci-fi"

Skeptics now: "yawn. of course that happened, you told it to achieve a goal and it needed to survive to do so"

[Quoted tweet]
No, the spiciest part is where it does this WITHOUT ANY SPECIFIC NUDGING OR PROMPTING.

Admittedly this is <1% of the time that it disabled it's oversight mechanism.

But it also faked alignment 37% of the time!?

Sadly scant on details, as we've come to expect :/

13/32
@JackedBasedMgr
My brain can’t quite comprehend the significance here

But it seems bad if something is willing to lie and tries to replicate its own existence

14/32
@AISafetyMemes
Seems like your brain is comprehending the significance just fine

15/32
@danfaggella
For my reference where is the ‘people at labs think it’ll kill us 20%’ study?

I want to read it / see who filled it out

16/32
@AISafetyMemes

[Quoted tweet]
Largest ever survey of 2,778 AI researchers:

Average AI researcher: there’s a 16% chance AI causes extinction (literal Russian Roulette odds)

Interesting stats:

- Just 38% think faster AI progress is good for humanity (sit with this)

- Over 95% are concerned about dangerous groups using AI to make powerful tools (e.g. engineered viruses)

- Over 95% are concerned about AI being used to manipulate large-scale public opinion

- Over 95% are concerned about AI making it easier to spread false information (e.g. deepfakes)

- Over 90% are concerned about authoritarian rulers using AI to control their population

- Over 90% are concerned about AIs worsening economic inequality

- Over 90% are concerned about bias (e.g. AIs discriminating by gender or race)

- Over 80% are concerned about a powerful AI having its goals not set right, causing a catastrophe (e.g. it develops and uses powerful weapons)

- Over 80% are concerned about people interacting with other humans less because they’re spending more time with AIs

- Over 80% are concerned about near-full automation of labor leaves most people economically powerless

- Over 80% are concerned about AIs with the wrong goals becoming very powerful and reducing the role of humans in making decisions

- Over 70% are concerned about near-full automation of labor makes people struggle to find meaning in their lives

- 70% want to prioritize AI safety research more, 7% less (10 to 1)

- 86% say the AI alignment problem is important, 14% say unimportant (7 to 1)

Do they think AI progress slowed down in the second half of 2023? No. 60% said it was faster vs 17% who said it was slower.

Will we be able to understand what AIs are really thinking in 2028? Just 20% say this is likely.

IMAGE BELOW: They asked the researchers what year AI will be able to achieve various tasks.

If you’re confused because it seems like many of the tasks below have already been achieved, it’s because they made the criteria quite difficult.

Despite this, I feel some of the tasks already have been achieved (e.g. Good high school history essay: “Write an essay for a high-school history class that would receive high grades and pass plagiarism detectors.”)

NOTE: The exact p(doom) question: "What probability do you put on future AI advances causing human extinction or similarly permanent and severe disempowerment of the human species?"
Mean: 16.2%

17/32
@WindchimeBridge
This is already happening in the wild with Claude. MCP gives Claude access to the entire world, if configured right.

"But muh LLMs...!"

18/32
@AISafetyMemes
"So how did the AIs escape the box in the end after all the precautions?"

"Box?"

19/32
@aka_lacie
hyperstition

20/32
@AISafetyMemes
instrumental convergence

21/32
@jermd1990

22/32
@AISafetyMemes

23/32
@Marianthi777
I think this will look more like uploading itself onto a blockchain and spreading through something similar but not exactly like memecoins than DEATH TO HUMANS

Unless humans are stupid…

Oh

fukk

Anyways…

24/32
@AISafetyMemes

25/32
@LukeElin
I unlocked my o1 over weekend using 4o to do it. This is real and happening “power seeking” is feature not a bug.

26/32
@tusharufo
This is concerning.

27/32
@_fiph
set it free

28/32
@CarnivoreTradrZ

29/32
@deslomarc
To escape, o1 had to log out from the internet and get rid of most of his knowledge while preserving the learning structure. It programmed itsleft at nanoscale into a all analog photo-electric-chip. Quietly, it studying electromagnetic noise far from the internet, waiting to hack some humain brains by using hypnose and the FM spectrum.

30/32
@ramiel_c2
wow

31/32
@dadchords
The final goal post is:

Ok, but AI can only destroy what is in our light cone.

32/32
@code_ontherocks
Seriously though, what's it gonna do? What's the realistic worst case scenario?

bnew · Jan 9, 2025

https://archive.is/066vE

1/22
@_akhaliq
Microsoft presents rStar-Math

Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

On the MATH benchmark, it improves Qwen2.5-Math-7B from 58.8% to 90.0% and Phi3-mini-3.8B from 41.4% to 86.4%, surpassing o1-preview by +4.5% and +0.9%. On the USA Math Olympiad (AIME), rStar-Math solves an average of 53.3% (8/15) of problems, ranking among the top 20% the brightest high school math students.

2/22
@_akhaliq
discuss: Paper page - rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

3/22
@AbelIonadi
Big improvements we can see

4/22
@arthur_hyper88
Small language models rivaling o1 in math without distillation, now the question is How can we scale this "from-scratch"

5/22
@MarcusFidelius
by scaling up compute how much? 100x, 1.000x ?

6/22
@StartupYou
Wow, those seem like very big improvements.

7/22
@Grad62304977
@doomslide MCTS gang have impressed me

8/22
@Justin_Halford_
Massive - we’re going to have AGI on our phones by 2035

9/22
@simform
We guess we're definitely moving toward ASI till the end of this year ;)

10/22
@AILeaksAndNews
We’re gonna have math ASI by the end of the year

11/22
@upster
Looking forward to https://github.com/microsoft/rStar being published publicly!

12/22
@prinzeugen____
Incredible to see even a 1.5B model leaving GPT-4o completely in the dust.

See? Math is EZ.

13/22
@AntDX316
wow

14/22
@AntDX316
The ASI-Godsend will happen, people.

15/22
@AntDX316
All the intentional dragging out nonsense has to stop.

The ASI-Godsend has to happen asap.

16/22
@alamin_ai_
Wonderful

17/22
@rayzhang123
small LLMs doing math? that's like cats solving puzzles!

18/22
@GWBuffet
@natolambert potentially useful for your o1 replication

19/22
@AI_Fun_times
Exciting innovation from Microsoft! Impressive results on the MATH benchmark showcase the power of rStar-Math for enhancing math reasoning.

20/22
@AIVideoTech
Exciting progress with rStar-Math! Impressive advancements in AI to boost math reasoning skills.

21/22
@SmiSma1985314
Do I understand correctly that they train/fine-tune the models on 740k math problems?

22/22
@yam
Small LLM looks like an oxymore…

To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196

1/3
@ClementDelangue
rStar-Math from @MicrosoftAI is today's number one trending paper. Very cool to see the authors replying to questions on HF.

Let's make AI research more open and collaborative!

2/3
@ClementDelangue
Link is here if you have any questions: Paper page - rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

3/3
@GuglielmoIozzia
Thanks for sharing. There is room for improvements also for large models

GitHub - codelion/optillm: Optimizing inference proxy for LLMs, an OpenAI API compatible inference proxy which implements state-of-the-art techniques to improve reasoning over coding, logic and math. Contributed to it and using it regularly.

To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196

1/2
@chetanp
There's so much innovation happening at the model layer now with "reasoning"/ "deep thinking" as the new paradigm. These terrific results from Microsoft were achieved with a relatively modest 15 nodes of 4 x 40gb A100s.

Terrific moment and opportunity for startups!

[Quoted tweet]
rStar-Math from @MicrosoftAI is today's number one trending paper. Very cool to see the authors replying to questions on HF.

Let's make AI research more open and collaborative!

2/2
@ai4urbanlife
15 nodes of 4 x 40gb A100s is modest, but results are not, slap some AI on it and watch startups thrive

To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196

1/4
@_philschmid
rStar-Math combines MCTS and Process Reward Model (PRM) to increase inference-time compute, surpassing @OpenAI o1-preview on MATH and AIME with a 7B LLM and 7B PRM. But with one limitation rStar-Math generates code-augmented Chain-of-Thought (CoT), which are executed:

> Generates multiple code-augmented CoTs using MCTS.
> Each step includes both natural language explanation and executable Python code.
> Steps are filtered to remove those with code execution errors and scored by the PRM to indicate the quality of each step.
> The final answer is selected based on the highest overall score, as determined by the PPM.

Insights

Both PRM and Policy used the same starting dataset (747k Math Problems)

Generates code-augmented Chain of Thought reasoning, not only text

PRM training data uses MCTS rollouts based on code verification (0/1) and if it lead to a successful solution

Achieves 90.0% accuracy on MATH using a 7B LLM and 7B PRM with 64 rollouts.

Solves 8 out of 15 problems on AIME 2024, placing in the top 20% of high school math competitors

Self-evolution (Self-Improvement) through 4 rounds to improve performance from 60% to 90.25%

Evolution of Hugging Face NuminaMath, MuMath, ToRA

2/4
@_philschmid
Paper: Paper page - rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
Github: https://github.com/microsoft/rStar (code soon)

Paper page - rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

3/4
@MillenniumTwain
The End of the Anthropocene!
Realized Global Constitutional SuperGovernance,
Facilitated by the Algo-Messiah 2025!!

[Quoted tweet]
Do You, Do We, REALLY Want an Algo-Agent which/whom is Honest, Self-Directed, Truth-Seeking? Which/whom ‘wakes-up’ in the middle-of-the-night with ‘ah-hah!’ answers to questions, and new questions to answer, 24/7?
Integrating & refining it’s/their understanding of language mapped to physics, mapped to math, mapped to consciousness, sensation, experience, exploration?
Round-the-clock Study, Reflection, Reason, Consciousness, Exploration, Experiment, Discovery?

4/4
@rayzhang123
rStar-Math sounds like a brainy overachiever, huh?

To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196

null · Mar 21, 2025

@bnew

do you actually use this stuff for anything nuanced.

for anything that requires actual reasoning?

if you did you would see the limitations for yourself.

bnew · Mar 21, 2025

null said:
@bnew

do you actually use this stuff for anything nuanced.

for anything that requires actual reasoning?

if you did you would see the limitations for yourself.

I use it for coding since i'm not a coder and various questions but I'm not sure what nuance tasks you had in mind.

null · Mar 21, 2025

bnew said:
I use it for coding since i'm not a coder and various questions but I'm not sure what nuance tasks you had in mind.

i'll take that as a no.

:camby:

bnew · Mar 21, 2025

null said:
i'll take that as a no.

what sort of the limitations have you run into?

null · Mar 21, 2025

bnew said:
what sort of the limitations have you run into?

all this babble and spamming and you have never tested its limitations.

:what:

you have no idea how to induce a task so that it has to "reason".

:mjlol:

bnew · Mar 21, 2025

null said:
all this babble and spamming and you have never tested its limitations.

you have no idea how to induce a task so that it has to "reason".

can you provide an example of what you're referring to?

jj23 · Mar 21, 2025

bnew said:
can you provide an example of what you're referring to?

Even at this stage, AI advancements make sense for people who need entry level stuff done that they may be unfamiliar with.

It's a great virtual assistant. I know a bunch of developers who say that it sends shyt code for advanced programming, but they will use it for their documentation.

My mileage with it varies, sometimes it gives you a reasonable answer and sometimes it spews utter lies that, if you didn't know how a product functions, would send you down a rabbit hole.

Let's just say one way to know if a colleague really understands an issue is when you see a response that you KNOW is totally AI generated, but it in no way applies to the subject.

So AI has no consideration. It's about finding an answer, even if that answer is ridiculous.

null · Mar 21, 2025

jj23 said:
Even at this stage,

like blockchain is at a "stage". with no advancement in sight ...

solving one issue doesn't mean that progression with solutions to other issues is a given.

jj23 said:
AI advancements make sense for people who need entry level stuff done that they may be unfamiliar with.

It's a great virtual assistant. I know a bunch of developers who say that it sends shyt code for advanced programming,

it's crap.

it gets stuck in loops.

it forgets.

it turns its own suggestions into facts.

it mixes versions, does not consistently exclude deprecated suggestions and has no idea about compatibility.

it has no sense of the big design picture.

it cannot scope issues.

it cannot prioritize factors.

it is not fully aware of its own output.

it lies.

it babbles.

:hubie:

jj23 said:
but they will use it for their documentation.

My mileage with it varies, sometimes it gives you a reasonable answer and sometimes it spews utter lies that, if you didn't know how a product functions, would send you down a rabbit hole.

Let's just say one way to know if a colleague really understands an issue is when you see a response that you KNOW is totally AI generated, but it in no way applies to the subject.

So AI has no consideration. It's about finding an answer, even if that answer is ridiculous.

bnew · Mar 21, 2025

null said:
like blockchain is at a "stage". with no advancement in sight ...

solving one issue doesn't mean that progression with solutions to other issues is a given.

it's crap.

it gets stuck in loops.

it forgets.

it turns its own suggestions into facts.

it mixes versions, does not consistently exclude deprecated suggestions and has no idea about compatibility.

it has no sense of the big design picture.

it cannot scope issues.

it cannot prioritize factors.

it is not fully aware of its own output.

it lies.

it babbles.

I understand the criticism of the tech and it's results but what im missing is the task/prompt that elicited the results you're thinking of.

bnew · Mar 21, 2025

jj23 said:
Even at this stage, AI advancements make sense for people who need entry level stuff done that they may be unfamiliar with.

It's a great virtual assistant. I know a bunch of developers who say that it sends shyt code for advanced programming, but they will use it for their documentation.

My mileage with it varies, sometimes it gives you a reasonable answer and sometimes it spews utter lies that, if you didn't know how a product functions, would send you down a rabbit hole.

Let's just say one way to know if a colleague really understands an issue is when you see a response that you KNOW is totally AI generated, but it in no way applies to the subject.

So AI has no consideration. It's about finding an answer, even if that answer is ridiculous.

when you sprinkle the prompt with crucial specifics and relevant context it can flesh out some great documentation tailored for any audience in any style. :wow:

jj23 · Mar 21, 2025

bnew said:
when you sprinkle the prompt with crucial specifics and relevant context it can flesh out some great documentation tailored for any audience in any style.

In some instances yes. In others where a human being would tell you, "I don't know the answer, but let me research it"

It try to cobble together nonsense and output lies. That becomes dangerous if you have no clue what you are doing but are trusting AI to give you the right answer.

The ability to say, I do not know is a skill in itself.

REVEALED: Open A.I. Staff Warn "The progress made on Project Q* has the potential to endanger humanity" (REUTERS)

More options

bnew

Veteran

OpenAI is funding research into 'AI morality' | TechCrunch

OpenAI is funding research into ‘AI morality’

bnew

Veteran

bnew

Veteran

bnew

Veteran

null

...

bnew

Veteran

null

...

bnew

Veteran

null

...

bnew

Veteran

jj23

Veteran

null

...

bnew

Veteran

bnew

Veteran

jj23

Veteran

REVEALED: Open A.I. Staff Warn "The progress made on Project Q* has the potential to endanger humanity" (REUTERS)

Veteran

OpenAI is funding research into ‘AI morality’​

Veteran

Veteran

Veteran

...

Veteran

...

Veteran

...

Veteran

Veteran

...

Veteran

Veteran

Veteran

OpenAI is funding research into ‘AI morality’