DeepSeek-R1 Incentivizes Reasoning in LLMs Through Reinforcement Learning

Posted 4 months ago | Active 4 months ago

rntn

7 points

1 comment

nature.com | Research | story

Key topics

LLMs

Reinforcement Learning

AI Capabilities

The linked Nature article describes DeepSeek-R1, a large language model (LLM) trained with reinforcement learning to improve its reasoning. The single comment in the discussion highlights how such capabilities can be passed down to smaller models and expresses interest in on-device AI.

Snapshot generated from the HN discussion

Discussion Activity

Light discussion: 1 comment, posted in the 7-8h window after the story

Key moments

Story posted: Sep 18, 2025 at 9:10 AM EDT
First comment: Sep 18, 2025 at 4:32 PM EDT (7h after posting)
Peak activity: 1 comment in the 7-8h window
Latest activity: Sep 18, 2025 at 4:32 PM EDT

Discussion (1 comment)

jerojero

4 months ago

I think one of the most interesting parts here, and this is something we've been seeing with other models too, is how these capabilities can be passed down to smaller models to improve them.

Personally, I'm really interested in on-device models. Our phones have gotten pretty good, and I think for a lot of things it should be possible to have these capable but not amazing little ants running around in our phones doing things.

ID: 45289216 | Type: story | Last synced: 11/17/2025, 4:04:27 PM