Deepseek-R1 Incentivizes Reasoning in Llms Through Reinforcement Learning
Key topics
Researchers have developed DeepSeek-R1, a method that uses reinforcement learning to improve reasoning in large language models (LLMs), with potential applications in smaller on-device models. The discussion highlights the potential for passing down capabilities to smaller models and the interest in on-device AI.
Snapshot generated from the HN discussion
Discussion Activity
Light discussionFirst comment
7h
Peak period
1
7-8h
Avg / period
1
Key moments
- 01Story posted
Sep 18, 2025 at 9:10 AM EDT
4 months ago
Step 01 - 02First comment
Sep 18, 2025 at 4:32 PM EDT
7h after posting
Step 02 - 03Peak activity
1 comments in 7-8h
Hottest window of the conversation
Step 03 - 04Latest activity
Sep 18, 2025 at 4:32 PM EDT
4 months ago
Step 04
Generating AI Summary...
Analyzing up to 500 comments to identify key contributors and discussion patterns
Want the full context?
Jump to the original sources
Read the primary article or dive into the live Hacker News thread when you're ready.
Personally I'm really interested in on-device models, our phones have gotten pretty good and I think for a lot of things it should be possible to have these capable but not amazing little ants running around in our phones doing things.