AI Models Are Using Material From Retracted Scientific Papers
Posted 3 months ago | Active 3 months ago
technologyreview.com | Tech | story
calm | negative
Debate: 10/100
Key topics
Artificial Intelligence
Scientific Research
Data Integrity
AI models are being trained on data that includes material from retracted scientific papers, which risks spreading misinformation. Commenters are concerned about the implications for research validity.
Snapshot generated from the HN discussion
Discussion Activity
Light discussion
First comment: 26m
Peak period: 1 (0-1h)
Avg / period: 1
Key moments
- 01 Story posted: Sep 23, 2025 at 9:16 AM EDT (3 months ago)
- 02 First comment: Sep 23, 2025 at 9:42 AM EDT (26m after posting)
- 03 Peak activity: 1 comment in 0-1h (hottest window of the conversation)
- 04 Latest activity: Sep 23, 2025 at 9:42 AM EDT (3 months ago)
Discussion (1 comment)
JohnFen
3 months ago
They're probably also being trained on the contents of preprint services like arXiv, which include lots of papers with little or no merit.
ID: 45346597 | Type: story | Last synced: 11/17/2025, 1:09:45 PM