AI Models Are Using Material From Retracted Scientific Papers

Posted 3 months ago | Active 3 months ago

gnabgib

3 points

1 comment

technologyreview.com | Tech | story

calm | negative

Debate

10/100

Key topics

Artificial Intelligence

Scientific Research

Data Integrity

AI models are being trained on data that includes material from retracted scientific papers, which risks spreading misinformation. The community is concerned about the implications for research validity.

Snapshot generated from the HN discussion

Discussion Activity

Light discussion

First comment: 26m

Peak period: 0-1h

Avg / period

Key moments

01 Story posted
Sep 23, 2025 at 9:16 AM EDT
3 months ago
02 First comment
Sep 23, 2025 at 9:42 AM EDT
26m after posting
03 Peak activity
1 comment in 0-1h
Hottest window of the conversation
04 Latest activity
Sep 23, 2025 at 9:42 AM EDT
3 months ago

Discussion (1 comment)

JohnFen

3 months ago

They're probably also being trained on the contents of preprint services like arXiv, which include lots of papers with little or no merit.

ID: 45346597 | Type: story | Last synced: 11/17/2025, 1:09:45 PM
