Not
Hacker
News
!
Home
Hiring
Products
Companies
Discussion
Q&A
Users
Web Scraping | Trending Topic on Hacker News | Not Hacker News!
Not
Hacker
News
!
Home
Hiring
Products
Companies
Discussion
Q&A
Users
Home
/
Discussion
/
Web Scraping
Back to Discussion
Web Scraping
Loading...
20 stories
•
24h:
0%
•
7d: 0
•
3,006 comments
Top contributors:
taviso
classichasclass
chmaynard
ColinWright
HermanMartinus
Stories
Related Stories
20 stories tagged with web scraping
Why Are Anime Catgirls Blocking My Access to the Linux Kernel?
839
908 comments
by taviso
•
3mo ago
captcha
AI
web scraping
security
Ban Me at the IP Level If You Don't Like Me
599
485 comments
by classichasclass
•
3mo ago
web scraping
bot traffic
IP blocking
Feed the Bots
305
203 comments
by chmaynard
•
1mo ago
AI
web scraping
information pollution
AI Scrapers Request Commented Scripts
266
221 comments
by ColinWright
•
26d ago
AI
web scraping
LLMs
Messing with Scraper Bots
244
84 comments
by HermanMartinus
•
11d ago
web scraping
bot detection
cybersecurity
AI Crawlers, Fetchers Are Blowing Up Websites; Meta, Openai Are Worst Offenders
231
144 comments
by rntn
•
3mo ago
AI crawlers
web scraping
data privacy
Aggressive Bots Ruined My Weekend
209
104 comments
by shaunpud
•
28d ago
web scraping
bot traffic
cybersecurity
How I Block All 26m of Your Curl Requests
208
70 comments
by foxmoss
•
1mo ago
web scraping
security
networking
Blocking LLM Crawlers Without Javascript
197
100 comments
by todsacerdoti
•
11d ago
LLM crawlers
web scraping
robot.txt
You Don't Need Anubis
177
170 comments
by flexagoon
•
25d ago
web scraping
bot protection
AI ethics
Feedmaker: URL + CSS Selectors = Rss Feed
173
30 comments
by mustaphah
•
2mo ago
RSS feeds
web scraping
CSS selectors
Asking AI to Build Scrapers Should Be Easy Right?
150
82 comments
by suchintan
•
1mo ago
AI
web scraping
automation
A Stateful Browser Agent Using Self-Healing Dom Maps
120
58 comments
by shardullavekar
•
1mo ago
browser automation
AI agents
web scraping
Poisoning Well
118
127 comments
by wonger_
•
2mo ago
LLMs
AI ethics
web scraping
Webhound (yc S23) – Research Agent That Builds Datasets From the Web
112
80 comments
by mfkhalil
•
2mo ago
AI
data extraction
web scraping
You Can't Parse XML with Regex. Let's Do It Anyways
90
90 comments
by birdculture
•
1mo ago
XML parsing
regular expressions
web scraping
Simplex (yc S24) – Browser Automation Platform for Developers
54
28 comments
by marcon680
•
1mo ago
browser automation
web scraping
AI agents
Feed Me Up, Scotty – Custom Rss Feed Generation Using CSS Selectors
53
8 comments
by diymaker
•
1mo ago
RSS feeds
web scraping
CSS selectors
Amazon Sends Legal Threats to Perplexity Over Agentic Browsing
28
7 comments
by erhuve
•
22d ago
AI
web scraping
intellectual property
The Accidental Click That Changed Everything: the Apify Origin Story
24
7 comments
by mooreds
•
18d ago
startup story
entrepreneurship
web scraping