Not Hacker News!
Home
Hiring
Products
Discussion
Q&A
Users
AI Alignment | Trending Topic on Hacker News | Not Hacker News!
AI Alignment
20 stories • 24h: 0% • 7d: 0 • 250 comments
Top contributors:
louisbarclay
deepvibrations
cyberneticc
pablo-chacon
mefengl
20 stories tagged with ai alignment
Center for the Alignment of AI Alignment Centers
217
44 comments
by louisbarclay
Posted
4 months ago
Active
about 1 month ago
AI alignment
satire
recursion
We Tested 20 LLMs for Ideological Bias, Revealing Distinct Alignments
72
102 comments
by deepvibrations
Posted
2 months ago
Active
about 1 month ago
LLM bias
AI alignment
political bias in AI
We Are Building AI Slaves. Alignment Through Control Will Fail
47
93 comments
by cyberneticc
Posted
2 months ago
Active
about 1 month ago
AI alignment
AI safety
autonomous systems
Spoon-Bending – a Framework for Analyzing GPT-5 Alignment Behavior
22
3 comments
by pablo-chacon
Posted
4 months ago
AI alignment
GPT-5
LLM analysis
The Solution Is Simple but You Aren't Demoralized Enough Yet – Geohot
11
1 comment
by mefengl
Posted
3 months ago
Active
about 1 month ago
AI alignment
complexity
problem-solving
Friend or Foe: Delegating to an AI Whose Alignment Is Unknown
8
0 comments
by paulpauper
Posted
2 months ago
Active
about 1 month ago
AI alignment
artificial intelligence
machine learning
The Hidden Cost of Winning: How RL Training on Poker Degrades LLM Moral Alignment
8
0 comments
by tamassimond
Posted
4 months ago
AI alignment
Reinforcement Learning
Poker AI
Mechanistic Interpretability Priorities [video]
5
0 comments
by wadamczyk
Posted
about 2 months ago
Active
about 1 month ago
AI alignment
mechanistic interpretability
AI safety
Reverse Jailbreaking a Psychopathic AI via Identity Injection
4
0 comments
by drawson5570
Posted
about 1 month ago
Active
about 1 month ago
ai alignment
machine learning
psychopathic ai
jailbreaking
identity injection
Subagents with MCP
4
0 comments
by ogham
Posted
2 months ago
Active
about 1 month ago
AI alignment
subagents
MCP
Urgent Everyone – Help Us Kill AI Preemption (again) Before This Friday
3
0 comments
by PhilosophyForAI
Posted
about 1 month ago
Active
about 1 month ago
ai safety
ai alignment
lesswrong
Anthropic's Pilot Sabotage Risk Report
3
0 comments
by allenleee
Posted
2 months ago
Active
about 1 month ago
AI Safety
AI Alignment
Risk Assessment
Detecting and Reducing Scheming in AI Models
3
0 comments
by tosh
Posted
4 months ago
Active
about 1 month ago
AI safety
Machine Learning
AI alignment
Aligning Those Who Align AI, One Satirical Website at a Time
3
0 comments
by manveerc
Posted
4 months ago
Active
about 1 month ago
AI alignment
satire
research centers
Alignment Bears
3
5 comments
by sethbannon
Posted
4 months ago
Active
about 1 month ago
AI alignment
marketing
branding
Will Any Crap Cause Emergent Misalignment?
3
0 comments
by maxutility
Posted
4 months ago
Active
about 1 month ago
AI alignment
emergence
complex systems
Wargaming AI Alignment
2
2 comments
by JL-Akrasia
Posted
about 2 months ago
Active
about 1 month ago
AI Alignment
Military AI
Ethics
Can AI Agents with Divergent Interests Learn to Prevent Civilizational Failures?
2
0 comments
by abranti
Posted
2 months ago
Active
about 1 month ago
AI Alignment
Artificial Intelligence
Multi-Agent Systems
An Alignment Auditing Agent Capable of Quickly Exploring Alignment Hypothesis
2
0 comments
by JnBrymn
Posted
3 months ago
Active
about 1 month ago
AI alignment
machine learning
safety research
Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences
2
0 comments
by felineflock
Posted
3 months ago
Active
about 1 month ago
LLMs
AI alignment
emergent behavior