Not
Hacker
News
!
Home
Hiring
Products
Companies
Discussion
Q&A
Users
Not
Hacker
News
!
Home
Hiring
Products
Companies
Discussion
Q&A
Users
Home
/
Discussion
/
Deep Learning
Back to Discussion
Deep Learning
Loading...
20 stories
•
24h:
0%
•
7d: 0
•
629 comments
Top contributors:
meetpateltech
gpjt
swatson741
jxmorris12
montyanderson
Stories
Related Stories
20 stories tagged with deep learning
Backpropagation Is a Leaky Abstraction (2016)
353
160 comments
by swatson741
•
24d ago
deep learning
backpropagation
AI education
How Does Gradient Descent Work?
325
24 comments
by jxmorris12
•
1mo ago
gradient descent
deep learning
optimization
Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation
314
114 comments
by montyanderson
•
1mo ago
AI-generated content
multimodal fusion
deep learning
Deepseek-V3.2-Exp
309
50 comments
by meetpateltech
•
1mo ago
AI
LLM
Deep Learning
Writing an LLM From Scratch, Part 22 – Training Our LLM
254
10 comments
by gpjt
•
1mo ago
LLM
AI
Machine Learning
Deep Learning
From Multi-Head to Latent Attention: the Evolution of Attention Mechanisms
174
41 comments
by mgninad
•
2mo ago
attention mechanisms
deep learning
AI research
The Math Behind Gans (2020)
141
31 comments
by sebg
•
3mo ago
GANs
Deep Learning
Generative Models
We Reverse-Engineered Flash Attention 4
134
48 comments
by birdculture
•
2mo ago
Flash Attention 4
GPU optimization
deep learning
Who Invented Deep Residual Learning?
114
35 comments
by timlod
•
1mo ago
deep learning
residual neural networks
credit attribution
I Unified Convolution and Attention Into a Single Framework
80
18 comments
by umjunsik132
•
2mo ago
Deep Learning
Convolution
Attention Mechanism
A Trick for Backpropagation of Linear Transformations
74
6 comments
by tripplyons
•
2mo ago
backpropagation
linear algebra
deep learning
Modern Optimizers – an Alchemist's Notes on Deep Learning
46
7 comments
by maxall4
•
19d ago
deep learning
optimizers
machine learning
Fantastic Pretraining Optimizers and Where to Find Them
42
4 comments
by fzliu
•
2mo ago
deep learning
optimizers
pretraining
Writing an LLM From Scratch, Part 20 – Starting Training, and Cross Entropy Loss
41
3 comments
by gpjt
•
1mo ago
LLM
machine learning
deep learning
Ugmm-Nn: Univariate Gaussian Mixture Model Neural Network
31
12 comments
by zakeria
•
2mo ago
neural networks
probabilistic modeling
deep learning
Torchcomms: a Modern Pytorch Communications API
30
6 comments
by paladin314159
•
1mo ago
PyTorch
deep learning
distributed computing
Deepseek V3.1 Released – Single Model, Thinking and Non-Thinking Modes
26
0 comments
by k_sze
•
3mo ago
artificial intelligence
machine learning
deep learning
Deepseek-V3.1
26
0 comments
by meetpateltech
•
3mo ago
artificial intelligence
large language models
deep learning
Does Anyone Think the Current AI Approach Will Hit a Dead End?
23
58 comments
by rh121
•
2mo ago
AI
deep learning
AGI
Parrot – a C++ Library for Fused Array Operations Using Cuda/thrust
22
2 comments
by operator-name
•
14d ago
C++
CUDA
deep learning
performance optimization
Deep Learning | Trending Topic on Hacker News | Not Hacker News!