DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Flash Attention: what it does and why it matters

Flash Attention: what it does and why it matters

Comments
8 min read
Flash Attention: what it does and why it matters

Flash Attention: what it does and why it matters

Comments
8 min read
SFT Offline RL Online RL: The Three-Stage Training Pipeline Behind Mano-P

SFT Offline RL Online RL: The Three-Stage Training Pipeline Behind Mano-P

1
Comments
8 min read
A11: A Structured Way to Not Lie to Yourself During Reasoning

A11: A Structured Way to Not Lie to Yourself During Reasoning

Comments
3 min read
Day 8 — Beginning My Journey into Neural Networks

Day 8 — Beginning My Journey into Neural Networks

Comments
1 min read
Deep Learning Is More Logistic Regression Than You Think

Deep Learning Is More Logistic Regression Than You Think

Comments
4 min read
Better Data Beats Better Algorithms: Before Changing the Model, Change the Data

Better Data Beats Better Algorithms: Before Changing the Model, Change the Data

Comments
3 min read
Understanding Attention in Transformers — Intuition Before Equations

Understanding Attention in Transformers — Intuition Before Equations

Comments
3 min read
PyTorch from Scratch — Part 1: Tensors, Gradients & Activations

PyTorch from Scratch — Part 1: Tensors, Gradients & Activations

1
Comments
5 min read
What Does a Product Data Scientist Actually Do?

What Does a Product Data Scientist Actually Do?

Comments
2 min read
A11: A Structural Answer to AI Collapse

A11: A Structural Answer to AI Collapse

Comments
3 min read
Gemma 4 12B shows how far local multimodal AI has moved

Gemma 4 12B shows how far local multimodal AI has moved

Comments
5 min read
NVIDIA Cosmos 3: How a Two-Tower Architecture Unifies Physical AI Reasoning and Generation

NVIDIA Cosmos 3: How a Two-Tower Architecture Unifies Physical AI Reasoning and Generation

1
Comments
5 min read
Building Medical AI for the Other 90%: A Field Report from a Solo Developer

Building Medical AI for the Other 90%: A Field Report from a Solo Developer

Comments
5 min read
NVIDIA Cosmos 3: Unifying Physical AI Reasoning and Generation with Two-Tower Architecture

NVIDIA Cosmos 3: Unifying Physical AI Reasoning and Generation with Two-Tower Architecture

1
Comments
5 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.