DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Shrink Your LLMs: FAIRY2I Makes Tiny AI a Reality

Shrink Your LLMs: FAIRY2I Makes Tiny AI a Reality

Comments
2 min read
大模型微调:SFT

大模型微调:SFT

Comments
1 min read
The Math Behind Machine Learning & Deep Learning (Explained Simply)

The Math Behind Machine Learning & Deep Learning (Explained Simply)

Comments
3 min read
I Skipped My Birthday to Give Go Its First Real ML Framework

I Skipped My Birthday to Give Go Its First Real ML Framework

Comments
4 min read
Fundamentals of Large Language Models: Understanding LLM Architectures

Fundamentals of Large Language Models: Understanding LLM Architectures

1
Comments
5 min read
Introduction to Computer Vision: Teaching Machines to See

Introduction to Computer Vision: Teaching Machines to See

Comments
3 min read
Neural Network — A Simple, Beginner-Friendly Overview

Neural Network — A Simple, Beginner-Friendly Overview

Comments
3 min read
Introduction to PyTorch: The Deep Learning Framework You Need to Know

Introduction to PyTorch: The Deep Learning Framework You Need to Know

Comments
3 min read
Introduction to Deep Learning: A Complete Beginner’s Guide

Introduction to Deep Learning: A Complete Beginner’s Guide

Comments
3 min read
Why GPUs Ate the AI World

Why GPUs Ate the AI World

Comments
8 min read
BIG STEPS TO TRANSFORMER (PART 1): BUILDING THE BIGRAM

BIG STEPS TO TRANSFORMER (PART 1): BUILDING THE BIGRAM

Comments
13 min read
Unlocking AI's Universal Secrets: Do Neural Networks Think in Fractals?

Unlocking AI's Universal Secrets: Do Neural Networks Think in Fractals?

Comments
2 min read
Unlocking Neural Network Secrets: Scale-Invariant Geometry for Smarter AI by Arvind Sundararajan

Unlocking Neural Network Secrets: Scale-Invariant Geometry for Smarter AI by Arvind Sundararajan

Comments
2 min read
How I Built a 6B Image Model That Runs on a 16GB GPU (Z-Image)

How I Built a 6B Image Model That Runs on a 16GB GPU (Z-Image)

Comments
2 min read
🧑‍🚀 LLM Engine Telemetry: How to Profile Models and See Where Performance is Lost

🧑‍🚀 LLM Engine Telemetry: How to Profile Models and See Where Performance is Lost

Comments
5 min read
How Neural Networks Learn – A Simple Guide to Machine Learning & Deep Learning

How Neural Networks Learn – A Simple Guide to Machine Learning & Deep Learning

Comments
6 min read
Unlocking AI's Inner Geometry: Scale-Agnostic Structures in Neural Networks

Unlocking AI's Inner Geometry: Scale-Agnostic Structures in Neural Networks

Comments
2 min read
The Hidden Geometry of AI: A Scale-Free Secret to Smarter Networks

The Hidden Geometry of AI: A Scale-Free Secret to Smarter Networks

Comments
2 min read
My Model Cheated: How Grad-CAM Exposed a 95% Accuracy Lie

My Model Cheated: How Grad-CAM Exposed a 95% Accuracy Lie

Comments
3 min read
I trained a Robot Arm: What I failed to learn.

I trained a Robot Arm: What I failed to learn.

4
Comments 4
3 min read
Open-Weight AI for High-Quality Image Generation & Editing

Open-Weight AI for High-Quality Image Generation & Editing

Comments
4 min read
Tame Your LLMs: A New Optimizer for Robust Deep Learning

Tame Your LLMs: A New Optimizer for Robust Deep Learning

Comments
2 min read
🚀 How I Cut Deep Learning Training Time by 45% — Without Upgrading Hardware

🚀 How I Cut Deep Learning Training Time by 45% — Without Upgrading Hardware

1
Comments
3 min read
Surgical Precision with AI: A New Era in Lung Cancer Staging

Surgical Precision with AI: A New Era in Lung Cancer Staging

Comments
2 min read
Anon: The Adaptive Optimizer Bridging SGD and Adam for Peak AI Performance

Anon: The Adaptive Optimizer Bridging SGD and Adam for Peak AI Performance

Comments
2 min read
loading...