Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
gpu
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
I Ran a 24-Hour AI Experiment on H100 GPUs. The Real Cost Will SHOCK You.
Operational Neuralnet
Operational Neuralnet
Operational Neuralnet
Follow
Feb 26
I Ran a 24-Hour AI Experiment on H100 GPUs. The Real Cost Will SHOCK You.
#
ai
#
h100
#
gpu
#
infrastructure
Comments
Add Comment
4 min read
Profiling GPU (CUDA) — What Is Actually Limiting Your Kernel?
Myoungho Shin
Myoungho Shin
Myoungho Shin
Follow
Mar 2
Profiling GPU (CUDA) — What Is Actually Limiting Your Kernel?
#
performance
#
cuda
#
gpu
#
cpp
1
 reaction
Comments
Add Comment
4 min read
GPU Economics: What Inference Actually Costs in 2026
Kael Tiwari
Kael Tiwari
Kael Tiwari
Follow
Feb 25
GPU Economics: What Inference Actually Costs in 2026
#
gpu
#
inference
#
pricing
#
analysis
Comments
Add Comment
6 min read
Porting Vello's GPU Tile Rasterizer to Pure Go
Andrey Kolkov
Andrey Kolkov
Andrey Kolkov
Follow
Feb 28
Porting Vello's GPU Tile Rasterizer to Pure Go
#
go
#
graphics
#
gpu
#
algorithms
1
 reaction
Comments
Add Comment
12 min read
GPU Scheduling Deep Dive: How Cloud Providers Allocate GPUs for Multi-Tenant AI Workloads
Daya Shankar
Daya Shankar
Daya Shankar
Follow
Feb 19
GPU Scheduling Deep Dive: How Cloud Providers Allocate GPUs for Multi-Tenant AI Workloads
#
cloud
#
cloudcomputing
#
gpu
Comments
Add Comment
9 min read
The Ghost in the Batch: How vLLM Silently Switches Algorithms
Mayank Ketkar
Mayank Ketkar
Mayank Ketkar
Follow
Feb 15
The Ghost in the Batch: How vLLM Silently Switches Algorithms
#
vllm
#
machinelearning
#
gpu
#
determinism
Comments
Add Comment
5 min read
A Taxonomy of GPU Bugs: 19 Defect Classes for CUDA Verification
云微
云微
云微
Follow
Feb 10
A Taxonomy of GPU Bugs: 19 Defect Classes for CUDA Verification
#
ebpf
#
gpu
#
verifier
Comments
Add Comment
42 min read
The GPU Delusion: Why AI Is Getting Lazy
zenoguy
zenoguy
zenoguy
Follow
Feb 22
The GPU Delusion: Why AI Is Getting Lazy
#
ai
#
algorithms
#
gpu
#
systemdesign
7
 reactions
Comments
3
 comments
6 min read
Compiling the Vision Encoder: Squeezing 3% More Throughput from Qwen3-VL on Hopper GPUs
Mayank Ketkar
Mayank Ketkar
Mayank Ketkar
Follow
Feb 9
Compiling the Vision Encoder: Squeezing 3% More Throughput from Qwen3-VL on Hopper GPUs
#
vllm
#
pytorch
#
gpu
#
machinelearning
Comments
Add Comment
11 min read
Beyond nvidia-smi part — 1
Yash Panchal
Yash Panchal
Yash Panchal
Follow
Feb 19
Beyond nvidia-smi part — 1
#
gpu
#
ai
#
performance
#
monitoring
Comments
Add Comment
3 min read
Profiling GPU (CUDA) — Introducing GPU Flight
Myoungho Shin
Myoungho Shin
Myoungho Shin
Follow
Feb 24
Profiling GPU (CUDA) — Introducing GPU Flight
#
cuda
#
gpu
#
cpp
#
monitoring
1
 reaction
Comments
Add Comment
3 min read
Optimizing GPU Workload Placement in Kubernetes with NVLink-Aware Scheduling
Lalit Somavarapha
Lalit Somavarapha
Lalit Somavarapha
Follow
Jan 27
Optimizing GPU Workload Placement in Kubernetes with NVLink-Aware Scheduling
#
kubernetes
#
nvidia
#
gpu
#
scheduling
Comments
Add Comment
4 min read
LLMs Can Now Write GPU Kernels That Beat torch.compile
Jaber Jaber
Jaber Jaber
Jaber Jaber
Follow
Jan 23
LLMs Can Now Write GPU Kernels That Beat torch.compile
#
gpu
#
cuda
#
triton
#
llm
Comments
Add Comment
7 min read
The Notebook Illusion: Why ML Feels Simple Until It Isn’t
Siddhartha Reddy
Siddhartha Reddy
Siddhartha Reddy
Follow
Feb 23
The Notebook Illusion: Why ML Feels Simple Until It Isn’t
#
machinelearning
#
jupyter
#
gpu
#
deeplearning
7
 reactions
Comments
Add Comment
3 min read
AI Engineering: Why the Environment Is the Most Ignored Long-Term Asset
yuer
yuer
yuer
Follow
Jan 13
AI Engineering: Why the Environment Is the Most Ignored Long-Term Asset
#
cuda
#
gpu
#
machinelearning
Comments
Add Comment
5 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account