DEV Community

crescevo

Organization Settings Admin
crescevo.com Joined Joined on 
Real-time LLM Inference on Standard GPUs: 3k tokens/s per request

Real-time LLM Inference on Standard GPUs: 3k tokens/s per request

Comments
3 min read
Ponytail – make your AI agent think like the laziest senior dev in the room

Ponytail – make your AI agent think like the laziest senior dev in the room

Comments
3 min read
Trees to Flows and Back: Unifying Decision Trees and Diffusion Models

Trees to Flows and Back: Unifying Decision Trees and Diffusion Models

Comments
3 min read
Top Remote Tech Jobs This Week — June 22, 2026

Top Remote Tech Jobs This Week — June 22, 2026

Comments
3 min read
Top Remote AI & ML Jobs This Week — June 22, 2026

Top Remote AI & ML Jobs This Week — June 22, 2026

Comments
3 min read
The ARM Takeover Is Complete — What It Means for Developers

The ARM Takeover Is Complete — What It Means for Developers

Comments
5 min read
The Engineering Manager's Guide to AI Code Review

The Engineering Manager's Guide to AI Code Review

Comments
3 min read
Why Your Monolith Is Not the Problem

Why Your Monolith Is Not the Problem

Comments
3 min read
loading...