DEV Community

# computervision

Posts

Seeing in the Dark: Unveiling Hidden Details with Adaptive Image Processing

Comments
2 min read
Let’s unlock Synthetic Presence with SadTalker in Google Colab And Bring Images to Life

Comments
5 min read
See to Do: Teaching Robots to Handle the Real World by Arvind Sundararajan

1
Comments
2 min read
From Pixel to Perfection: Instant 3D Models from Single Images by Arvind Sundararajan

1
Comments
2 min read
Unlock the Secrets of Unlabeled Videos: A Deep Dive into Zero-Effort AI Training

1
Comments
2 min read
Forget Labels: AI Learns Continuously From Raw Video (and It's a Game Changer)

1
Comments
2 min read
Vision Transform

Comments
16 min read
Challenges to adapt AI-based Video Codecs

1
Comments 1
5 min read
Tech in Roofing: Drones, CV, and LLMs that ship better inspections tags: ai, drones, computervision, construction, casestudy

Comments
2 min read
Optimizing Multi-Zone Restaurant Service with Computer Vision for Hospitality

Comments
9 min read
From SageMaker to Static Site: Hosting a Deep Learning Model on the Frontend

Comments
4 min read
How I Built an AI-Powered Face Recognition App from Scratch

Comments
1 min read
Building a Diffusion Model from Scratch: CIFAR-10 in 15 Minutes

Comments
5 min read
Smart Stable Monitoring System for Premium Remote Horse Care

1
Comments
9 min read
[memo] SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning

Comments
1 min read
Frontiers in Computer Vision: Foundation Models, Multimodal Learning, Robustness, and Privacy from the July 2025 arXiv H

Comments
7 min read
Building a Motion Tracking Balloon Burst Game with Python & OpenCV

Comments
3 min read
Two Face Recognition Projects Failed. $33K Burned — All Because of Bad Camera Setup

Comments
3 min read
Does DINO loss compare the [CLS] tokens from both teacher and student?

Comments
1 min read
Modular Snip Recorder: A Data Collection Tool for Behavior Cloning (1/2)

Comments
5 min read
[memo] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Comments
1 min read
Building a Deep Learning Model to Detect Potato Diseases: My Journey with PlantVillage.

2
Comments 1
3 min read
How Do NLP and Computer Vision Work Together in Modern AI Applications?

Comments
4 min read
Inside the Research: A Detailed Technical Breakdown of SQD in Quantum Chemistry

Comments
4 min read
VideoPrism: A Foundational Visual Encoder for Video Understanding

Comments
1 min read