DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Mastering SQL Queries: A Comprehensive Guide for Beginners

Mastering SQL Queries: A Comprehensive Guide for Beginners

25
Comments 5
2 min read
A Crescente Demanda por Profissionais com Habilidades em IA e Machine Learning

A Crescente Demanda por Profissionais com Habilidades em IA e Machine Learning

23
Comments 3
2 min read
5 Ways to Celebrate Earth Day as a Developer 🌎🌏🌍

5 Ways to Celebrate Earth Day as a Developer 🌎🌏🌍

15
Comments 4
4 min read
Creating AI Apps Using RAG & LangChain: A Step-by-Step Developer Guide!

Creating AI Apps Using RAG & LangChain: A Step-by-Step Developer Guide!

13
Comments
7 min read
PIGEON: Predicting Image Geolocations

PIGEON: Predicting Image Geolocations

10
Comments
4 min read
Unlocking the Power of Python: Why It's Your Ultimate Programming Partner

Unlocking the Power of Python: Why It's Your Ultimate Programming Partner

10
Comments
2 min read
What to use parquet or CSV?

What to use parquet or CSV?

9
Comments
3 min read
Top 10 Common Data Engineers and Scientists Pain Points in 2024

Top 10 Common Data Engineers and Scientists Pain Points in 2024

9
Comments
5 min read
Day 1 of 30 : Machine Learning

Day 1 of 30 : Machine Learning

8
Comments 6
2 min read
Algorithmic Collective Action in Recommender Systems: Promoting Songs by Reordering Playlists

Algorithmic Collective Action in Recommender Systems: Promoting Songs by Reordering Playlists

6
Comments
4 min read
CodecLM: Aligning Language Models with Tailored Synthetic Data

CodecLM: Aligning Language Models with Tailored Synthetic Data

6
Comments
4 min read
is Hadoop Dead?

is Hadoop Dead?

6
Comments
3 min read
The Age of Smart Applications: How AI is Redefining Business Software

The Age of Smart Applications: How AI is Redefining Business Software

6
Comments
3 min read
Boost Your Code's Efficiency: Introducing Semantic Cache with Qdrant

Boost Your Code's Efficiency: Introducing Semantic Cache with Qdrant

6
Comments
3 min read
Show Your Work with Confidence: Confidence Bands for Tuning Curves

Show Your Work with Confidence: Confidence Bands for Tuning Curves

6
Comments
4 min read
GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications

GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications

6
Comments
4 min read
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

5
Comments
3 min read
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

5
Comments
3 min read
SonicVisionLM: Playing Sound with Vision Language Models

SonicVisionLM: Playing Sound with Vision Language Models

5
Comments
4 min read
Efficient Quantum Circuit Design with a Standard Cell Approach, with an Application to Neutral Atom Quantum Computers

Efficient Quantum Circuit Design with a Standard Cell Approach, with an Application to Neutral Atom Quantum Computers

5
Comments
4 min read
Vision Transformers Need Registers

Vision Transformers Need Registers

5
Comments
4 min read
ChatGPT Can Predict the Future when it Tells Stories Set in the Future About the Past

ChatGPT Can Predict the Future when it Tells Stories Set in the Future About the Past

5
Comments
4 min read
The Impact of Depth on Compositional Generalization in Transformer Language Models

The Impact of Depth on Compositional Generalization in Transformer Language Models

5
Comments
4 min read
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

5
Comments
4 min read
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

5
Comments
4 min read
Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models

Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models

5
Comments
4 min read
BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation

BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation

5
Comments
4 min read
Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

5
Comments
3 min read
The Expressive Power of Transformers with Chain of Thought

The Expressive Power of Transformers with Chain of Thought

5
Comments
4 min read
Advancing LLM Reasoning Generalists with Preference Trees

Advancing LLM Reasoning Generalists with Preference Trees

5
Comments
4 min read
GenN2N: Generative NeRF2NeRF Translation

GenN2N: Generative NeRF2NeRF Translation

5
Comments
3 min read
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning

MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning

5
Comments
3 min read
PersonaLLM: Investigating the Ability of Large Language Models to Express Personality Traits

PersonaLLM: Investigating the Ability of Large Language Models to Express Personality Traits

5
Comments
5 min read
Active Liveness Detection vs Passive Liveness Detection

Active Liveness Detection vs Passive Liveness Detection

5
Comments
1 min read
Generalization in diffusion models arises from geometry-adaptive harmonic representations

Generalization in diffusion models arises from geometry-adaptive harmonic representations

5
Comments
4 min read
Impossible Distillation: from Low-Quality Model to High-Quality Dataset & Model for Summarization and Paraphrasing

Impossible Distillation: from Low-Quality Model to High-Quality Dataset & Model for Summarization and Paraphrasing

5
Comments
3 min read
Characterization of Large Language Model Development in the Datacenter

Characterization of Large Language Model Development in the Datacenter

5
Comments
4 min read
SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes

SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes

5
Comments
4 min read
Rho-1: Not All Tokens Are What You Need

Rho-1: Not All Tokens Are What You Need

5
Comments
4 min read
Visualizing Multi-Dimensional Data in Action: Vehicle Ownership

Visualizing Multi-Dimensional Data in Action: Vehicle Ownership

5
Comments
2 min read
ShapeFusion: A 3D diffusion model for localized shape editing

ShapeFusion: A 3D diffusion model for localized shape editing

5
Comments
6 min read
Chapter: Vulnerability of Quantum Information Systems to Collective Manipulation

Chapter: Vulnerability of Quantum Information Systems to Collective Manipulation

5
Comments
4 min read
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

5
Comments
4 min read
AIJack: Let's Hijack AI! Security and Privacy Risk Simulator for Machine Learning

AIJack: Let's Hijack AI! Security and Privacy Risk Simulator for Machine Learning

5
Comments
4 min read
Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

5
Comments
4 min read
JetMoE: Reaching Llama2 Performance with 0.1M Dollars

JetMoE: Reaching Llama2 Performance with 0.1M Dollars

4
Comments
4 min read
Basic Terms in Machine Learning (Model Training)

Basic Terms in Machine Learning (Model Training)

4
Comments
4 min read
Unlock Efficiency with ID Document Recognition: 8 Hassle-Free Validation Techniques

Unlock Efficiency with ID Document Recognition: 8 Hassle-Free Validation Techniques

4
Comments
1 min read
LLMs are secretly good at regression calculations

LLMs are secretly good at regression calculations

4
Comments
9 min read
Born With a Silver Spoon? Investigating Socioeconomic Bias in Large Language Models

Born With a Silver Spoon? Investigating Socioeconomic Bias in Large Language Models

3
Comments
4 min read
Pixel is a Barrier: Diffusion Models Are More Adversarially Robust Than We Think

Pixel is a Barrier: Diffusion Models Are More Adversarially Robust Than We Think

3
Comments
4 min read
Common Table Expressions (CTEs) in SQL

Common Table Expressions (CTEs) in SQL

3
Comments
1 min read
Bot or Human? Detecting ChatGPT Imposters with A Single Question

Bot or Human? Detecting ChatGPT Imposters with A Single Question

3
Comments
4 min read
👭 Women suffrage dates (suffragettes) celebration w/ data 🗳️

👭 Women suffrage dates (suffragettes) celebration w/ data 🗳️

3
Comments 4
1 min read
Independence of Errors: A Guide to Validating Linear Regression Assumptions

Independence of Errors: A Guide to Validating Linear Regression Assumptions

3
Comments
3 min read
Ten Hard Problems in Artificial Intelligence We Must Get Right

Ten Hard Problems in Artificial Intelligence We Must Get Right

2
Comments 3
4 min read
Train, Dev and Test Sets

Train, Dev and Test Sets

2
Comments 1
2 min read
Information Retrieval with Entity Linking

Information Retrieval with Entity Linking

2
Comments
4 min read
What is SQL in picture

What is SQL in picture

2
Comments 2
1 min read
KAN: Kolmogorov-Arnold Networks

KAN: Kolmogorov-Arnold Networks

2
Comments
4 min read
loading...