DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

Unlocking the Power of Python: Why It's Your Ultimate Programming Partner

10
Comments
2 min read
What to use: Parquet or CSV?

8
Comments
3 min read
Creating AI Apps Using RAG & LangChain: A Step-by-Step Developer Guide!

8
Comments
7 min read
The Age of Smart Applications: How AI is Redefining Business Software

6
Comments
3 min read
Common Table Expressions (CTEs) in SQL

3
Comments
1 min read
Train, Dev and Test Sets

2
Comments 1
2 min read
Computer Vision Meetup: Who needs RLHF When You Have SFT? 30:36

2
Comments
1 min read
KAN: Kolmogorov-Arnold Networks

2
Comments
4 min read
Wukong: Towards a Scaling Law for Large-Scale Recommendation

2
Comments 1
3 min read
Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection Attacks

1
Comments
3 min read
Scalable network reconstruction in subquadratic time

1
Comments
3 min read
Visual Enumeration is Challenging for Large-scale Generative AI

1
Comments
5 min read
A Careful Examination of Large Language Model Performance on Grade School Arithmetic

1
Comments
5 min read
Fewer Truncations Improve Language Modeling

1
Comments
4 min read
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

1
Comments
3 min read
Uncovering the Metaverse within Everyday Environments: a Coarse-to-Fine Approach

1
Comments
3 min read
ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing

1
Comments
3 min read
Prompt Design and Engineering: Introduction and Advanced Methods

1
Comments
4 min read
A Simple and Effective Pruning Approach for Large Language Models

1
Comments
3 min read
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity

1
Comments
4 min read
SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking

1
Comments
4 min read
FurniScene: A Large-scale 3D Room Dataset with Intricate Furnishing Scenes

1
Comments
4 min read
Computer Vision Meetup: Develop a Legal Search Application from Scratch using Milvus and DSPy! 19:57

1
Comments
1 min read
GenCast: Diffusion-based ensemble forecasting for medium-range weather

1
Comments
3 min read
6 High-Paying Jobs That Could Make You a Millionaire

1
Comments 1
6 min read
Capabilities of Gemini Models in Medicine

1
Comments
4 min read
BlenderAlchemy: Editing 3D Graphics with Vision-Language Models

1
Comments
4 min read
Where on Earth Do Users Say They Are?: Geo-Entity Linking for Noisy Multilingual User Input

1
Comments
4 min read
Training-free Graph Neural Networks and the Power of Labels as Features

1
Comments
4 min read
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

1
Comments
4 min read
A Primer on the Inner Workings of Transformer-based Language Models

Comments
4 min read
Practical Performance Guarantees for Pipelined DNN Inference

Comments
4 min read
Difference between Data Analysts, Data Scientists, and Data Engineers

Comments 1
1 min read
Anomaly Detection with FiftyOne and Anomalib

Comments
12 min read
🗺️ Neo4J #GraphSummits as data!

Comments 3
1 min read
Releasing LightningChart JS v.5.2

Comments
2 min read
Flexibility Meets Excellence: Online Data Science Course

Comments
3 min read
FLAME: Factuality-Aware Alignment for Large Language Models

Comments
4 min read
Beyond Memorization: Violating Privacy Via Inference with Large Language Models

Comments
3 min read
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding

Comments
4 min read
R-Tuning: Instructing Large Language Models to Say 'I Don't Know'

Comments
4 min read
Circuit Component Reuse Across Tasks in Transformer Language Models

Comments
4 min read
Poisoning Web-Scale Training Datasets is Practical

Comments
3 min read
Are aligned neural networks adversarially aligned?

Comments
4 min read
Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks

Comments
4 min read
Network reconstruction via the minimum description length principle

Comments
3 min read
Porting HPC Applications to AMD Instinct™ MI300A Using Unified Memory and OpenMP

Comments
4 min read
PopulAtion Parameter Averaging (PAPA)

Comments
3 min read
SAR image matching algorithm based on multi-class features

Comments
4 min read
Accelerating ETL Processes for Timely Business Intelligence

Comments
4 min read
Automating Data Processes for Efficiency and Accuracy

Comments
5 min read
AlphaMath Almost Zero: Process Supervision without Process

Comments
4 min read
TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness

Comments
3 min read
Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents

Comments
4 min read
CascadedGaze: Efficiency in Global Context Extraction for Image Restoration

Comments
3 min read
Predicting SSH keys in Open SSH Memory dumps

Comments
4 min read
How to Visualize LiDAR Data

Comments
1 min read
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization

Comments
3 min read
Thousands of AI Authors on the Future of AI

Comments
3 min read
Better & Faster Large Language Models via Multi-token Prediction

Comments
4 min read