DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

Mitigating LLM Hallucinations via Conformal Abstention

4 min read
Can't say cant? Measuring and Reasoning of Dark Jargons in Large Language Models

3 min read
The Psychosocial Impacts of Generative AI Harms

4 min read
Generative Multimodal Models are In-Context Learners

4 min read
Assemblage: Automatic Binary Dataset Construction for Machine Learning

4 min read
HCC Is All You Need: Alignment-The Sensible Kind Anyway-Is Just Human-Centered Computing

4 min read
xLSTM: Extended Long Short-Term Memory

3 min read
From ETL to Modern Integration Platforms

4 min read
Top 10 technologies for data engineers.

2 min read
How to Visualize LiDAR Data

1 min read
CascadedGaze: Efficiency in Global Context Extraction for Image Restoration

3 min read
Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents

4 min read
TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness

3 min read
AlphaMath Almost Zero: Process Supervision without Process

4 min read
Creating AI Apps Using RAG & LangChain: A Step-by-Step Developer Guide!

7 min read
Automating Data Processes for Efficiency and Accuracy

5 min read
Accelerating ETL Processes for Timely Business Intelligence

4 min read
FurniScene: A Large-scale 3D Room Dataset with Intricate Furnishing Scenes

4 min read
SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking

4 min read
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity

4 min read
A Simple and Effective Pruning Approach for Large Language Models

3 min read
Prompt Design and Engineering: Introduction and Advanced Methods

4 min read
ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing

3 min read
SAR image matching algorithm based on multi-class features

4 min read
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

3 min read
PopulAtion Parameter Averaging (PAPA)

3 min read
Porting HPC Applications to AMD Instinct™ MI300A Using Unified Memory and OpenMP

4 min read
Network reconstruction via the minimum description length principle

3 min read
Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks

4 min read
Are aligned neural networks adversarially aligned?

4 min read
Poisoning Web-Scale Training Datasets is Practical

3 min read
Circuit Component Reuse Across Tasks in Transformer Language Models

4 min read
R-Tuning: Instructing Large Language Models to Say 'I Don't Know'

4 min read
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding

4 min read
Beyond Memorization: Violating Privacy Via Inference with Large Language Models

3 min read
FLAME: Factuality-Aware Alignment for Large Language Models

4 min read
What to use: Parquet or CSV?

3 min read
Flexibility Meets Excellence: Online Data Science Course

3 min read
Releasing LightningChart JS v.5.2

2 min read
🗺️ Neo4J #GraphSummits as data!

1 min read
The Age of Smart Applications: How AI is Redefining Business Software

3 min read
Anomaly Detection with FiftyOne and Anomalib

12 min read
Difference between Data Analysts, Data Scientists, and Data Engineers

1 min read
Wukong: Towards a Scaling Law for Large-Scale Recommendation

3 min read
Practical Performance Guarantees for Pipelined DNN Inference

4 min read
A Primer on the Inner Workings of Transformer-based Language Models

4 min read
Lazy Layers to Make Fine-Tuned Diffusion Models More Traceable

4 min read
Wisdom of the Silicon Crowd: LLM Ensemble Prediction Capabilities Rival Human Crowd Accuracy

4 min read
Mentions of Prejudice in News Media -- An International Comparison

4 min read
Visual Enumeration is Challenging for Large-scale Generative AI

5 min read
Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting

4 min read
CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions

3 min read
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

4 min read
The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

4 min read
Streamlining Image Editing with Layered Diffusion Brushes

5 min read
RGB↔X: Image decomposition and synthesis using material- and lighting-aware diffusion models

3 min read
6 High-Paying Jobs That Could Make You a Millionaire

6 min read
On Premise Face Recognition SDK and Liveness Detection SDK by FacePlugin

1 min read
Common Table Expressions (CTEs) in SQL

1 min read
Unlocking the Power of Python: Why It's Your Ultimate Programming Partner

2 min read