DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Creating AI Apps Using RAG & LangChain: A Step-by-Step Developer Guide!

Creating AI Apps Using RAG & LangChain: A Step-by-Step Developer Guide!

13
Comments
7 min read
Unlocking the Power of Python: Why It's Your Ultimate Programming Partner

Unlocking the Power of Python: Why It's Your Ultimate Programming Partner

10
Comments
2 min read
What to use parquet or CSV?

What to use parquet or CSV?

8
Comments
3 min read
The Age of Smart Applications: How AI is Redefining Business Software

The Age of Smart Applications: How AI is Redefining Business Software

6
Comments
3 min read
is Hadoop Dead?

is Hadoop Dead?

5
Comments
3 min read
Common Table Expressions (CTEs) in SQL

Common Table Expressions (CTEs) in SQL

3
Comments
1 min read
Wukong: Towards a Scaling Law for Large-Scale Recommendation

Wukong: Towards a Scaling Law for Large-Scale Recommendation

2
Comments 1
3 min read
Train, Dev and Test Sets

Train, Dev and Test Sets

2
Comments 1
2 min read
6 High-Paying Jobs That Could Make You a Millionaire

6 High-Paying Jobs That Could Make You a Millionaire

1
Comments 1
6 min read
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

1
Comments
4 min read
Visual Enumeration is Challenging for Large-scale Generative AI

Visual Enumeration is Challenging for Large-scale Generative AI

1
Comments
5 min read
Anomaly Detection with FiftyOne and Anomalib

Anomaly Detection with FiftyOne and Anomalib

1
Comments
12 min read
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

1
Comments
3 min read
ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing

ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing

1
Comments
3 min read
Prompt Design and Engineering: Introduction and Advanced Methods

Prompt Design and Engineering: Introduction and Advanced Methods

1
Comments
4 min read
A Simple and Effective Pruning Approach for Large Language Models

A Simple and Effective Pruning Approach for Large Language Models

1
Comments
3 min read
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity

Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity

1
Comments
4 min read
SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking

SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking

1
Comments
4 min read
FurniScene: A Large-scale 3D Room Dataset with Intricate Furnishing Scenes

FurniScene: A Large-scale 3D Room Dataset with Intricate Furnishing Scenes

1
Comments
4 min read
Top 10 technologies for data engineers.

Top 10 technologies for data engineers.

1
Comments
2 min read
Recapping the AI, Machine Learning and Data Science Meetup — May 8, 2024

Recapping the AI, Machine Learning and Data Science Meetup — May 8, 2024

1
Comments
9 min read
Practical Performance Guarantees for Pipelined DNN Inference

Practical Performance Guarantees for Pipelined DNN Inference

Comments
4 min read
Mastering Data Exploration with Tidyverse: A Beginner-Friendly Guide

Mastering Data Exploration with Tidyverse: A Beginner-Friendly Guide

Comments
7 min read
Difference between Data Analysts, Data Scientists, and Data Engineers

Difference between Data Analysts, Data Scientists, and Data Engineers

Comments 1
1 min read
History of Math & Machine Learning

History of Math & Machine Learning

Comments
8 min read
Introducing Tapyr: Create and Deploy Enterprise-Ready PyShiny Dashboards with Ease

Introducing Tapyr: Create and Deploy Enterprise-Ready PyShiny Dashboards with Ease

Comments
5 min read
🗺️ Neo4J #GraphSummits as data!

🗺️ Neo4J #GraphSummits as data!

Comments 3
1 min read
9 months of Machine Learning and beyond: before I've started

9 months of Machine Learning and beyond: before I've started

Comments
5 min read
Releasing LightningChart JS v.5.2

Releasing LightningChart JS v.5.2

Comments
2 min read
Flexibility Meets Excellence: Online Data Science Course

Flexibility Meets Excellence: Online Data Science Course

Comments
3 min read
Exploring SQL Functions: Harnessing the Power of Built-in Functions

Exploring SQL Functions: Harnessing the Power of Built-in Functions

Comments
3 min read
FLAME: Factuality-Aware Alignment for Large Language Models

FLAME: Factuality-Aware Alignment for Large Language Models

Comments
4 min read
Beyond Memorization: Violating Privacy Via Inference with Large Language Models

Beyond Memorization: Violating Privacy Via Inference with Large Language Models

Comments
3 min read
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding

FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding

Comments
4 min read
R-Tuning: Instructing Large Language Models to Say `I Don't Know'

R-Tuning: Instructing Large Language Models to Say `I Don't Know'

Comments
4 min read
Circuit Component Reuse Across Tasks in Transformer Language Models

Circuit Component Reuse Across Tasks in Transformer Language Models

Comments
4 min read
Poisoning Web-Scale Training Datasets is Practical

Poisoning Web-Scale Training Datasets is Practical

Comments
3 min read
Are aligned neural networks adversarially aligned?

Are aligned neural networks adversarially aligned?

Comments
4 min read
Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks

Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks

Comments
4 min read
Network reconstruction via the minimum description length principle

Network reconstruction via the minimum description length principle

Comments
3 min read
Porting HPC Applications to AMD Instinct$^text{TM}$ MI300A Using Unified Memory and OpenMP

Porting HPC Applications to AMD Instinct$^text{TM}$ MI300A Using Unified Memory and OpenMP

Comments
4 min read
PopulAtion Parameter Averaging (PAPA)

PopulAtion Parameter Averaging (PAPA)

Comments
3 min read
Voxel51 Filtered Views Newsletter - May 10, 2024

Voxel51 Filtered Views Newsletter - May 10, 2024

Comments
11 min read
SAR image matching algorithm based on multi-class features

SAR image matching algorithm based on multi-class features

Comments
4 min read
OptPDE: Discovering Novel Integrable Systems via AI-Human Collaboration

OptPDE: Discovering Novel Integrable Systems via AI-Human Collaboration

Comments
4 min read
TIM: An Efficient Temporal Interaction Module for Spiking Transformer

TIM: An Efficient Temporal Interaction Module for Spiking Transformer

Comments
5 min read
Large Language Models can Strategically Deceive their Users when Put Under Pressure

Large Language Models can Strategically Deceive their Users when Put Under Pressure

Comments
4 min read
Neural Networks Make Approximately Independent Errors Over Repeated Training

Neural Networks Make Approximately Independent Errors Over Repeated Training

Comments
4 min read
LLMs Can Patch Up Missing Relevance Judgments in Evaluation

LLMs Can Patch Up Missing Relevance Judgments in Evaluation

Comments
4 min read
The AI Review Lottery: Widespread AI-Assisted Peer Reviews Boost Paper Scores and Acceptance Rates

The AI Review Lottery: Widespread AI-Assisted Peer Reviews Boost Paper Scores and Acceptance Rates

Comments
4 min read
Accelerating ETL Processes for Timely Business Intelligence

Accelerating ETL Processes for Timely Business Intelligence

Comments
4 min read
Automating Data Processes for Efficiency and Accuracy

Automating Data Processes for Efficiency and Accuracy

Comments
5 min read
Architecture of Neural Networks

Architecture of Neural Networks

Comments
6 min read
AlphaMath Almost Zero: process Supervision without process

AlphaMath Almost Zero: process Supervision without process

Comments
4 min read
TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness

TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness

Comments
3 min read
Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents

Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents

Comments
4 min read
CascadedGaze: Efficiency in Global Context Extraction for Image Restoration

CascadedGaze: Efficiency in Global Context Extraction for Image Restoration

Comments
3 min read
How to Visualize LiDAR Data

How to Visualize LiDAR Data

Comments
1 min read
Chain of Thoughtlessness: An Analysis of CoT in Planning

Chain of Thoughtlessness: An Analysis of CoT in Planning

Comments
4 min read
From ETL to Modern Integration Platforms

From ETL to Modern Integration Platforms

Comments
4 min read
loading...