DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

Advanced SQL Techniques: Taking Your Data Skills to the Next Level

2 min read
LLM Agents can Autonomously Exploit One-day Vulnerabilities

4 min read
A decoder-only foundation model for time-series forecasting

4 min read
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

4 min read
A Closer Look at AUROC and AUPRC under Class Imbalance

4 min read
Twenty Constructionist Things to Do with Artificial Intelligence and Machine Learning

4 min read
Efficient Sentiment Analysis: A Resource-Aware Evaluation of Feature Extraction Techniques, Ensembling, and Deep Learning Models

4 min read
Embracing Open Source: A Catalyst for Scientific Progress

2 min read
Basic Terms in Machine Learning (Model Training)

4 min read
Computer Vision Meetup: GraphRAG with a Knowledge Graph (27:01)

1 min read
AI: A Massive Shift in How We'll Work in the Future is Coming. Are You Ready?

2 min read
Data Analysis 1: Scraping web pages

2 min read
Chinchilla Scaling: A replication attempt

3 min read
Born With a Silver Spoon? Investigating Socioeconomic Bias in Large Language Models

4 min read
What is Exploratory Data Analysis (EDA)?

2 min read
The Curious Decline of Linguistic Diversity: Training Language Models on Synthetic Text

4 min read
What are human values, and how do we align AI to them?

4 min read
Confidential Federated Computations

4 min read
KMDS, a package for knowledge management in data science

1 min read
Unlock Efficiency with ID Document Recognition: 8 Hassle-Free Validation Techniques

1 min read
Long-form music generation with latent diffusion

4 min read
Day 61 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Graph - 1)
1 min read
Choosing the Right Web Development Agency: The Keys to a Successful Partnership

2 min read
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time

4 min read
🔬 New Caledonia job profiles and ROME codes

1 comment
3 min read
What is the importance of learning linear algebra for data science?

3 min read
SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

4 min read
Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying

4 min read
Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

4 min read
BooookScore: A systematic exploration of book-length summarization in the era of LLMs

4 min read
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

4 min read
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

4 min read
The Curse of Recursion: Training on Generated Data Makes Models Forget

4 min read
Assumption of Homoscedasticity: A Guide to Verifying the Assumption of Constant Variance of Residuals

3 min read
The Illusion of State in State-Space Models

4 min read
The Impact of Depth on Compositional Generalization in Transformer Language Models

4 min read
Generalization in diffusion models arises from geometry-adaptive harmonic representations

4 min read
Chapter: Vulnerability of Quantum Information Systems to Collective Manipulation

4 min read
21 Data Science Terms Everyone Should Know

2 min read
ChatGPT Can Predict the Future when it Tells Stories Set in the Future About the Past

4 min read
CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs

3 min read
H2O-Danube-1.8B Technical Report

4 min read
Manipulating Large Language Models to Increase Product Visibility

3 min read
Dataset Reset Policy Optimization for RLHF

4 min read
Recommender Systems in the Era of Large Language Models (LLMs)

4 min read
TransformerFAM: Feedback attention is working memory

4 min read
Visualizing Multi-Dimensional Data in Action: Vehicle Ownership

2 min read
CVPR 2024 Survival Guide: Five Vision-Language Papers You Don’t Want to Miss

9 min read
Vision Transformers Need Registers

4 min read
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

3 min read
The Expressive Power of Transformers with Chain of Thought

4 min read
GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications

4 min read
Show Your Work with Confidence: Confidence Bands for Tuning Curves

4 min read
Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

3 min read
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

3 min read
Rho-1: Not All Tokens Are What You Need

4 min read
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

4 min read
Algorithmic Collective Action in Recommender Systems: Promoting Songs by Reordering Playlists

4 min read
Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

4 min read
JetMoE: Reaching Llama2 Performance with 0.1M Dollars

4 min read