DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Basics of Git and GitHub

Basics of Git and GitHub

Comments
4 min read
Why Data Engineers Are Becoming Agent Engineers

Why Data Engineers Are Becoming Agent Engineers

Comments
3 min read
Apache Gravitino Introduction

Apache Gravitino Introduction

Comments
5 min read
Tired of ETL Bottlenecks? Build a Logical Data Warehouse with SPL

Tired of ETL Bottlenecks? Build a Logical Data Warehouse with SPL

5
Comments
11 min read
Dev List Digest for Apache Iceberg, Parquet, Polaris and Arrow: January 6–14, 2026

Dev List Digest for Apache Iceberg, Parquet, Polaris and Arrow: January 6–14, 2026

Comments
4 min read
Building a Near Real-Time Analytics Pipeline with AWS Zero-ETL

Building a Near Real-Time Analytics Pipeline with AWS Zero-ETL

Comments
4 min read
Geospatial Data Orchestration: Why Modern GIS Pipelines Require an Asset-Based Approach

Geospatial Data Orchestration: Why Modern GIS Pipelines Require an Asset-Based Approach

Comments
7 min read
We're Manufacturing Dashboards & Data Nobody Uses (And the Data Proves It)

We're Manufacturing Dashboards & Data Nobody Uses (And the Data Proves It)

Comments
4 min read
Conference Notes: How ML Powers LINE Services

Conference Notes: How ML Powers LINE Services

Comments
5 min read
Modern Data Integration at Scale with Microsoft Fabric Connectors

Modern Data Integration at Scale with Microsoft Fabric Connectors

Comments
3 min read
LINE Developer Meetup 13 (Part 1): Conference Notes from 2020/09/18

LINE Developer Meetup 13 (Part 1): Conference Notes from 2020/09/18

Comments
7 min read
JSONL is a seriously weird format!

JSONL is a seriously weird format!

Comments
2 min read
How to Build Presto from Source - OSS Contribution Guide (Step by Step Tutorial)

How to Build Presto from Source - OSS Contribution Guide (Step by Step Tutorial)

Comments
7 min read
SQL on Kafka Data Does Not Require a Streaming Engine

SQL on Kafka Data Does Not Require a Streaming Engine

Comments
4 min read
Engineer’s Diary: Leaving Windows Behind and Building the ETL Engine I Always Dreamed Of, PardoX v0.1

Engineer’s Diary: Leaving Windows Behind and Building the ETL Engine I Always Dreamed Of, PardoX v0.1

Comments
21 min read
Building a MedAdvantage RAF Engine with dbt & PostgreSQL (Step-by-Step Guide)

Building a MedAdvantage RAF Engine with dbt & PostgreSQL (Step-by-Step Guide)

1
Comments
4 min read
The evolution of the Modern Data Stack: From RDBMS to the LakeHouse

The evolution of the Modern Data Stack: From RDBMS to the LakeHouse

Comments
11 min read
Ask Our AI Experts: An AMA With Our Tech Leads

Ask Our AI Experts: An AMA With Our Tech Leads

Comments
3 min read
A Lightweight, Plugin-Oriented ETL Engine for Data Synchronization Built on Akka.NET

A Lightweight, Plugin-Oriented ETL Engine for Data Synchronization Built on Akka.NET

Comments
4 min read
Garbage In, Powerhouse Out? (Nope.) Why Your Data Foundation Matters More Than AI

Garbage In, Powerhouse Out? (Nope.) Why Your Data Foundation Matters More Than AI

1
Comments
4 min read
Proyecto Weather Service (Parte 1): Construyendo el Recolector de Datos con Python y GitHub Actions o Netlify

Proyecto Weather Service (Parte 1): Construyendo el Recolector de Datos con Python y GitHub Actions o Netlify

1
Comments
10 min read
Your 2026 Resolution: Add Context to Your Data (Before It Breaks You)

Your 2026 Resolution: Add Context to Your Data (Before It Breaks You)

Comments
10 min read
The Natasha Problem: Why Your Data Pipeline Only Fits One Person

The Natasha Problem: Why Your Data Pipeline Only Fits One Person

Comments
5 min read
Before Big Data: 3 Key Discoveries That Changed Business Strategy Forever

Before Big Data: 3 Key Discoveries That Changed Business Strategy Forever

Comments
4 min read
Stop Re-running Everything: A Local Incremental Pipeline in DuckDB

Stop Re-running Everything: A Local Incremental Pipeline in DuckDB

Comments
4 min read
loading...