loading...
👋 Sign in for the ability sort posts by top and latest.

6 Ways of Applying a Function to Python Pandas DataFrame

Reactions 4
1 min read

Cut data warehouse costs with run caching

Reactions 5
3 min read

Dagster with User Code Deployments (gRPC)

Reactions 4
6 min read

Some of my favourite public data sets

Reactions 8 Comments 2
2 min read

5 Essential skills for becoming a Data Engineer

Reactions 4
6 min read

The Most Popular Data Science Newsletters

Reactions 7
9 min read

Build a monitored code-based pipeline to move data from Postgres to Snowflake

Reactions 6
9 min read

Handling upstream data changes via Change Data Capture

Reactions 5
8 min read

Intoduction to Apache Spark

Reactions 4
6 min read
01:00

Kafka Connect in 60 seconds

Reactions 2
2 min read

Deploying data pipelines to AWS Fargate - with monitoring and alerts built-in

Reactions 3
3 min read

Large-Scale Data Quality Verification in .NET PT.1

Reactions 2
9 min read

How To Run Airflow on Windows (with Docker)

Reactions 9 Comments 1
8 min read

Data Engineering Project for Beginners - Batch edition

Reactions 8
19 min read

Airflow UI with Role-Based Access Control

Reactions 3
1 min read

🛢Create New Kedro Pipeline (kedro new)

Reactions 6
4 min read

🤷‍♀️ What is Kedro (The Parts)

Reactions 14 Comments 3
3 min read

Data engineering portfolio projects?

Reactions 12 Comments 1
1 min read

Apache Airflow Core Concepts

Reactions 15
4 min read

5 Considerations to have when using Airflow

Reactions 10
6 min read
00:31

Data Engineering Skills

Reactions 12
1 min read

Manage Data Pipelines with Apache Airflow

Reactions 62
13 min read

How to Run Parallel Data Analysis in Python using Dask Dataframes

Reactions 6
6 min read

Scrape Structured Data with Python and Extruct

Reactions 7
16 min read

Loading CSV data into Kafka - video walkthrough

Reactions 5
10 min read

CI/CD for ETL/ELT pipelines

Reactions 18
3 min read

Terraform in Anger Part 1: AWS S3 Access

Reactions 6
9 min read

5 Challenges ในการสร้าง Production-Grade Data Pipeline

Reactions 26 Comments 5
1 min read

What differentiates schema on read from schema on write?

Reactions 2 Comments 2
3 min read

Scraping Data on the Web with BeautifulSoup

Reactions 30
12 min read

10 Key skills, to help you become a data engineer

Reactions 7
3 min read

Apache Airflow Installation - mysql+celery

Reactions 3
1 min read

Extract Nested Data From Complex JSON

Reactions 8
6 min read

Psycopg2: PostgreSQL & Python (the Old Fashioned Way)

Reactions 16
6 min read

Azure Message Brokers patterns for Data Applications

Reactions 6
6 min read

Coding MapReduce in C from Scratch using Threads: Map

Reactions 7
9 min read

How to collect the data you need to bootstrap your digital marketing analytics

Reactions 12
12 min read

Structured Streaming in PySpark

Reactions 10
9 min read

Becoming Familiar with Apache Kafka and Message Queues

Reactions 16
6 min read

I am a junior data engineer without a senior engineer. What should I do?

Reactions 6 Comments 1
1 min read

Why we chose Apache Spark for ETL (Extract-Transform-Load)

Reactions 22
6 min read

Intro to Data Ingestion and Data Lakes

Reactions 7
3 min read

Data Engineering — Complete Reference Guide From A-Z [2019]

Reactions 18
16 min read

Overview of the different approaches to putting Machine Learning (ML) models in production

Reactions 7
14 min read

ON the evolution of Data Engineering

Reactions 14
4 min read

Understanding and Optimizing Throughput in Azure Cosmos DB

Reactions 7 Comments 2
8 min read

Welcome to SQL: Modifying Databases and Tables

Reactions 6
10 min read

From CSVs to Tables: Infer Schema Data Types From Raw Spreadsheets

Reactions 6
8 min read

Intro to Python Database Management with SQLAlchemy

Reactions 15
7 min read

Know Your Data (KYD)

Reactions 5
1 min read

Optimising E-Commerce Data - Know Your Data (KYD)

Reactions 2
3 min read

So, you want to data science

Reactions 28 Comments 2
8 min read

Choosing Your Data Warehouse

Reactions 10 Comments 4
1 min read

What Do I Look for in Data Engineers?

Reactions 0
2 min read

Data Warehouse - The Minimal Architectural Approach

Reactions 2
2 min read

Thermal Data

Reactions 2
1 min read
loading...