DEV Community

# dataprocessing

Posts

Apache Spark vs. Apache Flink: A Comparison of the Data Processing Duo

1
Comments
2 min read
Data Manipulation with Strings and Arrays in Bash

Comments
3 min read
Simplifying Data Processing with Java Stream API

Comments
2 min read
Understanding Apache Spark and Hadoop Jobs

Comments
5 min read
Beginner's guide to Apache Flink

1
Comments
3 min read
Standardizing the Data Using StandardScaler in ML

3
Comments
5 min read
Introducing Memphis Functions

2
Comments
3 min read
Elements of Event Driven Architecture(EDA)

Comments
4 min read
Event-Driven Architecture with Serverless Functions – Part 1

2
Comments 1
5 min read
Data Processing with Elixir (Part 2)

6
Comments 3
3 min read
Part 3: Transforming MongoDB CDC Event Messages

Comments
6 min read
kafka: distributed task queue

4
Comments
6 min read
Getting started with Apache Flink: A guide to stream processing

19
Comments
8 min read
Apache Flink vs Apache Spark: A detailed comparison for data processing

12
Comments 1
5 min read
Real-Time Data Processing using AWS

6
Comments 1
5 min read
Customer Data Pipeline And Data Processing: Types, Importance, And Benefits

2
Comments
4 min read
What does a Data Orchestration platform do?

5
Comments
2 min read
Open-source Deep Dive: Broadway (Part 1) - Message queues, concurrency in Elixir, and Broadway architecture

13
Comments
9 min read
Open-source Deep Dive: Broadway (Part 2) - Inner workings of Broadway

12
Comments
14 min read
MRI Data Processing with Python

6
Comments 6
2 min read
Taking a leap on data: Revolutionizing data usage with tokenization

7
Comments
1 min read
Monitoring = (Elasticsearch + Logstash + Kibana ) + Kafka * Flink

18
Comments
3 min read
{A} Numbers - All things are number

8
Comments
1 min read
Data Pipeline Orchestration With Zeebe (And An Example Map/Reduce Implementation)

6
Comments
6 min read
Real world data processing with Google Cloud Platform

12
Comments 1
1 min read