DEV Community

loading...

# bigdata

👋 Sign in for the ability sort posts by top and latest.
"I learned right away how important Data manipulation and cleaning was to managing business affairs", — Matthew D. Groves

"I learned right away how important Data manipulation and cleaning was to managing business affairs", — Matthew D. Groves

Reactions 6 Comments
4 min read
Big Data, What's the Big Deal

Big Data, What's the Big Deal

Reactions 1 Comments
2 min read
"Working with Data helps to uncover the inner workings of multiple spheres in our daily life",— Roksolana Diachuk.

"Working with Data helps to uncover the inner workings of multiple spheres in our daily life",— Roksolana Diachuk.

Reactions 3 Comments
4 min read
"Data is always something new to learn in the area, some new tool to try, some new insight to discover", — Ruben Berenguel

"Data is always something new to learn in the area, some new tool to try, some new insight to discover", — Ruben Berenguel

Reactions 2 Comments
4 min read
"Data is the new center of gravity", — Jules Damji.

"Data is the new center of gravity", — Jules Damji.

Reactions 2 Comments
3 min read
Data as a service ( DaaS ) benefits & trends

Data as a service ( DaaS ) benefits & trends

Reactions 2 Comments
5 min read
The Management of Data

The Management of Data

Reactions 5 Comments 1
3 min read
Impact of COVID-19 on people's habits worldwide

Impact of COVID-19 on people's habits worldwide

Reactions 8 Comments 5
2 min read
How we implemented Distributed Multi-document ACID Transactions in Couchbase

How we implemented Distributed Multi-document ACID Transactions in Couchbase

Reactions 6 Comments
14 min read
Integrating LVM with Hadoop Cluster

Integrating LVM with Hadoop Cluster

Comments
3 min read
7 Real-Time Data Streaming Tools You Should Consider On Your Next Project

7 Real-Time Data Streaming Tools You Should Consider On Your Next Project

Reactions 17 Comments 1
9 min read
Elasticsearch as a primary database?

Elasticsearch as a primary database?

Reactions 4 Comments
2 min read
Why You Need a CRM Data Cleanup

Why You Need a CRM Data Cleanup

Reactions 2 Comments
8 min read
Starting your Journey with Big Data Analytics

Starting your Journey with Big Data Analytics

Reactions 4 Comments
4 min read
Data architecture characteristics & principles

Data architecture characteristics & principles

Reactions 2 Comments
6 min read
Getting Started With JanusGraph

Getting Started With JanusGraph

Reactions 2 Comments
5 min read
Apache Kafka: What is and how it works

Apache Kafka: What is and how it works

Reactions 2 Comments
8 min read
Spark MLlib for Big data and Machine learning

Spark MLlib for Big data and Machine learning

Reactions 7 Comments
4 min read
What is data engineering?

What is data engineering?

Reactions 4 Comments
1 min read
A Look at the Long-Lasting Java and Big Data Relationship (With a List of Resources Data Scientists Can Use for Java Learning)

A Look at the Long-Lasting Java and Big Data Relationship (With a List of Resources Data Scientists Can Use for Java Learning)

Reactions 5 Comments 1
8 min read
BIG DATA COURSE

BIG DATA COURSE

Reactions 3 Comments
3 min read
Using Your Own Apache Spark/Hudi Versions With AWS EMR

Using Your Own Apache Spark/Hudi Versions With AWS EMR

Reactions 3 Comments
2 min read
What In The World Is Dremio And Why Is It Valued At 1 Billion Dollars?

What In The World Is Dremio And Why Is It Valued At 1 Billion Dollars?

Reactions 5 Comments
7 min read
The ugly truth of the CDP

The ugly truth of the CDP

Reactions 4 Comments
1 min read
The Unbiased Guide to Choosing the Right BI Tool

The Unbiased Guide to Choosing the Right BI Tool

Reactions 37 Comments 1
5 min read
Optimize Data Lake layout using Clustering in Apache Hudi

Optimize Data Lake layout using Clustering in Apache Hudi

Reactions 2 Comments
6 min read
Aprendiendo Spark: #1 Introducción

Aprendiendo Spark: #1 Introducción

Reactions 11 Comments
3 min read
Kinesis Data Streams vs. Kinesis Firehose Delivery Streams

Kinesis Data Streams vs. Kinesis Firehose Delivery Streams

Reactions 2 Comments
3 min read
Data Mining

Data Mining

Reactions 3 Comments
2 min read
Right Sizing Snowflake Warehouses / Compute

Right Sizing Snowflake Warehouses / Compute

Reactions 2 Comments
3 min read
5 Best Hadoop Tutorials to Start in 2021

5 Best Hadoop Tutorials to Start in 2021

Reactions 6 Comments
7 min read
SHARING|BitCherry Testnet is about to Online, Countdown: 1 Day

SHARING|BitCherry Testnet is about to Online, Countdown: 1 Day

Reactions 2 Comments
1 min read
Data Analytics on AWS — What, Why & How

Data Analytics on AWS — What, Why & How

Reactions 5 Comments
13 min read
What Is Big Data?

What Is Big Data?

Reactions 2 Comments
6 min read
Hadoop Installation on Windows 10 using WSL

Hadoop Installation on Windows 10 using WSL

Reactions 12 Comments
7 min read
Here is a python ORM/Driver for InfluxDB : Influxable

Here is a python ORM/Driver for InfluxDB : Influxable

Reactions 5 Comments
2 min read
Machine Learning and Artificial Intelligence

Machine Learning and Artificial Intelligence

Reactions 2 Comments
8 min read
Spark on Kubernetes Made Easy - How Data Mechanics Improves on the Open-Source version

Spark on Kubernetes Made Easy - How Data Mechanics Improves on the Open-Source version

Reactions 7 Comments
5 min read
Data Analyst vs Business Analyst

Data Analyst vs Business Analyst

Reactions 16 Comments 3
4 min read
Event Driven Data Pipelines in AWS

Event Driven Data Pipelines in AWS

Reactions 5 Comments
9 min read
5 Reasons Why Big Data Analytics is the Best Career Move

5 Reasons Why Big Data Analytics is the Best Career Move

Reactions 2 Comments
4 min read
What Are ETLs And Why We Use Them

What Are ETLs And Why We Use Them

Reactions 26 Comments
14 min read
Automation and Machine Learning: A Match Made In Heaven

Automation and Machine Learning: A Match Made In Heaven

Reactions 30 Comments 3
5 min read
Trying to grow an open-source ETL project with PHP

Trying to grow an open-source ETL project with PHP

Reactions 4 Comments
1 min read
3 Ways To Improve Your Data Science Teams Efficiency

3 Ways To Improve Your Data Science Teams Efficiency

Reactions 14 Comments
7 min read
Apache Spark Java Tutorial: Simplest Guide to Get Started

Apache Spark Java Tutorial: Simplest Guide to Get Started

Reactions 6 Comments
3 min read
Simulate IoT sensor, use Kafka to process data in real-time, save to Elasticsearch

Simulate IoT sensor, use Kafka to process data in real-time, save to Elasticsearch

Reactions 15 Comments
4 min read
Change Data Capture from PostgreSQL to Azure Data Explorer using Kafka Connect

Change Data Capture from PostgreSQL to Azure Data Explorer using Kafka Connect

Reactions 6 Comments
17 min read
Top Hadoop Interview Questions

Top Hadoop Interview Questions

Reactions 5 Comments
2 min read
Predicting machine failures with distributed computing (Spark, AWS EMR, and DL)

Predicting machine failures with distributed computing (Spark, AWS EMR, and DL)

Reactions 9 Comments
10 min read
Demystify Apache Spark with Azure Synapse Analytics

Demystify Apache Spark with Azure Synapse Analytics

Reactions 5 Comments
1 min read
Transform AWS CloudTrail data using AWS Data Wrangler

Transform AWS CloudTrail data using AWS Data Wrangler

Reactions 3 Comments
8 min read
Enterprise Digital Transformation Guide in the Post Covid World

Enterprise Digital Transformation Guide in the Post Covid World

Reactions 2 Comments 1
4 min read
Dark Data and why it matters in Big Data

Dark Data and why it matters in Big Data

Reactions 2 Comments
3 min read
Please ELI5 big data and privacy concerns, and possible black hacks

Please ELI5 big data and privacy concerns, and possible black hacks

Reactions 2 Comments 3
1 min read
MLOps

MLOps

Reactions 4 Comments
2 min read
Spark Journey begins...

Spark Journey begins...

Reactions 6 Comments
3 min read
Data Ingestion into Azure Data Explorer using Kafka Connect on Kubernetes

Data Ingestion into Azure Data Explorer using Kafka Connect on Kubernetes

Reactions 7 Comments 1
12 min read
Data Scraping and Data Crawling, what are they for?

Data Scraping and Data Crawling, what are they for?

Reactions 4 Comments
5 min read
Working with nested structures in Spark

Working with nested structures in Spark

Reactions 6 Comments 1
3 min read
loading...