DEV Community

# bigdata

Right Sizing Snowflake Warehouses / Compute

Reactions 2
3 min read
SHARING|BitCherry Testnet is about to Online, Countdown: 1 Day

Reactions 2
1 min read
Data Analytics on AWS — What, Why & How

Reactions 5
13 min read
What Is Big Data?

Reactions 2
6 min read
Hadoop Installation on Windows 10 using WSL

Reactions 7
7 min read
Here is a python ORM/Driver for InfluxDB : Influxable

Reactions 5
2 min read
Trying to grow an open-source ETL project with PHP

Reactions 5
1 min read
Machine Learning and Artificial Intelligence

Reactions 2
8 min read
Spark on Kubernetes Made Easy - How Data Mechanics Improves on the Open-Source version

Reactions 7
5 min read
Data Analyst vs Business Analyst

Reactions 15 Comments 2
4 min read
Event Driven Data Pipelines in AWS

Reactions 5
9 min read
5 Reasons Why Big Data Analytics is the Best Career Move

Reactions 2
4 min read
What Are ETLs And Why We Use Them

Reactions 26
14 min read
Automation and Machine Learning: A Match Made In Heaven

Reactions 30 Comments 3
5 min read
3 Ways To Improve Your Data Science Teams Efficiency

Reactions 14
7 min read
Apache Spark Java Tutorial: Simplest Guide to Get Started

Reactions 6
3 min read
Simulate IoT sensor, use Kafka to process data in real-time, save to Elasticsearch

Reactions 11
4 min read
Change Data Capture from PostgreSQL to Azure Data Explorer using Kafka Connect

Reactions 6
17 min read
Top Hadoop Interview Questions

Reactions 2
2 min read
Predicting machine failures with distributed computing (Spark, AWS EMR, and DL)

Reactions 7
10 min read
Demystify Apache Spark with Azure Synapse Analytics

Reactions 5
1 min read
Transform AWS CloudTrail data using AWS Data Wrangler

Reactions 3
8 min read
Enterprise Digital Transformation Guide in the Post Covid World

Reactions 2
4 min read
Dark Data and why it matters in Big Data

Reactions 2
3 min read
Please ELI5 big data and privacy concerns, and possible black hacks

Reactions 2 Comments 3
1 min read
MLOps

Reactions 4
2 min read
Spark Journey begins...

Reactions 6
3 min read
Data Ingestion into Azure Data Explorer using Kafka Connect on Kubernetes

Reactions 7 Comments 1
12 min read
Data Scraping and Data Crawling, what are they for?

Reactions 4
5 min read
Working with nested structures in Spark

Reactions 6 Comments 1
3 min read
Guide - AWS Glue and PySpark

Reactions 5
14 min read
Introduction to Apache Spark

Reactions 8
6 min read
Kafka Connect in 60 seconds

Reactions 3
2 min read
Data Governance 101

Reactions 4
4 min read
Big Data - Testing Strategy

Reactions 2
1 min read
Supply Chain Risk Management with Data Analytics

Reactions 2
2 min read
Tutorial: How to Ingest data from Kafka into Azure Data Explorer

Reactions 11
10 min read
Unit Testing Apache Spark Structured Streaming using MemoryStream

Reactions 7
4 min read
Exploiting Schema Inference in Apache Spark

Reactions 2
3 min read
How to use Azure Go SDK to manage Azure Data Explorer clusters

Reactions 6
9 min read
Tutorial: Getting started with Azure Data Explorer using the Go SDK

Reactions 12
9 min read
Hadoop vs Spark: Which is a better framework to select for processing Big Data?

Reactions 4
5 min read
Why are we building DevOps platform for Big Data?

Reactions 3
3 min read
The Big Data Bravura: Introducing Apache Spark

Reactions 20 Comments 2
3 min read
Introduction to Hive for dummies [Module1.3]

Reactions 7
10 min read
On.NET Episode: Data processing with .NET for Apache Spark

Reactions 7
1 min read
How to compare your data in/with Spark

Reactions 6
6 min read
How Can Organizations Ensure the Success of Their Customer Master Data Management Initiatives?

Reactions 4
5 min read
Install Hadoop in linux (Debian) for Big Data Analysis

Reactions 6
3 min read
The 5-minute guide to using bucketing in Pyspark

Reactions 8 Comments 4
4 min read
spark-submit command builder with live preview

Reactions 8
1 min read
Database normalization may be harmful to efficiency on large scale analytics projects.

Reactions 12 Comments 2
2 min read
AWS Certified Big Data: Specialty study blueprint

Reactions 13
18 min read
Cloud Data Fusion, a game-changer for GCP

Reactions 11 Comments 7
4 min read
Database is not always the answer

Reactions 21 Comments 12
2 min read
Data Lake vs Data Warehouse

Reactions 8
2 min read
Life Beyond Kafka with Apache Pulsar

Reactions 16
4 min read
10 Apache Hadoop tutorials, books, and courses for Java and Web developers

Reactions 46
6 min read
Azure Blob Storage with Pyspark

Reactions 10 Comments 1
2 min read
Big Data file formats explained

Reactions 10
7 min read