DEV Community

# bigdata

Posts

👋 Sign in for the ability sort posts by top and latest.
Cube Cloud Deep Dive: Starting a New Cube App

Cube Cloud Deep Dive: Starting a New Cube App

Reactions 17 Comments
9 min read
Understanding Apache Hive LLAP

Understanding Apache Hive LLAP

Reactions 3 Comments
7 min read
ETLs vs ELTs: Why are ELTs Disrupting the Data Market?

ETLs vs ELTs: Why are ELTs Disrupting the Data Market?

Reactions 8 Comments
8 min read
Setting up a single-node Hadoop cluster

Setting up a single-node Hadoop cluster

Reactions 6 Comments
10 min read
Installing Hadoop on the new M1 Pro and M1 Max MacBook Pro

Installing Hadoop on the new M1 Pro and M1 Max MacBook Pro

Reactions 6 Comments
8 min read
Bigdata: A problem and a solution

Bigdata: A problem and a solution

Reactions 1 Comments
4 min read
Build your own data quality rules with AWS Glue DataBrew

Build your own data quality rules with AWS Glue DataBrew

Reactions 11 Comments
6 min read
Identifying and handling personally identifiable information (PII) ด้วย AWS Glue DataBrew

Identifying and handling personally identifiable information (PII) ด้วย AWS Glue DataBrew

Reactions 6 Comments
4 min read
Data Engineering Introduction

Data Engineering Introduction

Reactions 6 Comments
2 min read
What Is Crypto and How Does It Work ?

What Is Crypto and How Does It Work ?

Reactions 7 Comments
3 min read
Find The Best Way To Load Data In A Data Warehouse

Find The Best Way To Load Data In A Data Warehouse

Reactions 2 Comments
4 min read
Amazon Redshift: Cost Optimization | AWS White Paper Summary

Amazon Redshift: Cost Optimization | AWS White Paper Summary

Reactions 22 Comments
9 min read
The Important SQL Queries for Beginners

The Important SQL Queries for Beginners

Reactions 12 Comments
8 min read
Data lakes: building a serverless data pipeline

Data lakes: building a serverless data pipeline

Reactions 2 Comments
6 min read
Examples of Digital Transformation in Healthcare

Examples of Digital Transformation in Healthcare

Reactions 2 Comments
6 min read
Big Data Analytics Options on AWS | AWS White Paper Summary

Big Data Analytics Options on AWS | AWS White Paper Summary

Reactions 23 Comments
10 min read
A first update on our AI/ML/Big Data salary survey

A first update on our AI/ML/Big Data salary survey

Reactions 2 Comments
2 min read
Performance capabilities of data warehouses and how Cube can help

Performance capabilities of data warehouses and how Cube can help

Reactions 24 Comments
18 min read
Scramjet Transform Hub — Quick Start introduction

Scramjet Transform Hub — Quick Start introduction

Reactions 10 Comments
7 min read
Introduction to Scramjet Data Processing Platform

Introduction to Scramjet Data Processing Platform

Reactions 9 Comments
3 min read
Getting Started With Apache Airflow

Getting Started With Apache Airflow

Reactions 5 Comments
11 min read
Snowflake Vs BigQuery — Two Cloud Data Warehouses Of Many

Snowflake Vs BigQuery — Two Cloud Data Warehouses Of Many

Reactions 9 Comments
6 min read
Key concepts of Big Data

Key concepts of Big Data

Reactions 2 Comments
5 min read
Big Data & Analytics : Driving Value To Advanced Business Growth Initiatives

Big Data & Analytics : Driving Value To Advanced Business Growth Initiatives

Reactions 5 Comments 1
6 min read
ทดสอบทำ Machine Learning predict customer churn โดยใช้งาน Amazon SageMaker กับ Snowflake!

ทดสอบทำ Machine Learning predict customer churn โดยใช้งาน Amazon SageMaker กับ Snowflake!

Reactions 8 Comments 1
6 min read
What Are the Technology Trends That Reshape the Future of The Retail Industry in 2021?

What Are the Technology Trends That Reshape the Future of The Retail Industry in 2021?

Reactions 4 Comments
8 min read
Computing the Pearson correlation matrix on huge datasets in Python

Computing the Pearson correlation matrix on huge datasets in Python

Reactions 7 Comments 1
5 min read
How to Scrape Twitter Data with Headless Chrome

How to Scrape Twitter Data with Headless Chrome

Reactions 6 Comments 1
5 min read
Reliable ingestion from AWS S3 using Hudi

Reliable ingestion from AWS S3 using Hudi

Reactions 3 Comments
6 min read
ทดสอบการทำ Anonymize data in your data lake with Amazon Athena

ทดสอบการทำ Anonymize data in your data lake with Amazon Athena

Reactions 11 Comments 1
2 min read
เริ่มใช้งาน SQL-based INSERTS, DELETES and UPSERTS in S3 โดยใช้ AWS Glue 3.0 และ Delta Lake

เริ่มใช้งาน SQL-based INSERTS, DELETES and UPSERTS in S3 โดยใช้ AWS Glue 3.0 และ Delta Lake

Reactions 10 Comments
6 min read
How to deal with Big data challenges

How to deal with Big data challenges

Reactions 6 Comments
5 min read
Updating data files, commits vs. pull requests

Updating data files, commits vs. pull requests

Reactions 6 Comments 4
3 min read
SQL-based INSERTS, DELETES and UPSERTS in S3 using AWS Glue 3.0 and Delta Lake

SQL-based INSERTS, DELETES and UPSERTS in S3 using AWS Glue 3.0 and Delta Lake

Reactions 10 Comments 6
7 min read
To let the beginners know their career goals who have opted data science.

To let the beginners know their career goals who have opted data science.

Reactions 2 Comments
2 min read
Predictive Analytics: How to Make Digital Price Predictions

Predictive Analytics: How to Make Digital Price Predictions

Reactions 3 Comments
4 min read
Big data and how to get them

Big data and how to get them

Reactions 2 Comments
3 min read
Unboxing a Database-How Databases Work Internally

Unboxing a Database-How Databases Work Internally

Reactions 12 Comments 1
11 min read
UPSERTS and DELETES using AWS Glue and Delta Lake

UPSERTS and DELETES using AWS Glue and Delta Lake

Reactions 21 Comments
10 min read
Exploratory Data Analysis Using Python

Exploratory Data Analysis Using Python

Reactions 42 Comments 1
5 min read
Getting started with Azure Data Explorer and Azure Synapse Analytics for Big Data processing

Getting started with Azure Data Explorer and Azure Synapse Analytics for Big Data processing

Reactions 2 Comments
9 min read
How to easily install kafka without zookeeper

How to easily install kafka without zookeeper

Reactions 7 Comments
7 min read
Creating a Spark Standalone Cluster with Docker and docker-compose(2021 update)

Creating a Spark Standalone Cluster with Docker and docker-compose(2021 update)

Reactions 9 Comments
7 min read
Understanding Open Telemetry and Observability w/ Splunk's Spiros Xanthos

Understanding Open Telemetry and Observability w/ Splunk's Spiros Xanthos

Reactions 12 Comments
1 min read
How to Use Consistent Hashing in a System Design Interview?

How to Use Consistent Hashing in a System Design Interview?

Reactions 12 Comments 3
7 min read
5 Best Big Data Frameworks You Can Learn in 2021

5 Best Big Data Frameworks You Can Learn in 2021

Reactions 52 Comments
8 min read
The Complete Guide to Data Science, Big Data, and Data Analytics

The Complete Guide to Data Science, Big Data, and Data Analytics

Reactions 6 Comments
3 min read
The Evolution of Data Access Control

The Evolution of Data Access Control

Reactions 3 Comments
3 min read
AWS Data Lake with Terraform - Part 2 of 6

AWS Data Lake with Terraform - Part 2 of 6

Reactions 14 Comments
2 min read
AWS Data Lake with Terraform - Part 1 of 6

AWS Data Lake with Terraform - Part 1 of 6

Reactions 21 Comments
4 min read
Assess how many Kafka servers are needed to face a scenario of 1 billion requests.

Assess how many Kafka servers are needed to face a scenario of 1 billion requests.

Reactions 8 Comments
6 min read
Big Data + MySQL = Mission InnoPossible?

Big Data + MySQL = Mission InnoPossible?

Reactions 4 Comments
9 min read
A Visual Guide To: Azure Data Factory

A Visual Guide To: Azure Data Factory

Reactions 10 Comments
4 min read
AzureFunBytes Reminder - Intro to @Azure Data Factory with @KromerBigData - 5/13/2021

AzureFunBytes Reminder - Intro to @Azure Data Factory with @KromerBigData - 5/13/2021

Reactions 7 Comments
3 min read
5 Ways Big Data & Analytics Can Pay Off To Your Marketing & Sales in 2021

5 Ways Big Data & Analytics Can Pay Off To Your Marketing & Sales in 2021

Reactions 4 Comments 2
6 min read
AzureFunBytes Episode 43 - Intro to @Azure Data Factory with @KromerBigData

AzureFunBytes Episode 43 - Intro to @Azure Data Factory with @KromerBigData

Reactions 6 Comments
3 min read
Top 5 Online Events on new technologies and trends 2021

Top 5 Online Events on new technologies and trends 2021

Reactions 2 Comments 1
6 min read
Eliminate frictions from the developers’ experience – discover the new Inspector data visualization UI

Eliminate frictions from the developers’ experience – discover the new Inspector data visualization UI

Reactions 4 Comments
3 min read
Data storage patterns, versioning and partitions

Data storage patterns, versioning and partitions

Reactions 7 Comments
9 min read
Here is What Happens If You Decouple Your BI Stack

Here is What Happens If You Decouple Your BI Stack

Reactions 5 Comments
7 min read
loading...