loading...
Cover image for Apache Spark Bitesize Series

Apache Spark Bitesize Series

adipolak profile image Adi Polak ・1 min read

Apache Spark in Bitesize (4 Part Series)

1) Apache Spark Bitesize Series 2) Apache Spark Basics 3) Apache Spark Accumulators, Simplified ! 4) PySpark and Apache Spark Broadcast Mechanism

Want to learn Apache Spark?
Stream Processing, Analytics, and Machine Learning?

This blog post series is for you!

Apache Spark is an open-source distributed general-purpose cluster- computing framework. Spark provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

It's time to light up the Spark:

Apache Spark bitesize series is built for busy people.

Each post will cover one of the three most important areas in working
with data technologies and challenges today:

Streaming
Analytics
Machine Learning

Every post will have a maximum of 2 minutes read length coupled with longer tutorial and hand-on workshops for when you can put the time.

Want to get updates on new bitesize posts?
Follow me here on dev.to and on Twitter.

Have a question for me? comment or send a DM.

Apache Spark in Bitesize (4 Part Series)

1) Apache Spark Bitesize Series 2) Apache Spark Basics 3) Apache Spark Accumulators, Simplified ! 4) PySpark and Apache Spark Broadcast Mechanism

Posted on by:

adipolak profile

Adi Polak

@adipolak

1 out of 25 influential women in Software Development according to Apiumhub. I am a software developer who would like to learn more!

Discussion

markdown guide
 

I'm in! Have set a personal goal of getting this 'kitchen sink', bit.ly/2KTwPwL example cooking for my own edification and Spark is a piece of the puzzle. So, chipping away at Spark sounds just like what the dr. ordered!