Want to learn Apache Spark?
Stream Processing, Analytics, and Machine Learning?
This blog post series is for you!
Apache Spark is an open-source distributed general-purpose cluster- computing framework. Spark provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Apache Spark bitesize series is built for busy people.
Each post will cover one of the three most important areas in working
with data technologies and challenges today:
Every post will have a maximum of 2 minutes read length coupled with longer tutorial and hand-on workshops for when you can put the time.
Want to get updates on new bitesize posts?
Follow me here on dev.to and on Twitter.
Have a question for me? comment or send a DM.