Below is a mini data engineering crash course that I quickly put together. It lists subjects that aspiring data engineers may want to become familiar with:
💿 R / RStudio: data.frame
💿 Spark / Databricks / EMR: DataFrame / Dataset
💿 Python: pandas
💿 Java: JPA / JPQL / Hibernate / HQL
💿 Data modeling
💿 Relational databases & SQL
💿 NoSQL / MapReduce / Hadoop
💿 MongoDB
💿 CSV files
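As a small taste of how a couple of these subjects fit together, here is a minimal sketch that parses CSV data and queries it with SQL, using only Python's standard library. The sample data, the `people` table, and the `load_csv_into_sqlite` helper are all made up for illustration; a real pipeline would likely read from files and use a proper database server.

```python
import csv
import io
import sqlite3

# Hypothetical sample data; a real pipeline would read a file from disk.
CSV_TEXT = """id,name,city
1,Ada,London
2,Grace,New York
3,Linus,Helsinki
"""


def load_csv_into_sqlite(csv_text, conn):
    """Parse CSV text and load its rows into a hypothetical `people` table."""
    reader = csv.DictReader(io.StringIO(csv_text))
    conn.execute(
        "CREATE TABLE people (id INTEGER PRIMARY KEY, name TEXT, city TEXT)"
    )
    # Convert each CSV row (a dict of strings) into a typed tuple.
    rows = [(int(r["id"]), r["name"], r["city"]) for r in reader]
    conn.executemany(
        "INSERT INTO people (id, name, city) VALUES (?, ?, ?)", rows
    )
    conn.commit()
    return len(rows)


# An in-memory database keeps the example self-contained.
conn = sqlite3.connect(":memory:")
loaded = load_csv_into_sqlite(CSV_TEXT, conn)

# Query the loaded data back with plain SQL.
names = [row[0] for row in conn.execute("SELECT name FROM people ORDER BY name")]
print(loaded, names)
```

The same pattern (read a flat file, give it a schema, query it) shows up again in pandas, R, and Spark, just with a DataFrame in place of the SQL table.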
What else would you include?