Taming Big Data using Spark & Scala
This course is for people who are completely new to Big Data and its tools and want to learn them well enough to implement them comfortably in projects. It is also for those who already have some knowledge of Big Data tools but want to deepen it and feel at ease working on projects. Because the scenarios are implemented extensively, the course also suits people preparing for Big Data certifications such as CCA 175. The course contains a practice test for CCA 175.
Because the course focuses on setting up the entire Hadoop platform on your Windows machine (for those with less than 6 GB of RAM) or on working with the fully configured VMs provided, you will rarely need to pay for a cluster to practice the tools. Hence, the course is a ONE-TIME INVESTMENT for a secure future.
In the course, we will learn how to use Big Data tools like Hadoop, Flume, Kafka, Spark and Scala (some of the most valuable tech skills on the market today).
In this course I will show you how to:
Use Scala and Spark to analyze Big Data.
Prepare for the CCA 175 exam with the practice test available at the end of the course.
Work through extensive, real-time project scenarios with solutions written the way you will write them in REAL PROJECTS.
Use Sqoop to import data from Traditional Relational Databases to HDFS & Hive.
Use Flume and Kafka to process streaming data
Use Hive to view and store data and to partition tables
Use Spark Streaming to fetch the streaming data from Kafka & Flume
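To give a flavour of the first item in the list above, here is a minimal sketch of analyzing data with Scala and Spark. It is an illustration only, not code taken from the course: the object name WordCountSketch, the helper countWords and the sample lines are all made up for this example, and it assumes the spark-sql library is on the classpath and that Spark runs in local mode.

```scala
import org.apache.spark.sql.SparkSession

// Hypothetical example object; the names here are illustrative only.
object WordCountSketch {

  // Counts how often each word occurs in a few in-memory lines
  // using Spark's Dataset API.
  def countWords(spark: SparkSession, lines: Seq[String]): Map[String, Long] = {
    import spark.implicits._ // encoders for Dataset[String] and tuples
    lines.toDS()
      .flatMap(line => line.toLowerCase.split("\\s+")) // split lines into words
      .filter((word: String) => word.nonEmpty)         // drop empty tokens
      .groupByKey(identity)                            // group equal words
      .count()                                         // Dataset[(String, Long)]
      .collect()
      .toMap
  }

  def main(args: Array[String]): Unit = {
    // Assumption: local mode, so no cluster is needed to try this out.
    val spark = SparkSession.builder()
      .appName("WordCountSketch")
      .master("local[*]")
      .getOrCreate()
    try {
      val counts = countWords(spark, Seq("big data with spark", "spark and scala"))
      counts.toSeq.sortBy(_._1).foreach { case (word, n) => println(s"$word: $n") }
    } finally {
      spark.stop()
    }
  }
}
```

In a real project the input would typically come from HDFS, Hive or a Kafka stream rather than from an in-memory list.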
The VMs in the course are configured to work together and have Spark 2.2.0 installed. (The standard Cloudera VM has Spark 1.6 installed with NO KAFKA and requires a Spark upgrade, while the VMs provided in the course have Spark 2.2 configured and working along with Kafka.)
Big Data skills are among the most in demand right now, and with this course you can learn them quickly and easily! You can also learn about the components of the basic setup in files like “hdfs-site.xml”, “core-site.xml”, etc. They are good to know when working on a project.
The course is focused on upskilling people who do not know Big Data tools, and the target is to bring them up to the mark so that they can work in Big Data projects seamlessly.
This course comes with some project scenarios and multiple datasets to work with.
After completing this course you will feel comfortable putting Big Data, Scala and Spark on your resume, and you will be able to work with them and implement them in projects with ease!
Thanks and I will see you inside the course!
Big Data Platform Setup
Use Windows/Cloudera VM provided in the course
Simply set up IntelliJ and Spark and practice only these two
Learning Hadoop - Architecture, Concepts & Implementation
Learning Sqoop - Architecture, Concepts & Implementation
Learning Hive - Architecture, Concepts & Implementation
Learning Flume - Architecture, Concepts & Implementation
Learning Kafka - Architecture, Concepts & Implementation
Use Flume to write data to Kafka Topics and then read the data from Kafka Consumer
Learning Scala in Command Line Interface (REPL) & IntelliJ
Understanding While Loops in Scala
Writing Functions in Scala in IntelliJ
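As a taste of the last two Scala topics in the outline above, here is a small sketch of a while loop and a function definition. It is only an illustration: the names ScalaBasicsSketch, sumUpTo and square are made up for this example and are not taken from the course material.

```scala
// Hypothetical example object; the names here are illustrative only.
object ScalaBasicsSketch {

  // Sums the integers 1..n using a while loop and mutable variables.
  def sumUpTo(n: Int): Int = {
    var i = 1
    var total = 0
    while (i <= n) {
      total += i
      i += 1
    }
    total
  }

  // A simple function with a typed parameter and an explicit return type.
  def square(x: Int): Int = x * x

  def main(args: Array[String]): Unit = {
    println(sumUpTo(5)) // prints 15
    println(square(4))  // prints 16
  }
}
```

The same definitions can be pasted into the Scala REPL one at a time, or run as a small application from IntelliJ.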