PS:
Please do NOTÂ join the course if you do NOT have any basic working knowledge of AWSÂ Console and AWS Services like S3, IAM, VPC, Security Groups etc. AWSÂ Beginners may struggle understanding some of the topics.
Course explains all the labs. If you want to practice labs, it would require AWSÂ Account and may cost $$.
Basic working knowledge of Redshift is recommended, but not a must.
This course has been designed for intermediate and expert AWSÂ Developers / Architects / Administrators.
Serverless is the future of cloud computing and AWSÂ is continuously launching new services on Serverless paradigm. AWSÂ launched Athena and QuickSight in Nov 2016, Redshift Spectrum in Apr 2017, and Glue in Aug 2017. Data and Analytics on AWSÂ platform is evolving and gradually transforming to serverless mode.
Businesses have always wanted to manage less infrastructure and more solutions. Big data challenges are continuously challenging the infrastructure boundaries. Having Serverless Storage, Serverless ETL, Serverless Analytics, and Serverless Reporting, all on one cloud platform had sounded too good to be true for a very long time. But now its a reality on AWSÂ platform. AWSÂ is the only cloud provider that has all the native serverless components for a true Serverless Data Lake Analytics solution.
It’s not a secret that when a technology is new in the industry, professionals with expertise in new technologies command great salaries. Serverless is the future, Serverless is the industry demand, and Serverless is new. It’s the perfect time and opportunity to jump into Serverless Analytics on AWSÂ Platform.
In this course, we would learn the following:
1)Â We will start with Basics on Serverless Computing and Basics of Data Lake Architecture on AWS.
2)Â We will learn Schema Discovery, ETL, Scheduling, and Tools integration using Serverless AWSÂ Glue Engine built on Spark environment.
3)Â We will learn to develop a centralized Data Catalogue too using Serverless AWSÂ Glue Engine.
4) We will learn to query data lake using Serverless Athena Engine build on the top of Presto and Hive.
5)Â We will learn to bridge the data warehouse and data lake using Serverless Amazon Redshift Spectrum Engine built on the top of Amazon Redshift platform.
6)Â We will learn to develop reports and dashboards, with a powerpoint like slideshow feature, and mobile support, without building any report server, by using Serverless Amazon QuickSight Reporting Engines.
7)Â We will finally learn how to source data from data warehouse, data lake, join data, apply row security, drill-down, drill-through and other data functions using the Serverless Amazon QuickSight Reporting Engines.
This course understands your time is important, and so the course is designed to be laser-sharp on lecture timings, where all the trivial details are kept at a minimum and focus is kept on core content for experienced AWSÂ Developers / Architects / Administrators. By the end of this course, you can feel assured and confident that you are future-proof for the next change and disruption sweeping the cloud industry.
IÂ am very passionate about AWSÂ Serverless computing on Data and Analytics platform, and am covering A-to-Z of all the topics discussed in this course.
So if you are excited and ready to get trained on AWSÂ Serverless Analytics platform, IÂ am ready to welcome you in my class !
Introduction
Instructor and Course Introduction
Pre-requisites - What you'll need for this course
Course Objectives
Course Content, Convention and Resources
AWS Serverless Analytics and Data Lake Basics
Section Agenda
Learn about basics of Serverless Computing and which AWS Services fits into it
Learn basics of AWS Serverless Data Lake Architecture
Amazon S3 - Test-Data Setup
Section Agenda
Setup sample data on S3 buckets that would be used throughout this course
Configure S3 Storage Analytics
Amazon Redshift - Cluster and Sample Data Setup
Section Agenda
Introduction to Amazon Redshift
Develop Amazon Redshift Cluster
Install and setup SQL Client to work with Amazon Redshift
Load sample data in Redshift cluster
AWS Glue - Architecture and Setup
Section Agenda
Learn AWS Glue Architecture with diagrams
Learn frequently used AWS Glue Terms and their meanings
Learn about different applications and features of AWS Glue
Learn internal architecture of AWS Glue
Learn about the cost economics of AWS Glue
Setup IAM Role and policies to use with AWS Glue
Learn about the networking concepts and settings required for AWS Glue
Configure network settings for AWS Glue
AWS Glue - Database Objects
Section Agenda
Learn about the concept of Data Catalog in AWS Glue
Learn to develop databases in AWS Glue
Learn to develop tables in AWS Glue
Develop tables manually in AWS Glue
AWS Glue - Crawlers
Section Agenda
Learn about the concept of Crawler in AWS Glue
Learn about the concept of classifiers in AWS Glue
Develop crawlers in AWS Glue - Lab 1
Develop crawlers in AWS Glue - Lab 2
Develop crawlers in AWS Glue - Lab 3
Develop crawlers in AWS Glue - Lab 4
Develop crawlers in AWS Glue - Lab 5
Develop crawlers in AWS Glue - Lab 6
Develop crawlers in AWS Glue - Lab 7
AWS Glue - ETL Jobs
Section Agenda
Learn to develop serverless ETL jobs with AWS Glue
Learn to develop serverless ETL jobs with AWS Glue
Learn about different ETL job properties in AWS Glue
Learn to develop serverless ETL jobs with AWS Glue
Learn to develop serverless ETL jobs with AWS Glue with Redshift as data source
Learn to develop serverless ETL jobs with AWS Glue
Learn to develop Python scripts and properties for serverless ETL jobs using AWS Glue
Learn to develop Python scripts and properties for serverless ETL jobs using AWS Glue
Learn about built-in ETL Transformations in AWS Glue
AWS Glue - Triggers
Section Agenda
Learn about Triggers in AWS Glue
Learn about Triggers in AWS Glue
Learn about Triggers in AWS Glue
AWS Glue - Dev Ops Setup
Section Agenda
Learn about AWS Glue Development Endpoints
Learn to install and setup Apache Zeppelin
Learn to install Git and setup Port Forwarding
Learn to integrate AWS Glue Development Endpoint with Apache Zeppelin Notebook
Learn monitoring options available for AWS Glue
AWS Athena - Architecture and Setup
Section Agenda
Learn about AWS Athena Serverless Architecture
Learn about features of AWS Athena
Learn about object model in AWS Athena
AWS Athena - Development and Administration
Section Agenda
Learn to develop objects for data catalog with AWS Athena
Learn to develop objects for data catalog with AWS Athena
Learn about data types and DDL statements in AWS Athena
Learn about SerDe libraries in AWS Athena
Learn about developing objects for data catalog in AWS Athena
Learn to query AWS Logs with AWS Athena
Learn about limitations of AWS Athena
Amazon Redshift Spectrum - Architecture and Setup
Section Agenda
Learn about AWS Redshift Spectrum Serverless Architecture
Learn about features of Amazon Redshift Spectrum
Amazon Redshift Spectrum - Development
Section Agenda
Learn to develop IAM Role for Redshift Spectrum
Learn to develop database objects in data catalog with Redshift Spectrum
Learn to query data from data lake as well as data warehouse using Amazon Redshift Spectrum
Amazon QuickSight - Architecture and Setup
Section Agenda
Overview of Amazon QuickSight
Learn about Amazon QuickSight Serverless Architecture and SPICE Engine
Subscribe to Amazon QuickSight and learn subscription options
Learn about Report Authoring Workflow in Amazon QuickSight
Amazon QuickSight - Developing Your First Analysis in QuickSight
Section Agenda