Big Data on AWS
4509
Big Data on AWS
Live Virtual
Private/On Site
In this course, you will learn about cloud-based big data solutions such as Amazon Elastic MapReduce (EMR), Amazon Redshift, Amazon Kinesis, and the rest of the AWS big data platform. You will learn how to use Amazon EMR to process data using the broad ecosystem of Apache Hadoop tools like Hive and Hue. Additionally, you will learn how to create big data environments, work with Amazon DynamoDB, Amazon Redshift, and Amazon Kinesis, and leverage best practices to design big data environments for security and cost-effectiveness.
1. Overview of Big Data 2. Data Ingestion, Transfer, and Compression 3. AWS Data Storage Options 4. Using DynamoDB with Amazon EMR 5. Using Kinesis for Near Real-Time Big Data Processing 6. Introduction to Apache Hadoop and Amazon EMR 7. Using Amazon Elastic MapReduce 8. The Hadoop Ecosystem 9. Using Hive for Advertising Analytics 10. Using Streaming for Life Sciences Analytics 11. Using Hue with Amazon EMR 12. Running Pig Scripts with Hue on Amazon EMR 13. Spark on Amazon EMR 14. Running Spark and Spark SQL Interactively on Amazon EMR 15. Using Spark and Spark SQL for In-Memory Analytics 16. Managing Amazon EMR Costs 17. Securing your Amazon EMR Deployments 18. Data Warehouses and Columnar Datastores 19. Introduction to Amazon Redshift 20. Optimizing Your Amazon Redshift Environment 21. The Big Data Ecosystem on AWS 22. Visualizing and Orchestrating Big Data 23. Using Tibco Spotfire to Visualize Big Data This course allows you to test new skills and apply knowledge to your working environment through a variety of practical exercisesOutline
Labs
Upcoming Classes
Dates | Location | GTR | |
---|---|---|---|
Jun 13-15 (8:30am-4:30pm) | EST | ||
Jul 11-13 (8:30am-4:30pm) | EST |
Questions?
Whether you need assistance scheduling a class for yourself or for your group, GCA's Education Account Manager's will craft a customized training solution to meet the needs of your organization.