Big Data on AWS

3 labs · 40 credits · 3 hours 40 minutes

This quest is designed to teach you how to work with AWS services to perform big data analytics on the cloud.

Using Amazon Redshift

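This lab demonstrates how to use Amazon Redshift to create a cluster, load data, run queries, and monitor performance. Note: for this lab, you will need to download a free SQL client.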

Advanced · 10 credits · 45 minutes

Using Amazon EMR to Analyze Ngrams

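This lab demonstrates how to run an Amazon Elastic MapReduce (EMR) cluster for big data analytics and how to use Hive to analyze data with SQL-like queries. You will use Amazon EMR to create a small Hadoop cluster and run interactive Hive queries against data stored in S3. You will use Hive to normalize the data and create meaningful tables that are saved to S3, so that other jobs on the cluster can use them.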

Expert · 15 credits · 1 hour

Advanced Amazon Redshift: Analytics and Amazon Machine Learning

In this lab, you will build a smart solution using Amazon Redshift and Amazon Machine Learning that predicts delays for flights originating at Chicago’s O’Hare International Airport. You will learn how to analyze large amounts of data using Redshift. Then you will practice using Machine Learning to create a model that predicts flight delays. Prerequisites: To complete this lab successfully, you should be familiar with Redshift concepts from the introductory lab. Some knowledge of SQL and Python programming is required, although full solution code is provided. You should be comfortable using RDP to connect to a Windows server and using SQL client software. At a minimum, you should have taken the “Introduction to Amazon Redshift” and “Introduction to Machine Learning” labs.

Expert · 15 credits · 1 hour 45 minutes