Working with Amazon Redshift

1m setup · 115m access · 45m completion

Warning: Do not transmit data into the AWS Console that is not related to Qwiklabs or the lab you are taking.

SPL-17 Version 4.2.2

© 2018 Amazon Web Services, Inc. and its affiliates. All rights reserved. This work may not be reproduced or redistributed, in whole or in part, without prior written permission from Amazon Web Services, Inc. Commercial copying, lending, or selling is prohibited.

Errors or corrections? Email us at aws-course-feedback@amazon.com.

Other questions? Contact us at https://aws.amazon.com/contact-us/aws-training/

Overview

This lab provides an overview of Amazon Redshift. In this lab, you will launch and work with an Amazon Redshift cluster to analyze US domestic flight data.

Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse service that makes it simple and cost-effective to efficiently analyze all your data using your existing business intelligence tools. It is optimized for datasets ranging from a few hundred gigabytes to a petabyte or more and costs less than $1,000 per terabyte per year, a tenth the cost of most traditional data warehousing solutions.

Amazon Redshift delivers fast query and I/O performance for virtually any size dataset by using columnar storage technology and parallelizing and distributing queries across multiple nodes. We’ve made Amazon Redshift easy to use by automating most of the common administrative tasks associated with provisioning, configuring, monitoring, backing up, and securing a data warehouse.
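To make the ideas of columnar storage and parallel query execution concrete, here is a toy sketch in Python. It is not how Amazon Redshift is implemented. The sample rows, the `to_columns` and `parallel_sum` helpers, and the slice count are all made up for illustration.

```python
# Toy illustration only; real columnar engines are far more sophisticated.
# These rows are made-up samples, not the flight data used in this lab.
rows = [
    {"carrier": "AA", "origin": "JFK", "distance_miles": 2475},
    {"carrier": "DL", "origin": "ATL", "distance_miles": 760},
    {"carrier": "UA", "origin": "ORD", "distance_miles": 1745},
    {"carrier": "WN", "origin": "DAL", "distance_miles": 240},
]


def to_columns(row_dicts):
    """Pivot a list of row dicts into a dict that maps column name to values."""
    columns = {}
    for row in row_dicts:
        for name, value in row.items():
            columns.setdefault(name, []).append(value)
    return columns


def parallel_sum(values, slices):
    """Split one column into chunks, sum each chunk, then combine the partial sums.

    Each chunk stands in for the share of data a separate worker would hold.
    The chunks are summed sequentially here to keep the sketch simple.
    """
    chunks = [values[i::slices] for i in range(slices)]
    partial_sums = [sum(chunk) for chunk in chunks]
    return sum(partial_sums)


columns = to_columns(rows)

# An aggregate over one column only needs that column's values,
# not the carrier or origin values stored alongside it in a row layout.
total_distance = parallel_sum(columns["distance_miles"], slices=2)
print(total_distance)
```

The point of the sketch is that a query such as a sum over one field touches a single list in the columnar layout, and that the work on that list can be divided and recombined.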

Topics Covered

By the end of this lab, you will be able to:

  • Launch an Amazon Redshift cluster
  • Connect to Amazon Redshift by using SQL client software
  • Load data from Amazon S3 into Amazon Redshift
  • Query data from Amazon Redshift
  • Monitor Amazon Redshift performance

Lab Pre-requisites

To successfully complete this lab, you should be familiar with basic concepts of databases and SQL.

Start Lab

Notice the lab properties below the lab title:

  • setup - The estimated time to set up the lab environment
  • access - The time the lab will run before automatically shutting down
  • completion - The estimated time the lab should take to complete

  1. Launch your lab by clicking START LAB

If you are prompted for a token, use the one distributed to you (or credits you've purchased).

A status bar shows the progress of the lab environment creation process. The AWS Management Console is accessible while lab resources are being created, but your AWS resources may not be fully available until the process is complete.

  1. Open your lab by clicking OPEN CONSOLE

This will automatically log you into the AWS Management Console.

Please do not change the Region unless instructed.

Common login errors

Error: Federated login credentials

If you see this message:

  • Close the browser tab to return to your initial lab window
  • Wait a few seconds
  • Click Open Console again

You should now be able to access the AWS Management Console.

Error: You must first log out

If you see this message:

  • Click the "To logout, click here" link
  • Close the browser tab to return to your initial Qwiklabs window
  • Click Open Console again

Task 1: Launch your Amazon Redshift Cluster

You will now launch an Amazon Redshift cluster, which starts your very own database for use in this lab.

  1. Wait until the lab status at the top of this page shows Lab Running.

This indicates that required resources are available for your lab.

  1. In the AWS Management Console, on the Services menu, click Amazon Redshift.

  2. Click Launch cluster.

  3. Configure the following settings:

  • Cluster identifier:
  • Database name:
  • Database port:
  • Master user name:
  • Master user password:
  • Confirm password: Enter the same password again

  1. Click Continue.

  2. For Node Type, select dc2.large.

This lab uses the dc2.large node type, which has 160 GB of storage per node. You will use a single node for this lab, but the type and number of nodes in a Redshift cluster can be changed at any time to provide extra storage and faster data processing.
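As a rough sketch of how node count relates to storage, the arithmetic can be written out in Python. The only figure taken from this lab is the 160 GB per dc2.large node. The function names are hypothetical, and the estimate ignores compression, working space, and any other overhead, so treat it as a back-of-the-envelope calculation rather than sizing guidance.

```python
import math

# Per-node storage for dc2.large as quoted in this lab.
DC2_LARGE_GB_PER_NODE = 160


def raw_storage_gb(node_count, gb_per_node=DC2_LARGE_GB_PER_NODE):
    """Return the combined raw storage of a cluster with node_count nodes."""
    if node_count < 1:
        raise ValueError("a cluster needs at least one node")
    return node_count * gb_per_node


def nodes_for_dataset(dataset_gb, gb_per_node=DC2_LARGE_GB_PER_NODE):
    """Return the smallest node count whose raw storage covers dataset_gb.

    Simplifying assumption: the dataset size is compared directly with raw storage.
    """
    return max(1, math.ceil(dataset_gb / gb_per_node))


print(raw_storage_gb(1))       # the single-node cluster used in this lab
print(nodes_for_dataset(500))  # a hypothetical 500 GB dataset
```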

  1. Click Continue.

You will be presented with a configuration page.

  1. Configure the following, leaving other settings at their default value:
  • Choose a VPC: Select the VPC that is not the Default VPC
  • VPC security groups: Redshift Security Group
  • Available roles: Redshift-Role

If you cannot see the Redshift Security Group, confirm that you have changed the VPC.

These settings define the network and security configurations for the Redshift cluster.
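The console settings chosen so far can also be expressed as a request to the Redshift API. The following is a minimal sketch that assumes the boto3 library and its `create_cluster` call. Every value passed in the example (cluster identifier, database name, port, user name, password, security group ID, role ARN, subnet group name) is a hypothetical placeholder, not a value from this lab. Note that in the API a VPC is selected indirectly through a cluster subnet group rather than by choosing the VPC itself.

```python
def build_create_cluster_params(
    cluster_identifier,
    db_name,
    port,
    master_username,
    master_password,
    security_group_ids,
    iam_role_arns,
    subnet_group_name=None,
    node_type="dc2.large",
):
    """Collect the lab's console choices into keyword arguments for create_cluster."""
    params = {
        "ClusterIdentifier": cluster_identifier,
        "DBName": db_name,
        "Port": port,
        "MasterUsername": master_username,
        "MasterUserPassword": master_password,
        "NodeType": node_type,
        "ClusterType": "single-node",  # this lab uses a single node
        "VpcSecurityGroupIds": list(security_group_ids),
        "IamRoles": list(iam_role_arns),
    }
    if subnet_group_name is not None:
        # The subnet group determines which VPC the cluster is placed in.
        params["ClusterSubnetGroupName"] = subnet_group_name
    return params


def launch_cluster(params, region_name=None):
    """Send the request. boto3 is imported here so the builder works without it."""
    import boto3

    client = boto3.client("redshift", region_name=region_name)
    return client.create_cluster(**params)


# All values below are hypothetical placeholders, not the lab's settings.
example_params = build_create_cluster_params(
    cluster_identifier="example-cluster",
    db_name="exampledb",
    port=5439,  # Redshift's default port; the lab may specify another value
    master_username="exampleuser",
    master_password="REPLACE_ME",
    security_group_ids=["sg-EXAMPLE"],
    iam_role_arns=["arn:aws:iam::123456789012:role/Redshift-Role"],
    subnet_group_name="example-subnet-group",
)
```

Calling `launch_cluster(example_params)` would send the request. It is left uncalled here because it needs valid AWS credentials and real resource identifiers.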

  1. Click Continue.

  2. Review the settings, then click Launch Cluster (you might need to scroll down to see it).

  3. In the left navigation pane, click Clusters to view your Clusters.

  4. Click the lab link to view your cluster details.

Your cluster can take up to 5 minutes to launch.

While you are waiting, please continue reading the next section.
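If you were scripting the launch instead of watching the console, waiting for the cluster could be sketched as a simple polling loop. This is an illustration under stated assumptions, not part of the lab: `wait_for_cluster` and `cluster_status` are hypothetical helper names, the timeout and polling interval are arbitrary, and `cluster_status` assumes the boto3 `describe_clusters` call, which reports a `ClusterStatus` such as `creating` or `available`.

```python
import time


def wait_for_cluster(get_status, timeout_seconds=600, poll_seconds=15, sleep=time.sleep):
    """Poll get_status() until it returns "available".

    Returns the number of seconds spent waiting between polls.
    Raises TimeoutError if the cluster is not available in time.
    """
    waited = 0
    while True:
        if get_status() == "available":
            return waited
        if waited >= timeout_seconds:
            raise TimeoutError("cluster was not available after %d seconds" % waited)
        sleep(poll_seconds)
        waited += poll_seconds


def cluster_status(cluster_identifier, region_name=None):
    """Look up the current status of one cluster (requires boto3 and credentials)."""
    import boto3

    client = boto3.client("redshift", region_name=region_name)
    response = client.describe_clusters(ClusterIdentifier=cluster_identifier)
    return response["Clusters"][0]["ClusterStatus"]


# Dry run with a fake status source, so no AWS access is needed.
fake_statuses = iter(["creating", "creating", "available"])
seconds_waited = wait_for_cluster(lambda: next(fake_statuses), sleep=lambda _: None)
print(seconds_waited)
```

Against a real cluster, the call would look like `wait_for_cluster(lambda: cluster_status("example-cluster"))`, where the identifier is again a placeholder.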
