Transforming Data Lakes with Amazon S3 Select & Amazon Glacier Select – AWS Online Tech Talks – #AWS
Data Lakes contain massive amounts of data that companies want to store more cost-effectively and query faster and more efficiently. Amazon S3 Select can increase analytics query performance up to 400%, and Amazon Glacier Select makes it practical to extend queries to archive storage, significantly reducing data lake storage costs. In this webinar, we will demonstrate ways to accelerate analytics applications and extend your data lake to cost-effective archive storage by filtering and retrieving only a subset of data from an S3 or Glacier object instead of retrieving the entire object. We’ll discuss how to use these features with Amazon Athena or Amazon Redshift Spectrum, with third-party software, and we’ll demonstrate a query on an S3-based data lake using a Presto connector.
– Define Amazon S3 Select and Amazon Glacier Select
– Understand the scenarios in which these features can help you increase performance and extend your data lake
– See a before & after scenario of a query with and without Amazon S3 Select