|Amazon Glacier Select Analyzes Archived Data|
|Written by Kay Ewbank|
|Friday, 01 December 2017|
Amazon has released a tool that can query archived data held in Amazon Glacier, its low cost storage for data archiving. Glacier Select improves the attractiveness of Glacier because it makes the data stored in it usable.
Amazon Glacier, which we reported when it was announced in 2012, is intended for data archiving and online backup, with storage costs of as little as $0.01 per gigabyte per month. It is optimized for infrequently accessed data where a retrieval time of several hours is suitable. Glacier Select, The new tool, announced at AWS re:Invent 2017, can be used to run queries directly on data stored in Glacier, retrieving only the data you need out of your archives to use for analytics.
While Glacier data retrieval takes hours, a SQL query run on the archive can return data either as a bulk retrieval taking up to 12 hours, standard retrieval in 3-5 hours, or expedited retrieval taking 1-5 minutes. You're told when a query is complete with Amazon Simple Notification Service (SNS), and you can specify the Amazon S3 bucket where you want the output results to be stored.
Until now, if you wanted to use the data stored in a Glacier archive in any way you had to restore the entire archive. The Select tool has an API that can be used to retrieve only the relevant data for your query from the Amazon Glacier archive. Amazon Glacier Select will soon integrate with Amazon Athena and Amazon Redshift Spectrum so Glacier will become more integrated with the other data stored in Amazon storage.
Amazon says Glacier Select can be used for operations such as auditing and pattern matching over large amounts of data that is archived in Amazon Glacier. For example, you can use Amazon Glacier Select to find and retrieve only records matching a particular account or only billing data for a particular customer.
or email your comment to: email@example.com
|Last Updated ( Friday, 01 December 2017 )|