Amazon Glacier Select Analyzes Archived Data
Written by Kay Ewbank   
Friday, 01 December 2017

Amazon has released a tool that can query archived data held in Amazon Glacier, its low cost storage for data archiving. Glacier Select improves the attractiveness of Glacier because it makes the data stored in it usable.


Amazon Glacier, which we reported when it was announced in 2012, is intended for data archiving and online backup, with storage costs of as little as $0.01 per gigabyte per month. It is optimized for infrequently accessed data where a retrieval time of several hours is suitable. Glacier Select, The new tool, announced at AWS re:Invent 2017, can be used to run queries directly on data stored in Glacier, retrieving only the data you need out of your archives to use for analytics.

While Glacier data retrieval takes hours, a SQL query run on the archive can return data either as a bulk retrieval taking up to 12 hours, standard retrieval in 3-5 hours, or expedited retrieval taking 1-5 minutes. You're told when a query is complete with Amazon Simple Notification Service (SNS), and you can specify the Amazon S3 bucket where you want the output results to be stored.

Until now, if you wanted to use the data stored in a Glacier archive in any way you had to restore the entire archive. The Select tool has an API that can be used to retrieve only the relevant data for your query from the Amazon Glacier archive. Amazon Glacier Select will soon integrate with Amazon Athena and Amazon Redshift Spectrum so Glacier will become more integrated with the other data stored in Amazon storage.

Amazon says Glacier Select can be used for operations such as auditing and pattern matching over large amounts of data that is archived in Amazon Glacier. For example, you can use Amazon Glacier Select to find and retrieve only records matching a particular account or only billing data for a particular customer.


More Information

Amazon Glacier Site

Related Articles

Amazon Glacier For Cold Storage 

New AWS Services

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on, Twitter, Facebook or Linkedin.



Couchbase Connect Goes Online

Couchbase has virtualized its annual NoSQL developer conference, Couchbase Connect, and is offering three days of free content on Couchbase for developers - who can sit on their own com [ ... ]

Deep Learning Restores Time-Ravaged Photos

Researchers have devised a novel deep learning approach to repairing the damage suffered by old photographic prints. The project is open source and a PyTorch implementation is downloadable from GitHub [ ... ]

More News






or email your comment to:

Last Updated ( Friday, 01 December 2017 )