Amazon AWS Athena S3 and Glacier Mixed Bucket

Question

Amazon AWS Athena S3 and Glacier Mixed Bucket

Amazon Athena S3 Glacier Analysis Services

We have petabytes of data in S3. We are https://www.pubnub.com/ and we store usage data in S3 of our network for billing purposes. We have tab delimited log files stored in the S3 bucket. Athena gives us HIVE_CURSOR_ERROR .

Our S3 bucket is set to automatically push the AWS glacier after 6 months. Our bucket contains S3 files that are hot and ready to read in addition to the Glacier backup files. Because of this, we get access errors from Athena. The file referenced by the error is a Glacier backup.

I guess the answer is: do not store glacier backups in the same bucket. We do not have this option with ease due to our data volume sizes. I believe that Athena will not work in this setup, and we will not be able to use Athena for our log analysis.

However, if there is a way that we can use Athena, we would be delighted. Is there a solution for HIVE_CURSOR_ERROR and a way to skip Glacier files? Our s3 bucket is a flat bucket without folders .

The file name of the S3 file shown in the screenshots above and below is not displayed in the screenshot. The file reference in HIVE_CURSOR_ERROR is actually a Glacier object. You can see it in this screenshot of our S3 Bucket.

Note. I tried to post at https://forums.aws.amazon.com/ , but that was not bueno.

+6

amazon-s3 amazon-web-services amazon-glacier amazon-athena

Stephen blum Jan 25 '17 at 10:33

source share

1 answer

user6405978 · Accepted Answer · 2017-05-17T14:51:20+0000

AWS documentation dated May 16, 2017 states that Athena does not support the GLACIER storage class:

Athena does not support different storage classes in the bucket specified by LOCATION, does not support the GLACIER storage class, and does not support Requester Pays buckets. For more information, see Storage Classes , Changing Object Storage Class in | S3 | and "Requester Pays Buckets" in the Amazon Simple Storage Service Developer's Guide.

We are also interested in this; if you earn it, tell us how to do it. :-)

Amazon AWS Athena S3 and Glacier Mixed Bucket

Amazon Athena S3 Glacier Analysis Services

More articles: