I am trying to access line-delimited JSON data in S3. From my understanding of the docs, I should be able to do something like
print data(S3(Chunks(JSONLines))('s3://KEY:SECRET@bucket/dir/part-*.json')).peek()
which throws
BotoClientError: BotoClientError: Bucket names cannot contain upper-case characters when using either the sub-domain or virtual hosting calling format.
I tried other options, which led to various errors.
I can get the following to work with local files:
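To see what a generic URL parser makes of that URI, here is a small sketch using only the Python standard library. It does not use Blaze, odo or boto, and split_s3_uri is a hypothetical helper name introduced here for illustration. My guess (an assumption, not something I have confirmed) is that the upper-case complaint arises because the KEY:SECRET part ends up being treated as part of the bucket name.

```python
from urllib.parse import urlparse  # Python 3 standard library


def split_s3_uri(uri):
    """Split an s3:// URI into its parts using the generic URL parser.

    Hypothetical helper for illustration only; this is not how any
    particular library necessarily parses the URI.
    """
    parts = urlparse(uri)
    return {
        "scheme": parts.scheme,      # text before "://"
        "netloc": parts.netloc,      # everything between "//" and the next "/"
        "username": parts.username,  # text before ":" in the credentials
        "password": parts.password,  # text between ":" and "@"
        "hostname": parts.hostname,  # text after "@", lower-cased by the parser
        "path": parts.path,          # remainder, including the glob pattern
    }


if __name__ == "__main__":
    info = split_s3_uri("s3://KEY:SECRET@bucket/dir/part-*.json")
    for name, value in sorted(info.items()):
        print(name, "=", value)
```

If the whole netloc value is what gets passed along as the bucket name, it would contain upper-case characters even though the bucket itself does not.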
print data(chunks(JSONLines)(map(JSONLines, glob("/home/me/data/*")))).peek()
I'm not sure why the map(JSONLines, glob(...)) part is needed.
I really don't understand how to work with type modifiers.