I get a set (1 MB) of S3 CSV / JSON files that I would like to convert to Parquet. I expected that I could easily convert these files to Parquet using the Lambda function.
Looking at Google, I did not find a solution for this, without having a kind of Hadoop.
Since this is a file conversion, I cannot believe that there is no easy solution for this. Does anyone have Java / Scala sample code for this conversion?
source share