What is faster / easier to convert to SQL that accepts SQL scripts as input: Spark SQL, which is included as a speed level for high latency queries of Hive or Phoenix? And if so, how? I need to do a lot of upserts / join / grouping over the data. [HBase]
Is there an alternative on top of Cassandra CQL to support the above (real-time concatenation / grouping)?
I am most likely attached to Spark, as I would like to use MLlib. But for data processing, which should be my options?
Thanks kraster
source share