I am switching from Impala to SparkSQL using the following code to read a table:
my_data = sqlContext.read.parquet('hdfs://my_hdfs_path/my_db.db/my_table')
How do I call SparkSQL above so that it can return something like:
'select col_A, col_B from my_table'
scala hive hdfs apache-spark apache-spark-sql parquet
Edamame Dec 21 '16 at 2:03 2016-12-21 02:03
source share