I would like to create a python application for analyzing twitter streaming data using Apache Spark.
Is there a way to use the Apache Spark streaming functionality without setting up a Hadoop environment. How to run Apache Spark offline? I just downloaded the binaries and tried to launch the spark shell, getting a NullPointerException. Maybe someone can help.
<console>:10: error: not found: value sqlContext
import sqlContext.implicits.
<console>:10: error: not found: value sqlContext
import sqlContext.sql
source
share