Can Spark code run on a cluster without spark-submit?

I would like to develop a Scala application that connects to the master and runs the Spark part of the code itself. I would like to achieve this without using spark-submit. Is it possible? In particular, I would like to know whether the following code can run from my machine and connect to the cluster:

import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.SQLContext

val conf = new SparkConf()
  .setAppName("Meisam")
  .setMaster("yarn-client")

val sc = new SparkContext(conf)

val sqlContext = new SQLContext(sc)
val df = sqlContext.sql("SELECT * FROM myTable")

...
+4
3 answers

Add this setting to your conf:

val conf = new SparkConf()
  .setAppName("Meisam")
  .setMaster("yarn-client")
  .set("spark.driver.host", "127.0.0.1")
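For completeness, a minimal self-contained sketch of how the question's snippet and this setting fit together. The object name is made up, and it assumes the cluster's Hadoop/YARN configuration is visible to the JVM (HADOOP_CONF_DIR/YARN_CONF_DIR) and that myTable exists. Note that spark.driver.host must be an address of your machine that the cluster nodes can actually reach, so 127.0.0.1 only makes sense when everything runs on one host:

import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.SQLContext

object MeisamApp {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("Meisam")
      .setMaster("yarn-client")
      // Address of this machine as seen from the cluster nodes
      .set("spark.driver.host", "127.0.0.1")

    val sc = new SparkContext(conf)
    val sqlContext = new SQLContext(sc)

    // The query runs on the cluster; the driver stays in this JVM (client deploy mode)
    val df = sqlContext.sql("SELECT * FROM myTable")
    df.show()

    sc.stop()
  }
}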

+6

Yes, it’s possible, and what you have there is basically all that is needed to have the tasks executed on a YARN cluster in client deploy mode (where the driver runs on the machine the application is started from).

What spark-submit does for you is set up the SparkConf, e.g. the master URL, outside your code. If you leave such settings to spark-submit instead of hard-coding them, the same Spark application can run unchanged on any of the supported cluster managers - YARN, Mesos, Spark Standalone - or in local mode.
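To illustrate that portability point, a sketch (the object name is hypothetical) where the application does not call setMaster at all and the master URL is supplied from outside, either by spark-submit or as a JVM system property:

import org.apache.spark.{SparkConf, SparkContext}

object PortableApp {
  def main(args: Array[String]): Unit = {
    // No setMaster here: the master URL comes from spark-submit --master,
    // or from -Dspark.master=... when launching the JVM directly.
    val conf = new SparkConf().setAppName("PortableApp")
    val sc = new SparkContext(conf)

    println(sc.parallelize(1 to 100).sum())

    sc.stop()
  }
}

The same jar can then be launched with spark-submit --master yarn --deploy-mode client, or with java -Dspark.master=local[*] for a local run, without touching the code.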

+3

A word of caution, though: even when you can create a SparkContext like this, many things will not work, namely anything that requires shipping your compiled code to the executors, most notably UDFs (user-defined functions, AKA closures/lambdas). See https://issues.apache.org/jira/browse/SPARK-18075 for details. The workaround is to package your application into a jar and register that jar with the SparkContext so that the executors can load your classes; this is the typical pitfall when running Spark directly from an IDE such as Eclipse against a cluster.
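A minimal sketch of that workaround, assuming the application has already been packaged (e.g. with sbt package); the jar path and names below are hypothetical. Registering the jar via setJars (or sc.addJar) is what lets the executors load the classes behind your UDFs:

import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.SQLContext

object UdfFromIdeApp {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("UdfFromIdeApp")
      .setMaster("yarn-client")
      // Ship this application's compiled classes to the executors; without
      // this, UDFs defined here fail with ClassNotFoundException on the cluster.
      .setJars(Seq("target/scala-2.11/myapp_2.11-0.1.jar"))

    val sc = new SparkContext(conf)
    val sqlContext = new SQLContext(sc)

    // This UDF only works on the cluster if the executors can load its class.
    sqlContext.udf.register("plusOne", (x: Long) => x + 1)

    val df = sqlContext.range(0, 5).toDF("x")
    df.registerTempTable("nums")
    sqlContext.sql("SELECT plusOne(x) FROM nums").show()

    sc.stop()
  }
}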

0
