Running Spark code from Eclipse on a Spark cluster on another server

I set up Eclipse for Scala, created a Maven project, and wrote a simple word-count job on Windows. My Spark + Hadoop installation is on a Linux server. How can I run my Spark code from Eclipse on the Spark cluster (which is on Linux)?

Any suggestions?

+2
2 answers

In fact, this is not as simple as one might expect.

I will make a few assumptions: first, that you use sbt; second, that you work on a Linux-based machine; third, that you have two classes in your project, say RunMe and Globals; and last, that you want to set the configuration parameters inside the program. So, somewhere in your executable code, you should have something like this:

    import org.apache.spark.{SparkConf, SparkContext}
    import org.apache.spark.sql.SQLContext

    object RunMe {
      def main(args: Array[String]) {
        val conf = new SparkConf()
          .setMaster("mesos://master:5050") // if you use Mesos, and if your network resolves the hostname "master" to its IP
          .setAppName("my-app")
          .set("spark.executor.memory", "10g")
        val sc = new SparkContext(conf)
        val sqlContext = new SQLContext(sc) // SQLContext is built from the SparkContext
        // your code comes here
      }
    }
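
As a stand-in for the "your code comes here" part, the word count from the question could look roughly like this (the HDFS paths are placeholders, not something from your setup):

    val counts = sc.textFile("hdfs:///input/text.txt")  // placeholder input path
      .flatMap(line => line.split("\\s+"))              // split each line into words
      .map(word => (word, 1))                           // pair each word with a count of 1
      .reduceByKey(_ + _)                               // sum the counts per word
    counts.saveAsTextFile("hdfs:///output/word-counts") // placeholder output path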

You must follow these steps:

  • Compile the project in its root (a minimal build.sbt sketch follows these steps) using:

    $ sbt assembly

  • Submit the job to the master node; this is the interesting part (assuming your project has a target/scala/ directory containing the .jar file that corresponds to the compiled project):

    $ spark-submit --class RunMe target/scala/app.jar

Note that, since I assumed the project has two or more classes, you will need to specify which class you want to run. Also, I am fairly sure that the approaches for YARN and Mesos are very similar.
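
For completeness, sbt assembly is not built into sbt; it comes from the sbt-assembly plugin. A minimal sketch of the build files, assuming Spark 1.6 and Scala 2.11 (the project name and all versions here are placeholders, not taken from the question):

    // project/plugins.sbt -- the plugin version is an assumption
    addSbtPlugin("com.eed3si9n" % "sbt-assembly" % "0.14.3")

    // build.sbt -- name and versions are placeholders
    name := "my-app"
    scalaVersion := "2.11.8"
    libraryDependencies ++= Seq(
      "org.apache.spark" %% "spark-core" % "1.6.0" % "provided",
      "org.apache.spark" %% "spark-sql"  % "1.6.0" % "provided"
    )

Marking the Spark dependencies as "provided" keeps them out of the assembled jar, since the cluster already ships them and spark-submit puts them on the classpath at runtime.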

+1

What you are looking for is the master URL, which is set where the SparkContext is created.

You need to set your master to the cluster that you want to use.
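
For example, with a standalone Spark cluster, a minimal sketch looks like this (the hostname is a placeholder; 7077 is the default port of the standalone master):

    import org.apache.spark.{SparkConf, SparkContext}

    // "spark://master-host:7077" is a placeholder for your cluster's master URL
    val conf = new SparkConf()
      .setMaster("spark://master-host:7077")
      .setAppName("word-count")
    val sc = new SparkContext(conf)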

I invite you to read the Spark programming guide or to follow an introductory course to understand these basic concepts. Spark is not a tool you can pick up overnight; it takes some time:

http://spark.apache.org/docs/latest/programming-guide.html#initializing-spark

-2