I have a Dataset [String] and you need to convert to RDD [String]. How?
Note. I recently switched from spark 1.6 to spark 2.0. Some of my clients were expecting RDD, but now Spark is giving me Dataset.
As stated in the scala API documentation , you can call .rddin your dataset:
.rdd
val myRdd : RDD[String] = ds.rdd
A dataset is a strongly typed Dataframe, so both Dataset and Dataframe can use .rdd to convert to RDD.
Source: https://habr.com/ru/post/1650356/More articles:Check for classes when clicked (React) - javascriptPOSIX shell equivalent to <() - bashAssociation rule in R - removing a redundant rule (arules) - rIs it possible to simulate process substitution on msys / mingw (using bash 3.x) - bashJobScheduler.schedule(job) не запускает JobService - androidR - regular expression error (PCRE version) - rWebpack - export SASS files (.scss) - cssA portable way to check if an expression compiles - c ++A pointer to a structure that has a fixed array of another structure - c #Angular 2: вызов функции Javascript из typescript - javascriptAll Articles