Connect to Spark SQL via ODBC

According to this page: https://spark.apache.org/sql/ you can connect existing BI tools to Spark SQL via ODBC or JDBC.

I do not mean Shark, since it is basically EOL:

It is for this reason that we are finishing development in Shark as a separate project and moving all our development resources to Spark SQL, a new component in Spark.

How does a BI tool (like Tableau) connect to Spark SQL via ODBC?

+6
6 answers

Simba provides the ODBC driver that Databricks uses; however, that driver only applies to the Databricks distribution. We are launching a public version for use with Apache Spark tomorrow (Wed, December 3) at www.simba.com. You can then download the trial driver for use with Tableau.

+2

With the release of Spark SQL 1.1, you also have a Thrift JDBC server; see https://spark.apache.org/docs/latest/sql-programming-guide.html#distributed-sql-engine
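To make the JDBC route concrete, here is a minimal sketch of how a client would address the Thrift server. It assumes the server was started with Spark's `./sbin/start-thriftserver.sh` and is listening on the default `localhost:10000`, as described in the linked guide. The helper names `thrift_jdbc_url` and `beeline_command` are made up for illustration and are not part of Spark.

```python
def thrift_jdbc_url(host="localhost", port=10000, database="default"):
    """Build a HiveServer2-style JDBC URL for the Spark SQL Thrift server.

    The host, port and database defaults are assumptions; adjust them to
    match where your Thrift server is actually listening.
    """
    return f"jdbc:hive2://{host}:{port}/{database}"


def beeline_command(url, user=None):
    """Return the argument list for testing the connection with beeline.

    Assumes it is run from the Spark home directory, where ./bin/beeline lives.
    """
    cmd = ["./bin/beeline", "-u", url]
    if user:
        # -n passes the user name to connect as
        cmd += ["-n", user]
    return cmd


if __name__ == "__main__":
    url = thrift_jdbc_url()
    print(url)
    print(" ".join(beeline_command(url)))
```

A BI tool that speaks JDBC would be pointed at the same URL, using a Hive JDBC driver on its classpath.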

+4

Please take a look at: http://www.openstratio.org/blog/connecting-to-the-stratio-big-data-platform-using-odbc-2/

Stratio is a platform that includes a certified Spark distribution and lets you connect Spark to any type of data repository (e.g. Cassandra, MongoDB, ...). It has an ODBC driver, so you can write SQL queries that are translated into Spark jobs or, where possible, into even faster direct queries against Cassandra or whichever database you connect to. This makes it quite simple to connect Tableau to Spark and your data warehouse. If you need help, we will be happy to help you.

Disclaimer: I am one of the developers of Stratio ODBC

+1

As Carlos said, Stratio Meta is a module that acts as a parser, validator, planner, and coordinator layer over various storage layers (currently only Cassandra and Mongo, with HDFS coming in the short term). It offers a shell with a SQL-like language, a Java/Scala API, a REST API, and ODBC (JDBC soon). It also uses another Stratio module, Stratio Deep, which lets us use Apache Spark to execute a query quickly and efficiently.

Disclaimer: I am currently working at Stratio Big Data.

+1

Simba will offer one: http://databricks.com/blog/2014/04/30/Databricks-selects-Simba-ODBC-driver-for-shark.html. The official release date is not known.

[update]

Use the Hive ODBC driver to connect to Spark SQL, as described here and here.
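As a rough sketch of the Hive ODBC route from a script rather than a BI tool, the snippet below builds a DSN-less connection string and opens it with `pyodbc`. Everything specific in it is an assumption: the driver name `Hive ODBC Driver` must match whatever name your installed driver registers, and the `Host`, `Port`, `HiveServerType` and `UID` keys follow the convention of Simba-based Hive drivers, so other vendors may use different names. Check your driver's documentation before relying on it.

```python
def hive_odbc_connection_string(host, port=10000, driver="Hive ODBC Driver", user=None):
    """Build a DSN-less ODBC connection string for a HiveServer2-compatible
    endpoint such as the Spark SQL Thrift server.

    The driver name and key names are assumptions (Simba-style); adjust
    them to whatever your installed Hive ODBC driver expects.
    """
    parts = {
        "Driver": "{" + driver + "}",
        "Host": host,
        "Port": str(port),
        "HiveServerType": "2",  # 2 = HiveServer2 protocol
    }
    if user:
        parts["UID"] = user
    return ";".join(f"{key}={value}" for key, value in parts.items())


def run_query(conn_str, sql):
    """Run one query over ODBC and return all rows.

    pyodbc is imported lazily so the helper above works without it installed.
    """
    import pyodbc

    # Hive-style endpoints do not support transactions, so use autocommit.
    conn = pyodbc.connect(conn_str, autocommit=True)
    try:
        cursor = conn.cursor()
        cursor.execute(sql)
        return cursor.fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    print(hive_odbc_connection_string("localhost", user="spark"))
```

With a driver installed and the Thrift server running, `run_query(hive_odbc_connection_string("localhost"), "SHOW TABLES")` would be the simplest smoke test. Tableau and similar tools take the same host and port settings through the driver's DSN dialog.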

+1

For Spark on Azure HDInsight, you can connect Tableau (or Power BI) as described here: https://azure.microsoft.com/en-us/documentation/articles/hdinsight-apache-spark-use-bi-tools/. The ODBC driver is located here: http://www.microsoft.com/en-us/download/details.aspx?id=47713

0

Source: https://habr.com/ru/post/974992/

