How to load SQL data into Hortonworks?

I installed Hortonworks Sandbox on my computer. I tried loading a CSV file and got it into a structured table without problems (Hive + Hadoop). Now I want to transfer my existing SQL database (MS SQL 2008 R2) to the Sandbox. How do I do this? I also want to connect to it from my project (VS 2010, C#).

Can I connect via ODBC?

I heard that Sqoop is used to transfer data from SQL to Hadoop, so how can I do this migration using Sqoop?

2 answers

You can write your own data transfer job, but Sqoop would be more convenient. To use it, you need to download Sqoop and the corresponding connector, which in your case is the Microsoft SQL Server Connector for Apache Hadoop. You can download it from here. Go through the Sqoop User Guide; it covers everything in detail.
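As a rough illustration of what a Sqoop import from SQL Server into Hive might look like, here is a minimal Python sketch that only assembles the `sqoop import` command line and prints it. The host, database, table, and user names are hypothetical placeholders, and it assumes the SQL Server JDBC driver is already installed for Sqoop; check the Sqoop User Guide for the options your version supports.

```python
import shlex


def build_sqoop_import(host, database, table, username, port=1433, mappers=1):
    """Assemble a `sqoop import` argument list for a SQL Server table.

    All argument values are supplied by the caller; nothing is executed here.
    """
    # Standard SQL Server JDBC URL form: jdbc:sqlserver://host:port;databaseName=db
    jdbc_url = f"jdbc:sqlserver://{host}:{port};databaseName={database}"
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "-P",                      # prompt for the password instead of passing it inline
        "--table", table,
        "--hive-import",           # load the imported data into a Hive table
        "--num-mappers", str(mappers),
    ]


if __name__ == "__main__":
    # Hypothetical values, for illustration only.
    cmd = build_sqoop_import("sqlhost.example.com", "SalesDb", "Customers", "etl_user")
    # shlex.join quotes the JDBC URL so the ';' survives a shell copy-paste.
    print(shlex.join(cmd))
```

The printed line can be pasted into a shell on the Sandbox. Using a single mapper avoids needing a split column, at the cost of a non-parallel import.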

Hive also supports ODBC. You can find more on this page.


I wrote down the steps you need to go through in Hortonworks Sandbox to install the JDBC driver and make it work: http://hortonworks.com/community/forums/topic/import-microsoft-sql-data-into-sandbox/

To connect to Hadoop from your C# project, you can use the Hortonworks Hive ODBC driver, available at http://hortonworks.com/thankyou-hdp13/#addon-table (the PDF on the same page explains how it works). I used Hive Server Type 2 with the username sandbox.
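To make the ODBC route a bit more concrete, here is a small Python sketch, assuming a DSN has already been created for the Hive ODBC driver in the ODBC Data Source Administrator. The DSN name `HiveSandbox` and the query are hypothetical, and `pyodbc` is a third-party package that is imported only when a query is actually run. In a C# project, the same DSN-style connection string could be passed to `System.Data.Odbc.OdbcConnection`.

```python
def build_odbc_connection_string(dsn, username, password=None):
    """Build a DSN-based ODBC connection string using the standard DSN/UID/PWD keys."""
    parts = [f"DSN={dsn}", f"UID={username}"]
    if password is not None:
        parts.append(f"PWD={password}")
    return ";".join(parts)


def fetch_rows(connection_string, query):
    """Run a query over ODBC and return all rows (requires the pyodbc package)."""
    import pyodbc  # third-party; imported lazily so the helper above works without it

    # Hive does not support transactions, so open the connection in autocommit mode.
    connection = pyodbc.connect(connection_string, autocommit=True)
    try:
        cursor = connection.cursor()
        cursor.execute(query)
        return cursor.fetchall()
    finally:
        connection.close()


if __name__ == "__main__":
    # "HiveSandbox" is a hypothetical DSN name; "sandbox" is the username from the answer.
    conn_str = build_odbc_connection_string("HiveSandbox", "sandbox")
    print(conn_str)
    # rows = fetch_rows(conn_str, "SHOW TABLES")  # uncomment once the DSN exists
```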


Source: https://habr.com/ru/post/1485736/

