Please use this identifier to cite or link to this item:
http://dspace.dtu.ac.in:8080/jspui/handle/repository/15922
Title: | PERFORMANCE TUNING FOR EFFECTIVE SQOOP TO TRANSFORM BETWEEN SQL & NOSQL |
Authors: | SHARMA, PRANAV |
Keywords: | EFFECTIVE SQOOP TRANSFERRING DATA NOSQL SQL |
Issue Date: | Jul-2017 |
Series/Report no.: | TD-1901; |
Abstract: | Transferring data to and from relational databases is challenging and laborious. Because data transfer requires careful handling, Apache Sqoop, short for “SQL to Hadoop,” was created to perform bidirectional data transfer between Hadoop and almost any external structured data store. Taking advantage of MapReduce, Hadoop’s execution engine, Sqoop performs the transfers in a parallel manner. It is very challenging task to transfer data with maximum performance and efficient manner. The variety of data sources and analytic targets presents a challenge in setting up effective data transfer pipelines. Data sources can have a variety of subtle inconsistencies: different DBMS providers may use different dialects of SQL, treat data types differently, or use distinct techniques to offer optimal transfer speeds. Depending on whether you’re importing to Hive, Pig, Impala, or your own MapReduce pipeline, you may want to use a different file format or compression algorithm when writing data to HDFS. Sqoop helps the data engineer tasked with scripting such transfers by providing a compact but powerful tool that flexibly negotiates the boundaries between these systems and their data layouts. In order to enhance the performance of Sqoop import and export operation, different parameters are configured in this project. Basic information regarding Sqoop import and export tool are presented in the different chapter. Analysis of configured Sqoop parameters is documented in the result and analysis chapter. |
URI: | http://dspace.dtu.ac.in:8080/jspui/handle/repository/15922 |
Appears in Collections: | M.E./M.Tech. Computer Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Performance tuning for effective sqoop to transform between Sql & noSql.pdf | 1.45 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.