Please use this identifier to cite or link to this item: http://dspace.dtu.ac.in:8080/jspui/handle/repository/19847
Title: STUDY AND ANALYSIS OF BIG DATA ANALYTICS FRAMEWORKS AND CHALLENGES
Authors: SHEKHAWAT, SAURABH SINGH
Keywords: BIG DATA
HDFS
HADOOP
SPARK
ARCHITECTURE
Issue Date: May-2023
Series/Report no.: TD-6407;
Abstract: Data is being generated at an exponential rate all around us. We are ourselves the reason for this volume: an individual generates, on average, 40 exabytes of data daily. The data can come from any source, such as social media, online transactions, IoT devices, digital media, records, and sensors, and handling this huge amount of data has become a challenging task. The data can be big or small in size and can take any form: unstructured, semi-structured, or structured. Data at this scale cannot be handled with traditional techniques. Therefore, to handle such large amounts of unstructured data, we need methods and mechanisms that are simple to use, quick to process, and effective. The two main technologies that can manage any type of information are Hadoop and Spark, both of which provide many tools and techniques for storing, processing, and analysing data. The Hadoop framework processes data in a distributed manner. Its basic elements are HDFS for storage, and MapReduce and YARN for parallel distributed processing, task scheduling, and data analysis. Spark, in turn, uses resilient distributed datasets (RDDs) for fast processing to overcome computational complexity. This report covers the Hadoop architecture, how Hadoop stores and processes data using MapReduce, what Apache Spark is and how it performs its jobs, how Spark improves on Hadoop, and a comparative analysis of Hadoop and Spark.
URI: http://dspace.dtu.ac.in:8080/jspui/handle/repository/19847
Appears in Collections:M.E./M.Tech. Computer Engineering

Files in This Item:
File: Saurabh Singh Shekhawat M.Tech.pdf
Size: 4.42 MB
Format: Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.