International Conference and Workshop on Communication, Computing and Virtualization |
Foundation of Computer Science USA |
ICWCCV2015 - Number 3 |
September 2015 |
Authors: Smita Chaturvedi, Nivedita Bhirud, Fiona Lowden |
Smita Chaturvedi, Nivedita Bhirud, Fiona Lowden. Solving Big Data Problem using Hadoop File System (HDFS). International Conference and Workshop on Communication, Computing and Virtualization. ICWCCV2015, 3 (September 2015), 0-0.
Big data is data that is too large to be processed on a single machine, or data that is valuable not just to one person but to many. Such datasets are so extremely large that they must be analysed computationally to reveal patterns, associations, and trends. For example, Amazon records how many users visit a particular page, from which IP addresses they arrive, and for how long they stay; this accumulated log information is an example of Big data. Enormous volumes of data are generated by mobile phones, online stores, and research activity. This data is potentially created very fast, arrives from different sources in a variety of formats, and, while rarely worthless, much of it has low value per record. Hadoop addresses the Big data problem with HDFS (the Hadoop Distributed File System), which stores data in distributed form across many machines, so that the plentiful data can be stored cost-effectively and processed efficiently. This paper demonstrates running MapReduce code on Apache Hadoop.
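To make the MapReduce step concrete, below is a minimal sketch of the kind of job the paper runs on Apache Hadoop, applied to the Amazon example above: counting page visits per client IP address. The input format (one tab-separated log record per line: IP address, page, visit duration) and the names VisitCount, VisitMapper, and VisitReducer are illustrative assumptions for this sketch, not details taken from the paper.

    import java.io.IOException;

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.io.IntWritable;
    import org.apache.hadoop.io.LongWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapreduce.Job;
    import org.apache.hadoop.mapreduce.Mapper;
    import org.apache.hadoop.mapreduce.Reducer;
    import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
    import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

    public class VisitCount {

        // Mapper: for every log record, emit (ip, 1).
        // Assumed record layout: <ip>\t<page>\t<duration>
        public static class VisitMapper
                extends Mapper<LongWritable, Text, Text, IntWritable> {
            private static final IntWritable ONE = new IntWritable(1);
            private final Text ip = new Text();

            @Override
            protected void map(LongWritable key, Text value, Context context)
                    throws IOException, InterruptedException {
                String[] fields = value.toString().split("\t");
                if (fields.length > 0 && !fields[0].isEmpty()) {
                    ip.set(fields[0]);
                    context.write(ip, ONE);
                }
            }
        }

        // Reducer: sum the counts emitted for each IP address.
        public static class VisitReducer
                extends Reducer<Text, IntWritable, Text, IntWritable> {
            @Override
            protected void reduce(Text key, Iterable<IntWritable> values,
                                  Context context)
                    throws IOException, InterruptedException {
                int sum = 0;
                for (IntWritable v : values) {
                    sum += v.get();
                }
                context.write(key, new IntWritable(sum));
            }
        }

        public static void main(String[] args) throws Exception {
            Job job = Job.getInstance(new Configuration(), "visit count");
            job.setJarByClass(VisitCount.class);
            job.setMapperClass(VisitMapper.class);
            // Summation is associative, so the reducer can double as a combiner.
            job.setCombinerClass(VisitReducer.class);
            job.setReducerClass(VisitReducer.class);
            job.setOutputKeyClass(Text.class);
            job.setOutputValueClass(IntWritable.class);
            FileInputFormat.addInputPath(job, new Path(args[0]));   // HDFS input dir
            FileOutputFormat.setOutputPath(job, new Path(args[1])); // HDFS output dir
            System.exit(job.waitForCompletion(true) ? 0 : 1);
        }
    }

Packaged into a jar, the job would be submitted in the usual way, e.g. hadoop jar visitcount.jar VisitCount /logs/input /logs/output, with both paths residing in HDFS so that the map tasks run on the machines where the data blocks are stored.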