A Unified Frame Work to Integrate Hadoop and IOT to Resolve the Issues of Storage, Processing with Leveraging Capacity of Analytics

Gudapati Syam Prasad; P Rajesh; Sk Wasim Akram

doi:10.14419/ijet.v7i2.32.15390

Authors

Gudapati Syam Prasad
P Rajesh
Sk Wasim Akram

Received date: July 10, 2018

Accepted date: July 10, 2018

Published date: May 31, 2018

DOI:

https://doi.org/10.14419/ijet.v7i2.32.15390

Keywords:

Hadoop, IOT, Analytics, Storage, Map Reduce.

Abstract

The Internet of Things (IOT) is a current trend in both research and real-time applications. Its functional benefits range from smart houses to smart cities. The main purpose of IOT is to connect various devices logically and let them interact without human intervention. This paper focuses on leveraging the capacity of analytics in IOT and on resolving the storage issues posed by the bulk data that IOT generates. The proposed idea uses the Hadoop platform to store the data and then performs analytics on that data to make better use of IOT communications. The importance of this approach is explained through real-time scenarios in which the Hadoop platform and IOT blend well. The Hadoop Distributed File System (HDFS) can store the various categories of data, and Sqoop or Flume can ingest data from external platforms. The data in HDFS can then be processed with the Map Reduce (MR) technique. Once the data is available in HDFS, analytics can be performed with Hive, Pig, or R using machine learning or data mining techniques. The outcome of the proposed idea is a unified framework that integrates the Hadoop and IOT platforms, provides storage for bulk data, processes the stored data, and applies analytics so as to serve various stakeholders effectively.
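As an illustration of the Map Reduce processing step described in the abstract, the following is a minimal sketch in plain Python rather than the authors' implementation. It imitates the map, shuffle, and reduce phases in memory to compute an average reading per device. The record layout (`device_id,timestamp,temperature`), the sample values, and all function names are assumptions made for this example; on a real cluster the input would be read from HDFS and the phases would run as a Hadoop job.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical IOT readings, one CSV line per reading, standing in for
# data that has already been ingested into HDFS.
# Assumed layout: device_id,timestamp,temperature_celsius
RECORDS = [
    "sensor-1,2018-05-31T10:00:00,21.5",
    "sensor-2,2018-05-31T10:00:00,19.0",
    "sensor-1,2018-05-31T10:05:00,22.5",
    "sensor-2,2018-05-31T10:05:00,20.0",
]


def map_phase(line):
    """Map: emit one (device_id, temperature) pair per input line."""
    device_id, _timestamp, temperature = line.split(",")
    yield device_id, float(temperature)


def shuffle(pairs):
    """Shuffle: group all emitted values by key, as the framework would."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: collapse the values for one key into their average."""
    return key, mean(values)


def run_job(lines):
    """Run the three phases in sequence over an iterable of lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


averages = run_job(RECORDS)
print(averages)
```

The same per-device aggregation could also be written as a Hive or Pig query once the data is in HDFS. The explicit phases are shown here only to make the processing model visible.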

