Comparative study of NoSQL databases for big data storage

  • Authors

    • Gourav Bathla
    • Rinkle Rani
    • Himanshu Aggarwal
    2018-03-11
    https://doi.org/10.14419/ijet.v7i2.6.10072
  • Keywords: NoSQL database, column-oriented, graph-based, document-based, key-value
  • Big data is a large-scale collection of structured, semi-structured and unstructured data. It is generated by social networks, business organizations, and the interactions and views of socially connected users, and it is used for important decision making in business and research organizations. Storage that can process data at this scale efficiently and extract important information with low response time is a current competitive need. Relational databases, which have dominated storage technology for a long time, do not seem suitable for mixed types of data, because such data cannot always be represented as rows and columns in tables. NoSQL (Not only SQL) is complementary to SQL technology and provides various storage formats that are compatible with high-velocity, large-volume and high-variety data. NoSQL databases fall into four categories: column-oriented, key-value, graph-based and document-oriented databases. Approximately 120 real solutions exist across these categories; the most commonly used are elaborated in the Introduction section. Several research works have analyzed these NoSQL technology solutions, but they have not mentioned the situations in which a particular data storage technique should be chosen. In this study and analysis, we try to answer the question of technology selection based on specific requirements. Previous research has compared NoSQL data storage techniques using real examples such as MongoDB and Neo4j. Our observation is that users who have adequate knowledge of the NoSQL categories and how they compare can easily choose the most suitable category and then select a real solution from that category.
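
    The four categories named in the abstract can be illustrated with a minimal sketch that uses only plain Python data structures. It is not taken from the paper and does not use any database's actual API; the record (a user "u1" who follows "u2"), the field names and the helper `followed_by` are all hypothetical and serve only to show how the same information is shaped under each model.

```python
import json

# Hypothetical record used throughout: user "u1" (Asha) follows user "u2" (Ravi).

# Key-value: the store sees only a key and an opaque value (here a JSON string).
key_value = {"user:u1": json.dumps({"name": "Asha", "city": "Pune"})}

# Document-oriented: a nested, self-describing record whose fields are visible to queries.
document = {"_id": "u1", "name": "Asha", "city": "Pune", "follows": ["u2"]}

# Column-oriented: row key -> column family -> column name -> value.
column_family = {
    "u1": {
        "profile": {"name": "Asha", "city": "Pune"},
        "social": {"follows:u2": "1"},
    }
}

# Graph-based: nodes with properties, plus typed edges between them.
nodes = {"u1": {"name": "Asha"}, "u2": {"name": "Ravi"}}
edges = [("u1", "FOLLOWS", "u2")]


def followed_by(user_id):
    """Return the ids of nodes reached from user_id over FOLLOWS edges."""
    return [dst for src, rel, dst in edges if src == user_id and rel == "FOLLOWS"]
```

    In this sketch the key-value form must decode the whole value before any field can be read, while the graph form answers relationship questions by walking edges directly. That difference is the kind of requirement-driven trade-off the abstract refers to.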

  • How to Cite

    Bathla, G., Rani, R., & Aggarwal, H. (2018). Comparative study of NoSQL databases for big data storage. International Journal of Engineering & Technology, 7(2.6), 83-87. https://doi.org/10.14419/ijet.v7i2.6.10072