SodhanaRef: a reference management software built using hybrid semantic measure

 
 
 
  • Abstract
  • Keywords
  • References
  • PDF
  • Abstract


    Reference management softwares are widely used by the researchers to maintain their collection of scholarly literature that exist in PDF format. Though widely used most of the reference management softwares have no sophisticated Information retrieval except few which offer advanced search that includes search for title, author etc., These softwares in the present day market do not give importance to the semantic similarity or relatedness concept, query expansion and finding the context within the query to find the concept behind the user mentioned query.

    With SodhanaRef, a solution is offered to deal with the above-mentioned issues by building reference management software using a mix up of corpus-based and knowledge-based semantic measures. Based on the evolution done on about 200 various scholarly literatures in the PDF form, SodhanaRef shows a good performance over Mendeley when compared between these two reference management softwares for title search. The other evaluations for finding the semantic similarity between the user mentioned query and the existing titles in the title search and for identifying the concept behind the query along with identifying the concept of a research publication have shown good results with an average precision between 0.8 to 1 for each query.


  • Keywords


    Corpus-Based Semantic Measures; Knowledge-Based Semantic Measures; Reference Management Software; Ontology; RDF.

  • References


      [1] Beel, Jöran, and Bela Gipp. "Google Scholar’s ranking algorithm: an introductory overview." Proceedings of the 12th International Conference on Scientometrics and Informetrics (ISSI’09). Vol. 1. 2009.

      [2] Zhu, Yongjun, Erjia Yan, and Fei Wang. "Semantic relatedness and similarity of biomedical terms: examining the effects of recency, size, and section of biomedical publications on the performance of word2vec." BMC medical informatics and decision-making 17.1 2017: 95.

      [3] Gabrilovich, Evgeniy, and Shaul Markovitch. "Computing semantic relatedness using wikipedia-based explicit semantic analysis." IJcAI. Vol. 7. 2007.

      [4] Ensan, Faezeh, and Ebrahim Bagheri. "Document retrieval model through semantic linking." Proceedings of the Tenth ACM International Conference on Web Search and Data Mining. ACM, 2017. https://doi.org/10.1145/3018661.3018692.

      [5] Bai, Jing, et al. "Using query contexts in information retrieval." Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2007.

      [6] Hensley, Merinda Kaye. “Citation management software: features and futures.” Reference & User Services Quarterly 50.3, 204-208. 2011. https://doi.org/10.5860/rusq.50n3.204.

      [7] Basak, Sujit Kumar. “A Comparison of Researcher's Reference Management Software: RefWorks, Mendeley, and EndNote.” Journal of Economics and Behavioral Studies 6.7, 561, 2014.

      [8] Gilmour, Ron, and Laura Cobus-Kuo. “Reference management software: a comparative analysis of four products.” Issues in Science and Technology Librarianship 66.66, 63-75, 2011.

      [9] Beel, Joeran, et al. "Docear: An academic literature suite for searching, organizing and creating academic literature." Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries. ACM, 2011. https://doi.org/10.1145/1998076.1998188.

      [10] Ray, Aswini Kumar, and D. B. Ramesh. "Zotero: Open Source Citation Management Tool for Researchers."International journal of library and information sciences. 2017

      [11] Parabhoi, Lambodara, Arabinda Kumar Seth, and Sushanta Kumar Pathy. "Citation Management Software Tools: a Comparison with Special Reference to Zotero and Mendeley." Journal of Advances in Library and Information Science 6.3 2017: 288-293.

      [12] Chirita, Paul Alexandru, et al. "Using ODP metadata to personalize search." Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2005. https://doi.org/10.1145/1076034.1076067.

      [13] Dumais, Susan, et al. "Stuff I've seen: a system for personal information retrieval and re-use." ACM SIGIR Forum. Vol. 49. No. 2. ACM, 2016. https://doi.org/10.1145/2888422.2888425.

      [14] Kim, H. R., and Philip K. Chan. "Personalized ranking of search results with learned user interest hierarchies from bookmarks." WEBKDD’05 Workshop. 2005.

      [15] Schamber, Linda, Michael B. Eisenberg, and Michael S. Nilan. "A re-examination of relevance: toward a dynamic, situational definition∗." Information processing & management 26.6 1990: 755-776. https://doi.org/10.1016/0306-4573(90)90050-C.

      [16] Teevan, Jaime, Susan T. Dumais, and Eric Horvitz. "Personalizing search via automated analysis of interests and activities." Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2005.

      [17] Liu, Fang, Clement Yu, and Weiyi Meng. "Personalized web search by mapping user queries to categories." Proceedings of the eleventh international conference on Information and knowledge management. ACM, 2002. https://doi.org/10.1145/584792.584884.

      [18] Croft, W. Bruce, and Xing Wei. Context-based topic models for query modification. CIIR Technical Report, University of Massachusetts, 2005.

      [19] Li, Dandan, Jianwei Du, and Shuzhen Yao. "Research on Computer Science Domain Ontology Construction and Information Retrieval." Knowledge Engineering and Management. Springer, Berlin, Heidelberg, 2011. 603-608. https://doi.org/10.1007/978-3-642-25661-5_74.

      [20] Harispe, Sébastien, et al. "Semantic similarity from natural language and ontology analysis." Synthesis Lectures on Human Language Technologies 8.1 2015: 1-254. https://doi.org/10.2200/S00639ED1V01Y201504HLT027.

      [21] Singhal, Amit. "Introducing the knowledge graph: things, not strings." Official google blog 2012.

      [22] Xun, Guangxu, et al. "A survey on context learning." IEEE Transactions on Knowledge and Data Engineering 29.1 2017: 38-56. https://doi.org/10.1109/TKDE.2016.2614508.

      [23] S. Cucerzan, “Large-scale named entity disambiguation based on Wikipedia data,” in Proc. EMNLP-CoNLL, 2007, vol. 7, pp. 708–716.

      [24] Mandava Kranthi Kiran and K. Thammi Reddy. “ An Approach Towards Establishing Reference Linking in Desktop Reference Manager”, Journal of Information and Knowledge Management, World Scientific Publication (in Press)

      [25] Vicknair, Chad, et al. "A comparison of a graph database and a relational database: a data provenance perspective." Proceedings of the 48th annual southeast regional conference. ACM, 2010. https://doi.org/10.1145/1900008.1900067.

      [26] Casanovas, Pompeu, et al. "Semantic web for the legal domain: the next step." Semantic Web 7.3 2016: 213-227. https://doi.org/10.3233/SW-160224.

      [27] Di Iorio, Angelo, et al. "Describing bibliographic references in RDF." SePublica. 2014.

      [28] Noy, Natalya F., and Deborah L. McGuinness. "Ontology development 101: A guide to creating your first ontology." 2001.

      [29] http://dumps.wikimedia.org/enwiki/20171001

      [30] Decker, Stefan, and Martin Frank. "The social semantic desktop." Digital Enterprise Research Institute, DERI Technical Report May 2 2004: 7.

      [31] Cuzzocrea, Alfredo, et al. "MapReduce-based algorithms for managing big RDF graphs: State-of-the-art analysis, paradigms, and future directions." Proceedings of the 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing. IEEE Press, 2017. https://doi.org/10.1109/CCGRID.2017.109.

      [32] Reddy, K. Thammi, M. Shashi, and L. Pratap Reddy. "Hybrid Clustering Approach for Concept Generation." International Journal of Computer Science and Network Security (IJCSNS) 7.4 2007: 62-69.

      [33] [33] Klyuev, Vitaly, and Yannis Haralambous. "Query expansion: Term selection using the ewc semantic relatedness measure." Computer Science and Information Systems (FedCSIS), 2011 Federated Conference on. IEEE, 2011.

      [34] Zhang, Jiuling, Beixing Deng, and Xing Li. "Concept based query expansion using wordnet." Proceedings of the 2009 international e-conference on advanced science and technology. IEEE Computer Society, 2009. https://doi.org/10.1109/AST.2009.24.

      [35] Varelas, Giannis, et al. "Semantic similarity methods in wordNet and their application to information retrieval on the web." Proceedings of the 7th annual ACM international workshop on Web information and data management. ACM, 2005. https://doi.org/10.1145/1097047.1097051.

      [36] Boubacar, Abdoulahi, and Zhendong Niu. "Concept Based Query Expansion." Semantics, Knowledge and Grids (SKG), 2013 Ninth International Conference on. IEEE, 2013. https://doi.org/10.1109/SKG.2013.10.

      [37] Bhavani, M., K. Thammi, and M. Shashi. "A rough set based approach to detect plagiarism." TENCON 2009-2009 IEEE Region 10 Conference. IEEE, 2009.

      [38] Qiu, Yonggang, and Hans-Peter Frei. "Concept based query expansion." Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 1993.

      [39] https://wordnet.princeton.edu/.

      [40] https://www.marklogic.com/.


 

View

Download

Article ID: 9544
 
DOI: 10.14419/ijet.v7i2.9544




Copyright © 2012-2015 Science Publishing Corporation Inc. All rights reserved.