Abstract
Citation index measures the impact or quality of a research publication. Currently, all the standard journal citation indices are used to measure the impact of individual research article published in those journals and are based on the citation count, making them a pure quantitative measure. To address this, as our first contribution, we propose to assign weights to the edges of citation network using three context based quality factors: 1. Sentiment analysis of the text surrounding the citation in the citing article, 2. Self-citations, 3. Semantic similarity between citing and cited article. Prior approaches make use of PageRank algorithm to compute the citation scores. This being an iterative process is not essential for acyclic citation networks. As our second contribution, we propose a non-iterative graph traversal based approach, which uses the edge weights and the initial scores of the non-cited nodes to compute the citation indices by visiting the nodes in topologically sorted order. Experimental results depict that rankings of citation indices obtained by our approach are improved over the traditional citation count based ranks. Also, our rankings are similar to that of PageRank based methods; but, our algorithm is simpler and 70 % more efficient. Lastly, we propose a new model for future reference, which computes the citation indices based on solution of system of linear inequalities, in which human-expert’s judgment is modeled by suitable linear constraints.
Similar content being viewed by others
References
Amaldi, E., & Kann, V. (1995). The complexity and approximability of finding maximum feasible subsystems of linear relations. Theoretical Computer Science, 147(1), 181–210.
Amin, M., & Mabe, M. A. (2003). Impact factors: Use and abuse. Medicina (Buenos Aires), 63(4), 347–354.
Athar, A., & Teufel, S. (2012). Detection of implicit citations for sentiment detection. In Proceedings of the Workshop on Detecting Structure in Scholarly Discourse (pp. 18–26). Association for Computational Linguistics.
Casal, G. B. (2004). Assessing the quality of articles and scientific journals: Proposal for weighted impact factor and a quality index. Psychology in Spain, 8, 60–76.
Cavalcanti, D. C., Prudêncio, R. B. C., Pradhan, S. S., Shah, J. Y., & Pietrobon, S. (2011). Good to be bad? Distinguishing between positive and negative citations in scientific impact. In Proceedings of 23rd IEEE International Conference on Tools with Artificial Intelligence (ICTAI), 2011 (pp. 156–162).
Chinneck, J. W. (2001). Fast heuristics for the maximum feasible subsystem problem. INFORMS Journal on Computing, 13(3), 210–223.
Duda, R. O., & Hart, P. E. (1973). Pattern classification and scene analysis (Vol. 3). New York: Wiley.
Esuli, A., & Sebastiani, F. (2006). Sentiwordnet: A publicly available lexical resource for opinion mining. In Proceedings of LREC (Vol. 6, pp. 417–422).
Eugene, G. (1955). Citation indexes for science: A new dimension in documentation through association of ideas. Science, 122(3159), 108–111.
Garfield, E. (1972). Citation analysis as a tool in journal evaluation. American Association for the Advancement of Science.
Greenberg, H. J., & Murphy, F. H. (1991). Approaches to diagnosing infeasible linear programs. ORSA Journal on Computing, 3(3), 253–261.
Haddadene, H. A., Harik, H., & Salhi, S. (2012). On the PageRank algorithm for the articles ranking. In Proceedings of the World Congress on Engineering (Vol. 1, pp. 4–6).
Harter, S. P., Nisonger, T. E., & Weng, A. (1993). Semantic relationships between cited and citing articles in library and information science journals. Journal of the American Society for Information Science, 44(9), 543–552.
Hoffgen, K. U., Simon, H. U., & Vanhorn, K. S. (1995). Robust trainability of single neurons. Journal of Computer and System Sciences, 50(1), 114–125.
Langville, A. N., & Meyer, C. D. (2011). Google’s PageRank and beyond: The science of search engine rankings. Princeton: Princeton University Press.
Leacock, C., & Chodorow, M. (1998). Combining local context and WordNet similarity for word sense identification. WordNet: An Electronic Lexical Database, 49(2), 265–283.
Li, J., & Willett, P. (2009). ArticleRank: A PageRank-based alternative to numbers of citations for analysing citation networks. In Aslib Proceedings (Vol. 61, No. 6, pp. 605–618). Emerald Group Publishing Limited.
Liu, B. (2012). Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies, 5(1), 1–167.
Ma, N., Guan, J., & Zhao, Y. (2008). Bringing PageRank to the citation analysis. Information Processing and Management, 44(2), 800–810.
Marchand, M., & Golea, M. (1993). An approximation algorithm to find the largest linearly separable subset of training examples. In IEEE World Congress on Neural Networks (Vol. 3).
Miller, G. A. (1995). WordNet: A lexical database for English. Communications of the ACM, 38(11), 39–41.
Nicolaisen, J. (2007). Citation analysis. Annual Review of Information Science and Technology, 41(1), 609–641.
Page, L., Brin, S., Motwani, R., & Winograd, T. (1999). The PageRank citation ranking: Bringing order to the Web. Stanford InfoLab: Technical Report.
Piao, S., Ananiadou, S., Tsuruoka, Y., Sasaki, Y., & McNaught, J. (2007). Mining opinion polarity relations of citations. In International Workshop on Computational Semantics (IWCS) (pp. 366–371).
Qiao, H., Wang, Y., & Liang, Y. (2012). A value evaluation method for papers based on improved PageRank algorithm. In Computer Science and Network Technology (ICCSNT), 2012 2nd International Conference on (pp. 2201–2205). IEEE.
Renegar, J. (1988). A polynomial-time algorithm, based on Newton’s method, for linear programming. Mathematical Programming, 40(1–3), 59–93.
Saha, S., Saint, S., & Christakis, D. A. (2003). Impact factor: A valid measure of journal quality? Journal of the Medical Library Association, 91(1), 42.
Sayyadi, H., & Getoor, L. (2009). FutureRank: Ranking scientific articles by predicting their future PageRank. In SDM (pp. 533–544).
Sidorov, G., Gelbukh, A., Gómez-Adorno, H., & Pinto, D. (2014). Soft similarity and soft cosine measure: Similarity of features in vector space model. Computación y Sistemas, 18(3), 491–504.
Singh, A., Shubhankar, K., & Pudi, V. (2011). An efficient algorithm for ranking research papers based on citation network. In Data Mining and Optimization (DMO), 2011 3rd Conference on (pp. 88-95). IEEE.
Spearman, C. (1904). The proof and measurement of association between two things. The American journal of psychology, 15(1), 72–101.
Stamou, S., Mpouloumpasis, N., & Kozanidis, L. (2009). Deriving the impact of scientific publications by mining citation opinion terms. JDIM, 7(5), 283–289.
Su, C., Pan, Y., Zhen, Y., Ma, Z., Yuan, J., Guo, H., Yu, Z., Ma, C., & Wu, Y. (2011). PrestigeRank: A new evaluation method for papers and journals. Journal of Informetrics, 5(1), 1–13.
Toutanova, K., Klein, D., Manning, C. D., & Singer, Y. (2003). Feature-rich part-of-speech tagging with a cyclic dependency network. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology (Vol. 1, pp. 173–180). Association for Computational Linguistics.
Walker, D., Xie, H., Yan, K. K., & Maslov, S. (2007). Ranking scientific publications using a model of network traffic. Journal of Statistical Mechanics: Theory and Experiment, 2007(06), P06010.
Warmack, R. E., & Gonzalez, R. C. (1973). An algorithm for the optimal solution of linear inequalities and its application to pattern recognition. Computers, IEEE Transactions on, 100(12), 1065–1075.
Yan, E., Ding, Y., & Sugimoto, C. R. (2011). P-Rank: An indicator measuring prestige in heterogeneous scholarly networks. Journal of the American Society for Information Science and Technology, 62(3), 467–477.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Kazi, P., Patwardhan, M. & Joglekar, P. Towards a new perspective on context based citation index of research articles. Scientometrics 107, 103–121 (2016). https://doi.org/10.1007/s11192-016-1844-2
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11192-016-1844-2