Skip to main content
Log in

Towards a new perspective on context based citation index of research articles

  • Published:
Scientometrics Aims and scope Submit manuscript

Abstract

Citation index measures the impact or quality of a research publication. Currently, all the standard journal citation indices are used to measure the impact of individual research article published in those journals and are based on the citation count, making them a pure quantitative measure. To address this, as our first contribution, we propose to assign weights to the edges of citation network using three context based quality factors: 1. Sentiment analysis of the text surrounding the citation in the citing article, 2. Self-citations, 3. Semantic similarity between citing and cited article. Prior approaches make use of PageRank algorithm to compute the citation scores. This being an iterative process is not essential for acyclic citation networks. As our second contribution, we propose a non-iterative graph traversal based approach, which uses the edge weights and the initial scores of the non-cited nodes to compute the citation indices by visiting the nodes in topologically sorted order. Experimental results depict that rankings of citation indices obtained by our approach are improved over the traditional citation count based ranks. Also, our rankings are similar to that of PageRank based methods; but, our algorithm is simpler and 70 % more efficient. Lastly, we propose a new model for future reference, which computes the citation indices based on solution of system of linear inequalities, in which human-expert’s judgment is modeled by suitable linear constraints.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  • Amaldi, E., & Kann, V. (1995). The complexity and approximability of finding maximum feasible subsystems of linear relations. Theoretical Computer Science, 147(1), 181–210.

    Article  MathSciNet  MATH  Google Scholar 

  • Amin, M., & Mabe, M. A. (2003). Impact factors: Use and abuse. Medicina (Buenos Aires), 63(4), 347–354.

    Google Scholar 

  • Athar, A., & Teufel, S. (2012). Detection of implicit citations for sentiment detection. In Proceedings of the Workshop on Detecting Structure in Scholarly Discourse (pp. 18–26). Association for Computational Linguistics.

  • Casal, G. B. (2004). Assessing the quality of articles and scientific journals: Proposal for weighted impact factor and a quality index. Psychology in Spain, 8, 60–76.

    Google Scholar 

  • Cavalcanti, D. C., Prudêncio, R. B. C., Pradhan, S. S., Shah, J. Y., & Pietrobon, S. (2011). Good to be bad? Distinguishing between positive and negative citations in scientific impact. In Proceedings of 23rd IEEE International Conference on Tools with Artificial Intelligence (ICTAI), 2011 (pp. 156–162).

  • Chinneck, J. W. (2001). Fast heuristics for the maximum feasible subsystem problem. INFORMS Journal on Computing, 13(3), 210–223.

    Article  MathSciNet  MATH  Google Scholar 

  • Duda, R. O., & Hart, P. E. (1973). Pattern classification and scene analysis (Vol. 3). New York: Wiley.

    MATH  Google Scholar 

  • Esuli, A., & Sebastiani, F. (2006). Sentiwordnet: A publicly available lexical resource for opinion mining. In Proceedings of LREC (Vol. 6, pp. 417–422).

  • Eugene, G. (1955). Citation indexes for science: A new dimension in documentation through association of ideas. Science, 122(3159), 108–111.

    Article  Google Scholar 

  • Garfield, E. (1972). Citation analysis as a tool in journal evaluation. American Association for the Advancement of Science.

  • Greenberg, H. J., & Murphy, F. H. (1991). Approaches to diagnosing infeasible linear programs. ORSA Journal on Computing, 3(3), 253–261.

    Article  MATH  Google Scholar 

  • Haddadene, H. A., Harik, H., & Salhi, S. (2012). On the PageRank algorithm for the articles ranking. In Proceedings of the World Congress on Engineering (Vol. 1, pp. 4–6).

  • Harter, S. P., Nisonger, T. E., & Weng, A. (1993). Semantic relationships between cited and citing articles in library and information science journals. Journal of the American Society for Information Science, 44(9), 543–552.

    Article  Google Scholar 

  • Hoffgen, K. U., Simon, H. U., & Vanhorn, K. S. (1995). Robust trainability of single neurons. Journal of Computer and System Sciences, 50(1), 114–125.

    Article  MathSciNet  MATH  Google Scholar 

  • Langville, A. N., & Meyer, C. D. (2011). Google’s PageRank and beyond: The science of search engine rankings. Princeton: Princeton University Press.

    MATH  Google Scholar 

  • Leacock, C., & Chodorow, M. (1998). Combining local context and WordNet similarity for word sense identification. WordNet: An Electronic Lexical Database, 49(2), 265–283.

    Google Scholar 

  • Li, J., & Willett, P. (2009). ArticleRank: A PageRank-based alternative to numbers of citations for analysing citation networks. In Aslib Proceedings (Vol. 61, No. 6, pp. 605–618). Emerald Group Publishing Limited.

  • Liu, B. (2012). Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies, 5(1), 1–167.

    Article  Google Scholar 

  • Ma, N., Guan, J., & Zhao, Y. (2008). Bringing PageRank to the citation analysis. Information Processing and Management, 44(2), 800–810.

    Article  Google Scholar 

  • Marchand, M., & Golea, M. (1993). An approximation algorithm to find the largest linearly separable subset of training examples. In IEEE World Congress on Neural Networks (Vol. 3).

  • Miller, G. A. (1995). WordNet: A lexical database for English. Communications of the ACM, 38(11), 39–41.

    Article  Google Scholar 

  • Nicolaisen, J. (2007). Citation analysis. Annual Review of Information Science and Technology, 41(1), 609–641.

    Article  Google Scholar 

  • Page, L., Brin, S., Motwani, R., & Winograd, T. (1999). The PageRank citation ranking: Bringing order to the Web. Stanford InfoLab: Technical Report.

    Google Scholar 

  • Piao, S., Ananiadou, S., Tsuruoka, Y., Sasaki, Y., & McNaught, J. (2007). Mining opinion polarity relations of citations. In International Workshop on Computational Semantics (IWCS) (pp. 366–371).

  • Qiao, H., Wang, Y., & Liang, Y. (2012). A value evaluation method for papers based on improved PageRank algorithm. In Computer Science and Network Technology (ICCSNT), 2012 2nd International Conference on (pp. 2201–2205). IEEE.

  • Renegar, J. (1988). A polynomial-time algorithm, based on Newton’s method, for linear programming. Mathematical Programming, 40(1–3), 59–93.

    Article  MathSciNet  MATH  Google Scholar 

  • Saha, S., Saint, S., & Christakis, D. A. (2003). Impact factor: A valid measure of journal quality? Journal of the Medical Library Association, 91(1), 42.

    Google Scholar 

  • Sayyadi, H., & Getoor, L. (2009). FutureRank: Ranking scientific articles by predicting their future PageRank. In SDM (pp. 533–544).

  • Sidorov, G., Gelbukh, A., Gómez-Adorno, H., & Pinto, D. (2014). Soft similarity and soft cosine measure: Similarity of features in vector space model. Computación y Sistemas, 18(3), 491–504.

    Article  Google Scholar 

  • Singh, A., Shubhankar, K., & Pudi, V. (2011). An efficient algorithm for ranking research papers based on citation network. In Data Mining and Optimization (DMO), 2011 3rd Conference on (pp. 88-95). IEEE.

  • Spearman, C. (1904). The proof and measurement of association between two things. The American journal of psychology, 15(1), 72–101.

    Article  Google Scholar 

  • Stamou, S., Mpouloumpasis, N., & Kozanidis, L. (2009). Deriving the impact of scientific publications by mining citation opinion terms. JDIM, 7(5), 283–289.

    Google Scholar 

  • Su, C., Pan, Y., Zhen, Y., Ma, Z., Yuan, J., Guo, H., Yu, Z., Ma, C.,  & Wu, Y. (2011). PrestigeRank: A new evaluation method for papers and journals. Journal of Informetrics, 5(1), 1–13.

    Article  Google Scholar 

  • Toutanova, K., Klein, D., Manning, C. D., & Singer, Y. (2003). Feature-rich part-of-speech tagging with a cyclic dependency network. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology (Vol. 1, pp. 173–180). Association for Computational Linguistics.

  • Walker, D., Xie, H., Yan, K. K., & Maslov, S. (2007). Ranking scientific publications using a model of network traffic. Journal of Statistical Mechanics: Theory and Experiment, 2007(06), P06010.

    Article  Google Scholar 

  • Warmack, R. E., & Gonzalez, R. C. (1973). An algorithm for the optimal solution of linear inequalities and its application to pattern recognition. Computers, IEEE Transactions on, 100(12), 1065–1075.

    Article  MathSciNet  MATH  Google Scholar 

  • Yan, E., Ding, Y., & Sugimoto, C. R. (2011). P-Rank: An indicator measuring prestige in heterogeneous scholarly networks. Journal of the American Society for Information Science and Technology, 62(3), 467–477.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to ParvezAhamad Kazi.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kazi, P., Patwardhan, M. & Joglekar, P. Towards a new perspective on context based citation index of research articles. Scientometrics 107, 103–121 (2016). https://doi.org/10.1007/s11192-016-1844-2

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11192-016-1844-2

Keywords

Navigation