Evaluating Multiple Summaries Without Human Models: A First Experiment with a Trivergent Model

Cabrera-Diego, Luis Adrián; Torres-Moreno, Juan-Manuel; Durette, Barthélémy

doi:10.1007/978-3-319-41754-7_8

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9612)

Included in the following conference series:

International Conference on Applications of Natural Language to Information Systems

Abstract

In this work, we extend the task of evaluating summaries without human models by using a trivergent model. This model compares three elements simultaneously: the summary to evaluate, its source document, and a set of other summaries of the same source. We present a first pilot experiment on a French corpus, which yielded promising results.
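The three-way comparison described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names, the additive smoothing, the use of the Jensen-Shannon divergence as the pairwise measure, and the product used to combine the three pairwise values. The paper's own trivergence definition (Eqs. 1 and 2, not reproduced on this page) may differ.

```python
from collections import Counter
from math import log2


def word_distribution(text, vocabulary, smoothing=1e-3):
    """Smoothed unigram distribution of `text` over a shared vocabulary."""
    counts = Counter(text.lower().split())
    total = sum(counts[w] for w in vocabulary) + smoothing * len(vocabulary)
    return {w: (counts[w] + smoothing) / total for w in vocabulary}


def kl(p, q):
    """Kullback-Leibler divergence D(p || q) in bits."""
    return sum(p[w] * log2(p[w] / q[w]) for w in p)


def js(p, q):
    """Jensen-Shannon divergence in bits, bounded in [0, 1]."""
    m = {w: 0.5 * (p[w] + q[w]) for w in p}
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def three_way_score(summary, source, other_summaries):
    """Hypothetical three-way score: product of the pairwise divergences.

    P = summary to evaluate, Q = source document, R = pooled other summaries.
    The product is an illustrative choice, not the paper's definition.
    """
    texts = [summary, source, " ".join(other_summaries)]
    vocabulary = sorted({w for t in texts for w in t.lower().split()})
    p, q, r = (word_distribution(t, vocabulary) for t in texts)
    return js(p, q) * js(p, r) * js(q, r)
```

Under these assumptions the score is 0 when the three texts have identical word distributions and grows as they diverge; whitespace tokenization stands in for the stemming and filtering a real evaluation would need.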

Notes

1. http://homepages.inf.ed.ac.uk/alouis/
2. http://fresa.talne.eu
3. The cardinality condition consists in computing the divergences from the smallest distribution to the largest one. For example, taking Eqs. 1 and 2 as a basis, the condition would be \(|P| > |Q| > |R|\).
4. We did not test the combinations given by the possible trivergence’s associative property.
5. The corpus can be downloaded from: http://dev.termwatch.es/~fresa/CORPUS/PUCES/
6. https://essential-mining.com
7. http://snowballstem.org
8. We applied a Kruskal-Wallis test and a Games-Howell post hoc test as the results had heterogeneous variances.
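As a rough illustration of the rank-based test mentioned in note 8, the following sketch computes the Kruskal-Wallis H statistic in plain Python. The function names are hypothetical, the tie correction factor is omitted for brevity, and the Games-Howell post hoc test is not covered; it is not the authors' analysis code.

```python
from itertools import chain


def average_ranks(values):
    """Rank values from 1 to N, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def kruskal_wallis_h(*groups):
    """Kruskal-Wallis H statistic (no tie correction) for two or more groups."""
    pooled = list(chain.from_iterable(groups))
    n = len(pooled)
    ranks = average_ranks(pooled)
    total, start = 0.0, 0
    for group in groups:
        rank_sum = sum(ranks[start:start + len(group)])
        total += rank_sum ** 2 / len(group)
        start += len(group)
    return 12.0 / (n * (n + 1)) * total - 3 * (n + 1)
```

For k groups, H is compared against a chi-squared distribution with k - 1 degrees of freedom to obtain a p-value; a statistics package would normally handle that step and the tie correction.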

Author information

Authors and Affiliations

LIA, Université d’Avignon et des Pays de Vaucluse, Avignon, France
Luis Adrián Cabrera-Diego & Juan-Manuel Torres-Moreno
École Polytechnique de Montréal, Montréal, Canada
Juan-Manuel Torres-Moreno
Adoc Talent Management, Paris, France
Barthélémy Durette

Corresponding author

Correspondence to Luis Adrián Cabrera-Diego.

Editor information

Editors and Affiliations

Conservatoire National des Arts et Métiers, Paris, France
Elisabeth Métais
University of Salford, Salford, United Kingdom
Farid Meziane
University of Salford, Salford, United Kingdom
Mohamad Saraee
Oakland University, Rochester, Michigan, USA
Vijayan Sugumaran
University of Salford, Salford, United Kingdom
Sunil Vadera

Cite this paper

Cabrera-Diego, L.A., Torres-Moreno, JM., Durette, B. (2016). Evaluating Multiple Summaries Without Human Models: A First Experiment with a Trivergent Model. In: Métais, E., Meziane, F., Saraee, M., Sugumaran, V., Vadera, S. (eds) Natural Language Processing and Information Systems. NLDB 2016. Lecture Notes in Computer Science, vol 9612. Springer, Cham. https://doi.org/10.1007/978-3-319-41754-7_8

DOI: https://doi.org/10.1007/978-3-319-41754-7_8
Published: 17 June 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41753-0
Online ISBN: 978-3-319-41754-7
eBook Packages: Computer Science, Computer Science (R0)
