Abstract
In recent years, with the promotion and application of artificial intelligence technology in all fields, English has also had a lot of attention and automatic grading areas, but not much in the text of the characterization of breakthrough, the traditional technology based on latent semantic analysis of the text said, more latent semantic analysis technology can extract thematic information, and information is ignored. The purpose of this paper is to better reflect the content of the text, evaluate the English level more accurately, and improve students' English level better. This paper proposes a text representation method based on word vector clustering and a text representation method based on vector space model. A TF-IDF algorithm based on word vector is proposed. It is verified that the quadratic weighted Kappa value of the prediction results of the multi-model fusion algorithm based on word vector is better than the first place in Kaggle international English composition scoring competition, which verifies the effectiveness of the algorithm.
Export citation and abstract BibTeX RIS
Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.