DOI: 10.3115/1119176.1119206

Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons

Published: 31 May 2003

ABSTRACT

Models for many natural language tasks benefit from the flexibility to use overlapping, non-independent features. For example, the need for labeled data can be drastically reduced by taking advantage of domain knowledge in the form of word lists, part-of-speech tags, character n-grams, and capitalization patterns. While it is difficult to capture such inter-dependent features with a generative probabilistic model, conditionally-trained models, such as conditional maximum entropy models, handle them well. There has been significant work with such models for greedy sequence modeling in NLP (Ratnaparkhi, 1996; Borthwick et al., 1998).
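To make the abstract's notion of overlapping, non-independent features concrete, the sketch below is hypothetical code, not taken from the paper: the function names, the toy lexicon, and the example sentence are invented for illustration. It builds the kinds of binary features the abstract mentions for a single token position: word identity, part-of-speech tag, capitalization pattern, character n-grams, and membership in a word list. Because several features fire for the same token and are derived from the same evidence, they are clearly not independent, which is the situation that favors conditionally-trained models such as CRFs over generative ones.

```python
# Minimal sketch (assumed, not from the paper) of overlapping,
# non-independent features for named entity recognition.

def capitalization_pattern(word):
    """Map a token to a coarse capitalization shape, e.g. 'McCallum' -> 'AaAa'."""
    pattern = []
    for ch in word:
        if ch.isupper():
            tag = "A"
        elif ch.islower():
            tag = "a"
        elif ch.isdigit():
            tag = "0"
        else:
            tag = "-"
        # Collapse runs of the same character class.
        if not pattern or pattern[-1] != tag:
            pattern.append(tag)
    return "".join(pattern)

def extract_features(tokens, pos_tags, lexicon, i):
    """Binary features for position i; note how they overlap and share evidence."""
    word = tokens[i]
    feats = {
        "word=" + word.lower(): 1,
        "pos=" + pos_tags[i]: 1,
        "cap=" + capitalization_pattern(word): 1,
        "in_lexicon": 1 if word.lower() in lexicon else 0,
    }
    # Character n-grams (here trigrams) of the padded word.
    padded = "<" + word.lower() + ">"
    for j in range(len(padded) - 2):
        feats["tri=" + padded[j:j + 3]] = 1
    return feats

if __name__ == "__main__":
    tokens = ["Andrew", "McCallum", "works", "at", "UMass"]
    pos = ["NNP", "NNP", "VBZ", "IN", "NNP"]
    person_lexicon = {"andrew", "mccallum"}   # e.g. a web-harvested word list
    print(extract_features(tokens, pos, person_lexicon, 1))
```

Running this prints the active features for the token "McCallum", including both "cap=AaAa" and several character trigrams that overlap with the word-identity and lexicon features.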

References

1. A. Borthwick, J. Sterling, E. Agichtein, and R. Grishman. 1998. Exploiting diverse knowledge sources via maximum entropy in named entity recognition. In Proceedings of the Sixth Workshop on Very Large Corpora. Association for Computational Linguistics.
2. M. Collins and Y. Singer. 1999. Unsupervised models for named entity classification. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora.
3. Stephen Della Pietra, Vincent J. Della Pietra, and John D. Lafferty. 1997. Inducing features of random fields. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(4):380-393.
4. Rosie Jones, Andrew McCallum, Kamal Nigam, and Ellen Riloff. 1999. Bootstrapping for text learning tasks. In IJCAI-99 Workshop on Text Mining: Foundations, Techniques and Applications.
5. John Lafferty, Andrew McCallum, and Fernando Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of ICML.
6. Robert Malouf. 2002. A comparison of algorithms for maximum entropy parameter estimation. In Sixth Workshop on Computational Language Learning (CoNLL-2002).
7. Andrew McCallum and Fang-Fang Feng. 2003. Chinese word segmentation with conditional random fields and integrated domain knowledge. Unpublished manuscript.
8. Andrew McCallum. 2003. Efficiently inducing features of conditional random fields. In Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI03). Submitted.
9. Adwait Ratnaparkhi. 1996. A maximum entropy model for part-of-speech tagging. In Eric Brill and Kenneth Church, editors, Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 133-142. Association for Computational Linguistics.
10. Fei Sha and Fernando Pereira. 2003. Shallow parsing with conditional random fields. In Proceedings of Human Language Technology, NAACL.

Published in

CONLL '03: Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
May 2003
213 pages

Publisher: Association for Computational Linguistics, United States

