ABSTRACT
An emerging approach to knowledge acquisition is to collect statements from volunteer contributors over the Web. In this approach, the design of the acquisition interface is key to focusing collection on statements of interest, avoiding spurious entries, and retaining contributors, among other goals. Several such volunteer-contribution-based systems have been deployed to date, each with its own idiosyncratic interface. This paper discusses key challenges faced by volunteer collection interfaces and outlines the design features that we have found effective in addressing some aspects of those challenges. It describes how these features have been implemented in deployed collection systems and reflects on the data collected to extract lessons for future work in this research area.
Index Terms
- Improving the design of intelligent acquisition interfaces for collecting world knowledge from web contributors