ABSTRACT
We introduce CiteSeer-API, a public API to CiteSeer-like services. CiteSeer-API is SOAP/WSDL based and allows for easy programmatical access to all the specific functionalities offered by CiteSeer services, including full text search of documents and citations and citation-based document discovery. In order to enable operability and interlinking with arbitrary software agents and digital library systems, CiteSeer-API uses digital content signatures to create system-independent handles for the Document, Citation and Group resources of CiteSeer servers. We discuss specific functionalities of CiteSeer-API that take advantage of these handlers in order to enable seamless location of CiteSeer resources. Finally we argue that the digital signature scheme used by CiteSeer-API is well suited for the creation of machine-usable semantic descriptions of digital library services which is the key toward seamless discovery and integration of services such as CiteSeer-API. CiteSeer-API is currently showcased on CiteSeer.IST, the CiteSeer server of the School of Information Science and Technology at the Pennsylvania State University.
- ACM Portal, http://portal.acm.org/portal.cfmGoogle Scholar
- M. Bawa, G.S. Manku, P. Raghavan, "SETS: search enhanced by topic segmentation", in Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2003), pp 306--313, 2003. Google ScholarDigital Library
- CiteSeer-API, http://citeseer.ist.psu.edu/api/Google Scholar
- CiteSeer.IST, http://citeseer.ist.psu.edu/Google Scholar
- CiteSeer, http://www.citeseer.org.Google Scholar
- CiteSeer Relator, http://www.pmbrowser.info/citeseer.phpGoogle Scholar
- Crespo, A.; Garcia-Molina, H. "Archival Storage for Digital Libraries", in Proceeding of the 3rd ACM Conference on Digital Libraries (DL'98), pp. 69--78, Pittsburgh, PA, USA, June 23-26, 1998. Google ScholarDigital Library
- The Collection of Computer Science Bibliographies, http://liinwww.ira.uka.de/bibliography/index.htmlGoogle Scholar
- DBLP, http://dblp.uni-trier.de/Google Scholar
- Dublin Core Metada Initiative, http://dublincore.org/Google Scholar
- A. Doan, Y. Lu, Y. Lee, and J. Han, "Object Matching for Data Integration: A Profile-Based Approach", in Proceedings of the IJCAI-03 Workshop on Information Integration on the Web, pp. 53--58, Acapulco, Mexico, August 9-10, 2003.Google Scholar
- eBizSearch, http://www.ebizsearch.org.Google Scholar
- C.L. Giles, K. Bollacker, S. Lawrence, "CiteSeer: An Automatic Citation Indexing System", in Proceedings of the 3rd ACM Conference on Digital Libraries (DL'98), pp 89--98, Pittsburgh, PA, USA, June 23-26, 1998. Google ScholarDigital Library
- J. Heflin, and J. Hendler, "Searching the Web with SHOE". in Artificial Intelligence for Web Search. Papers from the AAAI Workshop. WS-00-01. AAAI Press, Menlo Park, CA, 2000. pp. 35--40Google Scholar
- HomepageSearch, http://hpsearch.uni-trier.de/Google Scholar
- S. Lawrence, K. Bollacker, C.L. Giles, "Distributed Error Correction", in Proceedings of the 4th ACM Conference on Digital Libraries, pp. 232, Berkeley, CA, USA, August 11-14, 1999. Google ScholarDigital Library
- S. Lawrence, K. Bollacker and C.L. Giles, "Indexing and Retrieval of Scientific Literature", in Proceedings of the Eighth International Conference on Information and Knowledge Management (CIKM 99), pp 139--146, Kansas City, Missouri, November 2-6, 1999. Google ScholarDigital Library
- Q. Lu, L. Getoor, "Link-based Classification", in Proceedings of the 20th International Conference of Machine Learning (ICML 2003), Washington, DC, USA, pp 496--503, 2003.Google Scholar
- F. Lu, T. Johnsten, V. Raghavan and D. Traylor, "Enhancing Internet Search Engines to Achieve Concept-based Retrieval", in Proceeding of Inforum'99, Oakridge, TN, USA, May 1999.Google Scholar
- "The Open Archives Initiative Protocol for Metadata Harvesting", http://www.openarchives.org/OAI/openarchivesprotocol.htm.Google Scholar
- OWL Web Ontology Language Reference, http://www.w3.org/TR/2004/REC-owl-ref-20040210/Google Scholar
- OWL-S, http://www.daml.org/services/owl-s/1.0/Google Scholar
- Y. Petinot, P.B. Teregowda, H. Han, C.L. Giles, S. Lawrence, A. Rangaswamy and N. Pal, "eBizSearch: an OAI-Compliant Digital Library for eBusiness", in Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (JCDL 2003), pp 199--209, Houston (TX), May 2003. Google ScholarDigital Library
- A. Popescul, L.H. Ungar, S. Lawrence, D.M. Pennock, "Statistical Relational Learning for Document Mining", in Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), pp 275--282, 2003. Google ScholarDigital Library
- Resource Description Framework, http://www.w3.org/RDF/Google Scholar
- FIPS 180-1, "Secure Hash Standard", NIST, US Department of Commerce, Washington D.C., Apr. 1995.Google Scholar
- SMEALSearch, http://smealsearch.psu.eduGoogle Scholar
- Simple Object Access Protocol, http://www.w3.org/TR/soap/Google Scholar
- SRW - Search Retrieve Web Service, http://lcweb.loc.gov/z3950/agency/zing/srw/Google Scholar
- Web Service Description Language, http://www.w3.org/TR/wsdlGoogle Scholar
- DSpace Federation, http://www.dspace.org/Google Scholar
- Fedora, http://www.fedora.info/Google Scholar
- Yves Petinot, C. Lee Giles, Vivek Bhatnagar, Pradeep B. Teregowda, Hui Han, "Enabling Interoperability For Autonomous Digital Libraries : An API To CiteSeer Services", in Proceedings of the 4th ACM/IEEE Joint Conference on Digital Libraries (JCDL 2004), pp. 372--373, Tucson (AZ), June 2004. Google ScholarDigital Library
- The Digital Object Identifier System, http://www.doi.org/.Google Scholar
- The OpenURL Framework for Context-Sensitive Services, http://www.niso.org/committees/committee_ax.htmlGoogle Scholar
Index Terms
- CiteSeer-API: towards seamless resource location and interlinking for digital libraries
Recommendations
A service-oriented architecture for digital libraries
ICSOC '04: Proceedings of the 2nd international conference on Service oriented computingCiteSeer is currently a very large source of meta-data information on the World Wide Web (WWW). This meta-data is the key material for the Semantic Web. Still, CiteSeer is not yet a Semantic-enabled service and therefore its meta-data, although ...
Mining citation information from CiteSeer data
The CiteSeer digital library is a useful source of bibliographic information. It allows for retrieving citations, co-authorships, addresses, and affiliations of authors and publications. In spite of this, it has been relatively rarely used for automated ...
Bibliometric analysis of CiteSeer data for countries
This article describes the results of our analysis of the data from the CiteSeer digital library. First, we examined the data from the point of view of source top-level Internet domains from which the data were collected. Second, we measured country ...
Comments