Automatic index construction for multimedia digital libraries

doi:10.1016/j.ipm.2009.10.006

Information Processing & Management

Volume 46, Issue 3, May 2010, Pages 295-307

https://doi.org/10.1016/j.ipm.2009.10.006 Get rights and content

Abstract

Indexing remains one of the most popular tools provided by digital libraries to help users identify and understand the characteristics of the information they need. Despite extensive studies of the problem of automatic index construction for text-based digital libraries, the construction of multimedia digital libraries continues to represent a challenge, because multimedia objects usually lack sufficient text information to ensure reliable index learning. This research attempts to tackle the problem of automatic index construction for multimedia objects by employing Web usage logs and limited keywords pertaining to multimedia objects. The tests of two proposed algorithms use two different data sets with different amounts of textual information. Web usage logs offer precious information for building indexes of multimedia digital libraries with limited textual information. The proposed methods generally yield better indexes, especially for the artwork data set.

Introduction

The rapid advances of information technologies have allowed for the inclusion of vast amounts of electronic information in digital libraries. This electronic information initially was primarily text-based, but it has expanded to include graphics, animation, audio, video, and interactive media (Tjondronegoro & Spink, 2008). Thus, the ability to help users easily, efficiently, and conveniently retrieve multimedia information from the vast array available presents both an opportunity and a challenge for modern digital libraries.

In traditional text-based digital libraries, indexing provides the main tool to help users seek information and understand the topics contained within documents of interest to them (Berry & Castellanos, 2007). Many researchers address the challenge of indexing text-based information by leveraging content features derived from titles, keywords, abstracts, or full texts and thereby determining similarities among objects (Berry and Castellanos, 2007, Boley et al., 1999, Zhao, 2002). A clustering technique then develops a set of clusters, each of which receives a label, assigned manually or automatically (Yang & Pederson, 1997) that distinguishes the documents in that cluster from those in other clusters. The clusters then form an index for organizing text-based information.

Indexing multimedia information, however, is more challenging, because these data comprise opaque collections of bytes with limited textual information, such as short titles, the date of creation, or names of the artists (Mehtre, Kankanhalli, & Lee, 1997). Despite the existence of some techniques for automatic keyword extraction of multimedia objects in specific domains, the number of derived keywords and their accuracy remain limited (Tsai, McGarry, & Tait, 2006). Furthermore, with limited textual information for multimedia objects, traditional text-based clustering approach may not work well. Therefore, a pressing need emerges, namely, to integrate other sources of data to cluster objects in a multimedia digital library.

With the advent of the World Wide Web, an overwhelming number of digital libraries now provide interfaces that allow for ubiquitous information access. The usage data associated with Web-based digital libraries automatically get recorded in Web usage logs by Web servers. Therefore, each user click within a Web-based digital library results in one or more records in the Web usage log, such that each record represents the source IP, access time, access method/URL/protocol, referred URL, status, bytes transferred, browser type, and so forth. Table 1 displays a sample Web usage log for the electronic thesis and dissertation (ETD) system at National Sun Yat-Sen University. The first record in Table 1 shows that an entry with a universal resource number etd-0717101-163917 was accessed on 01/Apr/2004:00:00:02 by a user with the IP address 218.165.248.55. The second and third records in Table 1 indicate a user who has chosen to view an entry identified by etd-0130101-140550. Objects of the same category logically should have a higher chance of being accessed together compared with objects in different categories. Therefore, we propose to tackle the problem of indexing multimedia information by employing Web usage logs, in combination with limited keywords attached to multimedia information.

This article reports our endeavor to integrate textual data and usage data pertaining to multimedia objects and thus build the index. We develop two methods to construct an index for multimedia objects that employs both the (possibly limited) textual data associated with the objects and their usage data over a specified period of time, as recorded by Web servers. One method, called MCAT, applies both clustering and classification techniques, and the other, MCLU, uses only clustering techniques. We apply both methods, as well as methods that use only textual data, to two data sets derived from the World Art digital library from Airiti, Inc., in Taiwan and the ETD system at National Sun Yat-Sen University. The World Art digital library involves only a limited amount of textual information pertaining to images of artwork. The evaluation results using this data set show that an index constructed by considering both usage and content data better matches the predefined index than does an index that uses only one source of data. In addition, the resulting index effectively reduces users’ efforts to find the information they require. The ETD system contains a profound amount of textual information, in addition to usage data, so we use it to investigate how our proposed methods perform even for a digital library with rich textual data. Compared with traditional text-based approaches, the indexes created by our proposed methods are only slightly inferior in terms of matching the predefined index. Nevertheless, our proposed indexes retain the advantages of enabling users to identify the information they need quickly. We thus conclude that the proposed methods offer promising improvements for building indexes for multimedia digital libraries.

The remainder of this article is organized as follows: In Section 2, we review related research efforts. In Section 3, we describe our methods for indexing multimedia information, using textual content information and Web usage logs. We report the results of our experiments in Section 4 and evaluate the various methods by applying real-world data collected from the World Art digital library and an ETD system. Finally, in Section 5, we summarize and point to some further research directions.

Section snippets

Literature review

Digital libraries attract tremendous interest, including several research projects that attempt to address the vast challenges in this field, such as the Alexandria Digital Library (ADL) project at the University of California at Santa Barbara (Manjunath & Ma, 1996), the DLI project at the University of Illinois (Chen et al., 1996), the Informedia project at Carnegie Mellon University (Wactlar, Kanade, Smith, & Stevens, 1996), the Variations2 project at Indiana University (Byrd & Isaacson, 2003

Proposed methods

Most previous work employs textual information to construct document indexes, though more recent work also facilitates clustering with Web usage logs. We observe though that multimedia digital libraries often lack sufficient textural information and propose constructing an index of multimedia objects by employing both (textual) content and usage data. We define the content similarity between two multimedia objects according to their textual data. Specifically, every multimedia object can be

Data sets

To evaluate our proposed methods, we collected data from two test beds: the World Art Digital Library from Airiti, Inc. (http://www.airiti.com/Arts), whose home page (in Chinese) is in Fig. 3, and the ETD System at National Sun Yat-Sen University (NSYSU) (http://www.lib.nsysu.edu.tw/eThesys/), whose English home page appears in Fig. 4. The World Art Digital Library contains a limited amount of textual information, whereas the NSYSU ETD System provides abundant textual content. We also obtained

Conclusions

In this article, we address the problem of index construction for multimedia digital libraries by developing two index construction methods, MCAT and MCLU. These two methods employ primitive keywords and usage data to develop an index. The empirical experiments reveal that compared with traditional content-based clustering methods, our methods, when applied to digital libraries with limited textual data, generate indexes that exhibit better content and usage entropies. For digital libraries

References (36)

D. Boley et al.
Partitioning-based clustering for Web document categorization
Decision Support Systems
(1999)
Q. Li et al.
A probabilistic music recommender considering user opinions and audio features
Information Processing and Management
(2007)
B. Mehtre et al.
Shape measures for content based image retrieval: A comparison
Information Processing and Management
(1997)
G. Salton et al.
Term-weighting approaches in automatic retrieval
Information Processing and Management
(1988)
D. Tjondronegoro et al.
Web Search Engine Multimedia Functionality
Information Processing and Management
(2008)
E. Albuz et al.
Scalable color image indexing and retrieval using vector wavelets
IEEE Transaction on Knowledge and Data Engineering
(2001)
M.W. Berry et al.
Survey of text mining. II: Clustering, classification, and retrieval
(2007)
D. Byrd et al.
A music representation requirement specification for academia
Computer Music Journal
(2003)
J. Chen et al.
A survey on algorithms for mining frequent itemset over data streams
Knowledge and Information Systems
(2008)
H. Chen et al.
A parallel computing approach to creating engineering concept spaces for semantic retrieval: The Illinois digital library initiative project
IEEE Transactions on Pattern Analysis and Machine Intelligence
(1996)

Cooley, R., Mobasher, B., & Srivastava, J. (1999). Creating adaptive Web sites through usage-based clustering of URLs....

M.M. Gaber et al.

Mining data streams: A review

ACM SIGMOD Record

(2005)

Han, E. H., Karypis, G., Kumar, V., & Mobasher, B. (1997). Clustering based on association rule hypergraphs. In...

J. Han et al.

Data mining: Concepts and techniques

(2006)

E.H. Han et al.

Hypergraph based clustering in high dimensional data sets: A summary of results

IEEE Bulletin of the Technical Committee on Data Engineering

(1998)

S.-Y. Hwang et al.

Combining article content and Web usage for literature recommendation in digital libraries

Online Information Review

(2004)

S.-Y. Hwang et al.

A prototype WWW literature recommendation system for digital libraries

Online Information Review

(2003)

B.J. Jansen et al.

Searching for multimedia: Analysis of audio, video and image Web queries

World Wide Web

(2000)

Cited by (6)

LVTIA: A new method for keyphrase extraction from scientific video lectures
2022, Information Processing and Management
Citation Excerpt :
Keyword extraction or keyphrase extraction is used alternatively in this research, as in the literature they have the same meaning. Various studies have been done in multimedia indexing, in general, while some of them have focused on converting multimedia into text, and extracting keywords and tags from the textual content (Awad et al., 2017; Hwang, Yang, & Ting, 2010; Kaavya & LakshmiPriya, 2015). For video indexing and keyphrase extraction, several research are done based on analyzing video frames and extracting motion-based and object recognition based features from the frames (Gayathri & Mahesh, 2020; Spolaôr et al., 2020).
Due to the growth of technology, the expansion of communication infrastructure and crises of COVID-19 pandemic, e-learning and virtual education is expanding. One of the best ways to access and organize these information is indexing using automatic intelligent methods. Indexing requires assigning keywords or keyphrases to each video, to represent its content. The main focus of this research is to propose an approach by which appropriate keyphrases are assigned to scientific video lectures. For this purpose, a new algorithm called LVTIA, Lecture Video Text mining-base Indexing Algorithm, is proposed in which the textual content of video frames along with the text extracted from audio signal are merged together, and a new keyphrase extraction method is proposed. The proposed method considers new local and global features for each candidate phrases, along with a new feature reflecting the occurrence of each phrase in the audio signals or video frames. The method is implemented using five distinct data sets in English and Persian. The results are evaluated based on precision, recall, F1-measure and MAP@K metrics and compared with some of the well-known keyphrase extraction algorithms. Based on the results, the best MAP@K for English videos is related to LVTIA algorithm with the values of, 0.7912, 0.8069, 0.8069 for $k = 5, 10, 15$ , respectively. In addition, LVTIA is able to provide best MAP@K for Persian videos which are 0.6367, 0.6866, 0.6874 for $k = 5, 10, 15$ , respectively. According to Friedman nonparametric statistical test, the performance of different algorithms in precision, recall, F1-measure metrics, are statistically different from LVTIA as well.
Automatic subject indexing of textt
2019, Knowledge Organization
Fuzzy linguistic recommender systems for the selective diffusion of information in digital libraries
2017, Journal of Information Processing Systems
Multimedia networking issues for digital video libraries
2014, Electronic Library
Design and implementation of a multimedia database application system
2013, Journal of Theoretical and Applied Information Technology
The use of the intelligent library and tutoring system at all stages of a building life cycle
2011, Engineering Economics

View full text

Automatic index construction for multimedia digital libraries

Abstract

Introduction

Section snippets

Literature review

Proposed methods

Data sets

Conclusions

Decision Support Systems

Information Processing and Management

Information Processing and Management

Information Processing and Management

Information Processing and Management

Scalable color image indexing and retrieval using vector wavelets

IEEE Transaction on Knowledge and Data Engineering

Survey of text mining. II: Clustering, classification, and retrieval

A music representation requirement specification for academia

Computer Music Journal

A survey on algorithms for mining frequent itemset over data streams

Knowledge and Information Systems

A parallel computing approach to creating engineering concept spaces for semantic retrieval: The Illinois digital library initiative project

IEEE Transactions on Pattern Analysis and Machine Intelligence

Mining data streams: A review

ACM SIGMOD Record

Data mining: Concepts and techniques

Hypergraph based clustering in high dimensional data sets: A summary of results

IEEE Bulletin of the Technical Committee on Data Engineering

Combining article content and Web usage for literature recommendation in digital libraries

Online Information Review

A prototype WWW literature recommendation system for digital libraries

Online Information Review

Searching for multimedia: Analysis of audio, video and image Web queries

World Wide Web