A model of multimedia information retrieval
- 1 September 2001
- journal article
- Published by Association for Computing Machinery (ACM) in Journal of the ACM
- Vol. 48 (5), 909-970
- https://doi.org/10.1145/502102.502103
Abstract
Research on multimedia information retrieval (MIR) has recently witnessed a booming interest. A prominent feature of this research trend is its simultaneous but independent materialization within several fields of computer science. The resulting richness of paradigms, methods and systems may, in the long run, result in a fragmentation of efforts and slow down progress. The primary goal of this study is to promote an integration of methods and techniques for MIR by contributing a conceptual model that encompasses in a unified and coherent perspective the many efforts that are being produced under the label of MIR. The model offers a retrieval capability that spans two media, text and images, but also several dimensions: form, content and structure. In this way, it reconciles similarity-based methods with semantics-based ones, providing the guidelines for the design of systems that are able to provide a generalized multimedia retrieval service, in which the existing forms of retrieval not only coexist, but can be combined in any desired manner. The model is formulated in terms of a fuzzy description logic, which plays a twofold role: (1) it directly models semantics-based retrieval, and (2) it offers an ideal framework for the integration of the multimedia and multidimensional aspects of retrieval mentioned above. The model also accounts for relevance feedback in both text and image retrieval, integrating known techniques for taking into account user judgments. The implementation of the model is addressed by presenting a decomposition technique that reduces query evaluation to the processing of simpler requests, each of which can be solved by means of widely known methods for text and image retrieval, and semantic processing. A prototype for multidimensional image retrieval is presented that shows this decomposition technique at work in a significant case.