GANNET: A Machine Learning Approach to Document Retrieval
- 14 December 1994
- journal article
- Published by Informa UK Limited in Journal of Management Information Systems
- Vol. 11 (3), 7-41
- https://doi.org/10.1080/07421222.1994.11518048
Abstract
Information retrieval using probabilistic techniques has attracted significant attention on the part of researchers in information and computer science over the past few decades. In the 1980s, knowledge-based techniques also made an impressive contribution to “intelligent” information retrieval and indexing. More recently, information science researchers have turned to other, newer artificial intelligence-based inductive learning techniques, including neural networks, symbolic learning, and genetic algorithms. The newer techniques have provided great opportunities for researchers to experiment with diverse paradigms for effective information processing and retrieval. In this article we first provide an overview of the newer techniques and their usage in information science research. We then present in detail the algorithms we adopted for a hybrid Genetic Algorithms and Neural Nets based system, called GANNET. GANNET performed concept (keyword) optimization for user-selected documents during information retrieval using genetic algorithms. It then used the optimized concepts to perform concept exploration in a large network of related concepts through the Hopfield net parallel relaxation procedure. Based on a test collection of about 3,000 articles from DIALOG and an automatically created thesaurus, and using Jaccard’s score as a performance measure, our experiment showed that GANNET improved Jaccard’s scores by about 50 percent and helped identify the underlying concepts (keywords) that best describe the user-selected documents.
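The abstract names Jaccard’s score as the performance measure. As a rough illustration only, the sketch below computes the standard Jaccard coefficient between two keyword sets (size of the intersection divided by size of the union) and averages it over a set of documents. The function names, the averaging step, and the sample keywords are assumptions made for this example; the abstract does not specify how GANNET applies the score.

```python
def jaccard(a, b):
    """Standard Jaccard coefficient between two keyword collections."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        # Convention assumed here: two empty sets score 0.0.
        return 0.0
    return len(a & b) / len(union)


def mean_jaccard(candidate, documents):
    """Average Jaccard score of a candidate keyword set against each document.

    Hypothetical helper: averaging is an assumption for illustration,
    not a procedure stated in the abstract.
    """
    if not documents:
        return 0.0
    return sum(jaccard(candidate, doc) for doc in documents) / len(documents)


# Made-up keyword sets standing in for user-selected documents.
docs = [
    {"information retrieval", "neural networks", "genetic algorithms"},
    {"information retrieval", "genetic algorithms", "indexing"},
]
candidate = {"information retrieval", "genetic algorithms"}

print(mean_jaccard(candidate, docs))
```

A candidate keyword set that overlaps more with the selected documents, without adding many keywords they lack, scores closer to 1.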