Aspect-based summarisation using distributed clustering and single-objective optimisation
- 21 February 2019
- journal article
- research article
- Published by SAGE Publications in Journal of Information Science
- Vol. 46 (2), 176-190
- https://doi.org/10.1177/0165551519827896
Abstract
User reviews are accumulating on the web across many domains, which makes them difficult for readers to take in. It therefore becomes necessary to generate a summary that represents the entire body of reviews concisely and, for the ease of users, to do so for each feature or aspect mentioned in the reviews. Aspect-based summarisation plays a vital role in the field of opinion mining. This article proposes an aspect summarisation framework that uses sentence scoring clustering and a weight-based single-objective optimisation technique driven by an evolutionary algorithm. The system uses the MapReduce framework to incorporate the proposed combiner-based optimised clustering approach. A novel single-objective optimisation with a genetic algorithm is then developed to retrieve the top sentences from each cluster and generate a feature-based summary. The accuracy of the system-generated summary is evaluated with the Recall-Oriented Understudy for Gisting Evaluation (ROUGE) toolkit against human-written reference summaries. The system achieves more promising results than other standard feature-based summarisation systems.
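The recall-oriented evaluation mentioned in the abstract can be illustrated with a minimal sketch of ROUGE-N recall: the number of reference n-grams that also appear in the system summary (with counts clipped), divided by the total number of n-grams in the reference. This is not the official toolkit and not the authors' evaluation setup; the whitespace tokenisation, lowercasing, single reference summary, and the names `ngrams` and `rouge_n_recall` are assumptions made for illustration.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_recall(candidate, reference, n=1):
    """ROUGE-N recall: clipped n-gram overlap divided by reference n-gram count.

    Assumption: plain lowercased whitespace tokenisation, one reference summary.
    """
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    total = sum(ref.values())
    if total == 0:
        return 0.0
    # Each reference n-gram is matched at most as often as it occurs in the candidate.
    overlap = sum(min(count, cand[gram]) for gram, count in ref.items())
    return overlap / total


# Hypothetical system and human summaries for one aspect ("battery").
system_summary = "the battery life is good"
human_summary = "the battery life is very good"

print(rouge_n_recall(system_summary, human_summary, n=1))
print(rouge_n_recall(system_summary, human_summary, n=2))
```

Because the score is normalised by the reference length, a system summary is rewarded for covering what the human summary says rather than for being short; the real toolkit adds options such as stemming and multiple references that this sketch leaves out.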