Using Word Clusters to Detect Similar Web Documents
- 1 January 2006
- book chapter
- conference paper
- Published by Springer Science and Business Media LLC in Lecture Notes in Computer Science
- pp. 215-228
- https://doi.org/10.1007/11811220_19
Abstract
No abstract available
This publication has 4 references indexed in Scilit:
- A Sentence-Based Copy Detection Approach for Web Documents. Lecture Notes in Computer Science, 2005
- Detecting similar documents using salient terms. Published by Association for Computing Machinery (ACM), 2002
- ACM SIGMOD Record, 1995
- An algorithm for suffix stripping. Program: electronic library and information systems, 1980