Finding parts in very large corpora