TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets

Top Cited Papers
Open Access
Abstract
TGICL is a pipeline for analysis of large Expressed Sequence Tags (EST) and mRNA databases in which the sequences are first clustered based on pairwise sequence similarity, and then assembled by individual clusters (optionally with quality values) to produce longer, more complete consensus sequences. The system can run on multi-CPU architectures including SMP and PVM. Availability: http://www.tigr.org/tdb/tgi/software/ Contact: johnq@tigr.org; johnq@tigr.org * To whom correspondence should be addressed.