TEfinder: A Bioinformatics Pipeline for Detecting New Transposable Element Insertion Events in Next-Generation Sequencing Data
Open Access
- 4 February 2021
- Vol. 12 (2), 224
- https://doi.org/10.3390/genes12020224
Abstract
Transposable elements (TEs) are mobile elements capable of introducing genetic changes rapidly. Their importance has been documented in many biological processes, such as introducing genetic instability, altering patterns of gene expression, and accelerating genome evolution. Increasing appreciation of TEs has resulted in a growing number of bioinformatics tools for identifying insertion events. However, the application of existing tools is limited by a narrowly focused design, by too many dependencies on other tools, or by a requirement for prior knowledge, supplied as input files, that may not be readily available to all users. Here, we report a simple pipeline, TEfinder, developed for the detection of new TE insertions with minimal software and input file dependencies. The external software requirements are BEDTools, SAMtools, and Picard. Necessary input files include the reference genome sequence in FASTA format, an alignment file from paired-end reads, existing TEs in GTF format, and a text file of TE names. We tested TEfinder on several evolving populations of Fusarium oxysporum generated through a short-term adaptation study. Our results demonstrate that this easy-to-use tool can effectively detect new TE insertion events, making it accessible and practical for TE analysis.
Funding Information
- National Science Foundation (IOS-1652641)
- National Institutes of Health (R01EY030150)
- Burroughs Wellcome Fund (1014893, 27374-R)
- National Institute of Food and Agriculture (2011-35600-30379, MASR-2009-04374)
- Ministerio de Ciencia e Innovación (PID2019-108045RB-I00)
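Of the four input files named in the abstract, the text file of TE names is the one most likely to be prepared by hand. The sketch below shows one way such a list might be derived from the GTF annotation of existing TEs. It is an illustration under stated assumptions, not part of TEfinder: the function name `te_names_from_gtf` is hypothetical, and the choice of `gene_id` as the attribute holding the TE name is an assumption that should be checked against the annotation actually used.

```python
import re


def te_names_from_gtf(gtf_lines, attribute="gene_id"):
    """Collect unique TE names, in first-seen order, from GTF lines.

    Hypothetical helper: assumes the TE name is stored in the given
    attribute of the ninth (attributes) column of each GTF record.
    """
    # Matches e.g. gene_id "TE_A" inside the attributes column.
    pattern = re.compile(r"\b" + re.escape(attribute) + r'\s+"([^"]+)"')
    names = []
    seen = set()
    for line in gtf_lines:
        line = line.rstrip("\n")
        # Skip blank lines and comment/header lines.
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        # A GTF record has nine tab-separated columns; skip malformed rows.
        if len(fields) < 9:
            continue
        match = pattern.search(fields[8])
        if match and match.group(1) not in seen:
            seen.add(match.group(1))
            names.append(match.group(1))
    return names
```

The returned list can be written out one name per line to produce a plain-text file; passing an open file handle as `gtf_lines` works because the function only iterates over lines.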