Processing Oxford Nanopore Long Reads Using Amazon Web Services
Open Access
- 1 December 2020
- journal article
- Published by Institute of Biochemistry in Biomedical Chemistry: Research and Methods
- Vol. 3 (4), e00131
- https://doi.org/10.18097/bmcrm00131
Abstract
Studies of genomes and transcriptomes are performed using sequencers that read the sequence of nucleotide residues of genomic DNA, RNA, or complementary DNA (cDNA). The analysis consists of an experimental part (obtaining primary data) and bioinformatic processing of primary data. The bioinformatics part is performed with different sets of input parameters. The selection of the optimal values of the parameters, as a rule, requires significant computing power. The article describes a protocol for processing transcriptome data by virtual computers provided by the cloud platform Amazon Web Services (AWS) using the example of the recently emerging technology of long DNA and RNA sequences (Oxford Nanopore Technology). As a result, a virtual machine and instructions for its use have been developed, thus allowing a wide range of molecular biologists to independently process the results obtained using the "Oxford nanopore".Keywords
This publication has 17 references indexed in Scilit:
- Hot-starting software containers for STAR alignerGigaScience, 2018
- Minimap2: pairwise alignment for nucleotide sequencesBioinformatics, 2018
- Data processing, multi-omic pathway mapping, and metabolite activity analysis using XCMS OnlineNature Protocols, 2018
- Salmon provides fast and bias-aware quantification of transcript expressionNature Methods, 2017
- The MaxQuant computational platform for mass spectrometry-based shotgun proteomicsNature Protocols, 2016
- SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell SequencingJournal of Computational Biology, 2012
- MR-Tandem: parallel X!Tandem using Hadoop MapReduce on Amazon Web ServicesBioinformatics, 2011
- RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genomeBMC Bioinformatics, 2011
- The Sequence Alignment/Map format and SAMtoolsBioinformatics, 2009
- Ultrafast and memory-efficient alignment of short DNA sequences to the human genomeGenome Biology, 2009