VIPER: Visualization Pipeline for RNA-seq, a Snakemake workflow for efficient and complete RNA-seq analysis

MacIntosh I. Cornwell, Mahesh Vangala, Len Taing, Zachary Herbert, Johannes Köster, Bo Li, Hanfei Sun, Taiwen Li, Jian Zhang, Xintao Qiu, Matthew Pun, Rinath Jeselsohn, Myles Brown, X. Shirley Liu, Henry W. Long

Research output: Contribution to journalArticle

15 Citations (Scopus)

Abstract

Background: RNA sequencing has become a ubiquitous technology used throughout life sciences as an effective method of measuring RNA abundance quantitatively in tissues and cells. The increase in use of RNA-seq technology has led to the continuous development of new tools for every step of analysis from alignment to downstream pathway analysis. However, effectively using these analysis tools in a scalable and reproducible way can be challenging, especially for non-experts. Results: Using the workflow management system Snakemake we have developed a user friendly, fast, efficient, and comprehensive pipeline for RNA-seq analysis. VIPER (Visualization Pipeline for RNA-seq analysis) is an analysis workflow that combines some of the most popular tools to take RNA-seq analysis from raw sequencing data, through alignment and quality control, into downstream differential expression and pathway analysis. VIPER has been created in a modular fashion to allow for the rapid incorporation of new tools to expand the capabilities. This capacity has already been exploited to include very recently developed tools that explore immune infiltrate and T-cell CDR (Complementarity-Determining Regions) reconstruction abilities. The pipeline has been conveniently packaged such that minimal computational skills are required to download and install the dozens of software packages that VIPER uses. Conclusions: VIPER is a comprehensive solution that performs most standard RNA-seq analyses quickly and effectively with a built-in capacity for customization and expansion.

Original languageEnglish (US)
Article number135
JournalBMC Bioinformatics
Volume19
Issue number1
DOIs
StatePublished - Apr 12 2018

Fingerprint

Workflow
RNA
Work Flow
Visualization
Pipelines
Sequencing
Pathway
Complementarity Determining Regions
Technology
RNA Sequence Analysis
Alignment
Biological Science Disciplines
T-cells
Workflow Management System
Quality Control
Customization
Life sciences
Differential Expression
Complementarity
Software packages

Keywords

  • Analysis
  • Gene fusion
  • Immunological infiltrate
  • Pipeline
  • RNA-seq
  • Snakemake

ASJC Scopus subject areas

  • Structural Biology
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Applied Mathematics

Cite this

VIPER : Visualization Pipeline for RNA-seq, a Snakemake workflow for efficient and complete RNA-seq analysis. / Cornwell, MacIntosh I.; Vangala, Mahesh; Taing, Len; Herbert, Zachary; Köster, Johannes; Li, Bo; Sun, Hanfei; Li, Taiwen; Zhang, Jian; Qiu, Xintao; Pun, Matthew; Jeselsohn, Rinath; Brown, Myles; Liu, X. Shirley; Long, Henry W.

In: BMC Bioinformatics, Vol. 19, No. 1, 135, 12.04.2018.

Research output: Contribution to journalArticle

Cornwell, MI, Vangala, M, Taing, L, Herbert, Z, Köster, J, Li, B, Sun, H, Li, T, Zhang, J, Qiu, X, Pun, M, Jeselsohn, R, Brown, M, Liu, XS & Long, HW 2018, 'VIPER: Visualization Pipeline for RNA-seq, a Snakemake workflow for efficient and complete RNA-seq analysis', BMC Bioinformatics, vol. 19, no. 1, 135. https://doi.org/10.1186/s12859-018-2139-9
Cornwell, MacIntosh I. ; Vangala, Mahesh ; Taing, Len ; Herbert, Zachary ; Köster, Johannes ; Li, Bo ; Sun, Hanfei ; Li, Taiwen ; Zhang, Jian ; Qiu, Xintao ; Pun, Matthew ; Jeselsohn, Rinath ; Brown, Myles ; Liu, X. Shirley ; Long, Henry W. / VIPER : Visualization Pipeline for RNA-seq, a Snakemake workflow for efficient and complete RNA-seq analysis. In: BMC Bioinformatics. 2018 ; Vol. 19, No. 1.
@article{5ede23ad32ca4d0c809ba83644b96108,
title = "VIPER: Visualization Pipeline for RNA-seq, a Snakemake workflow for efficient and complete RNA-seq analysis",
abstract = "Background: RNA sequencing has become a ubiquitous technology used throughout life sciences as an effective method of measuring RNA abundance quantitatively in tissues and cells. The increase in use of RNA-seq technology has led to the continuous development of new tools for every step of analysis from alignment to downstream pathway analysis. However, effectively using these analysis tools in a scalable and reproducible way can be challenging, especially for non-experts. Results: Using the workflow management system Snakemake we have developed a user friendly, fast, efficient, and comprehensive pipeline for RNA-seq analysis. VIPER (Visualization Pipeline for RNA-seq analysis) is an analysis workflow that combines some of the most popular tools to take RNA-seq analysis from raw sequencing data, through alignment and quality control, into downstream differential expression and pathway analysis. VIPER has been created in a modular fashion to allow for the rapid incorporation of new tools to expand the capabilities. This capacity has already been exploited to include very recently developed tools that explore immune infiltrate and T-cell CDR (Complementarity-Determining Regions) reconstruction abilities. The pipeline has been conveniently packaged such that minimal computational skills are required to download and install the dozens of software packages that VIPER uses. Conclusions: VIPER is a comprehensive solution that performs most standard RNA-seq analyses quickly and effectively with a built-in capacity for customization and expansion.",
keywords = "Analysis, Gene fusion, Immunological infiltrate, Pipeline, RNA-seq, Snakemake",
author = "Cornwell, {MacIntosh I.} and Mahesh Vangala and Len Taing and Zachary Herbert and Johannes K{\"o}ster and Bo Li and Hanfei Sun and Taiwen Li and Jian Zhang and Xintao Qiu and Matthew Pun and Rinath Jeselsohn and Myles Brown and Liu, {X. Shirley} and Long, {Henry W.}",
year = "2018",
month = "4",
day = "12",
doi = "10.1186/s12859-018-2139-9",
language = "English (US)",
volume = "19",
journal = "BMC Bioinformatics",
issn = "1471-2105",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR

T1 - VIPER

T2 - Visualization Pipeline for RNA-seq, a Snakemake workflow for efficient and complete RNA-seq analysis

AU - Cornwell, MacIntosh I.

AU - Vangala, Mahesh

AU - Taing, Len

AU - Herbert, Zachary

AU - Köster, Johannes

AU - Li, Bo

AU - Sun, Hanfei

AU - Li, Taiwen

AU - Zhang, Jian

AU - Qiu, Xintao

AU - Pun, Matthew

AU - Jeselsohn, Rinath

AU - Brown, Myles

AU - Liu, X. Shirley

AU - Long, Henry W.

PY - 2018/4/12

Y1 - 2018/4/12

N2 - Background: RNA sequencing has become a ubiquitous technology used throughout life sciences as an effective method of measuring RNA abundance quantitatively in tissues and cells. The increase in use of RNA-seq technology has led to the continuous development of new tools for every step of analysis from alignment to downstream pathway analysis. However, effectively using these analysis tools in a scalable and reproducible way can be challenging, especially for non-experts. Results: Using the workflow management system Snakemake we have developed a user friendly, fast, efficient, and comprehensive pipeline for RNA-seq analysis. VIPER (Visualization Pipeline for RNA-seq analysis) is an analysis workflow that combines some of the most popular tools to take RNA-seq analysis from raw sequencing data, through alignment and quality control, into downstream differential expression and pathway analysis. VIPER has been created in a modular fashion to allow for the rapid incorporation of new tools to expand the capabilities. This capacity has already been exploited to include very recently developed tools that explore immune infiltrate and T-cell CDR (Complementarity-Determining Regions) reconstruction abilities. The pipeline has been conveniently packaged such that minimal computational skills are required to download and install the dozens of software packages that VIPER uses. Conclusions: VIPER is a comprehensive solution that performs most standard RNA-seq analyses quickly and effectively with a built-in capacity for customization and expansion.

AB - Background: RNA sequencing has become a ubiquitous technology used throughout life sciences as an effective method of measuring RNA abundance quantitatively in tissues and cells. The increase in use of RNA-seq technology has led to the continuous development of new tools for every step of analysis from alignment to downstream pathway analysis. However, effectively using these analysis tools in a scalable and reproducible way can be challenging, especially for non-experts. Results: Using the workflow management system Snakemake we have developed a user friendly, fast, efficient, and comprehensive pipeline for RNA-seq analysis. VIPER (Visualization Pipeline for RNA-seq analysis) is an analysis workflow that combines some of the most popular tools to take RNA-seq analysis from raw sequencing data, through alignment and quality control, into downstream differential expression and pathway analysis. VIPER has been created in a modular fashion to allow for the rapid incorporation of new tools to expand the capabilities. This capacity has already been exploited to include very recently developed tools that explore immune infiltrate and T-cell CDR (Complementarity-Determining Regions) reconstruction abilities. The pipeline has been conveniently packaged such that minimal computational skills are required to download and install the dozens of software packages that VIPER uses. Conclusions: VIPER is a comprehensive solution that performs most standard RNA-seq analyses quickly and effectively with a built-in capacity for customization and expansion.

KW - Analysis

KW - Gene fusion

KW - Immunological infiltrate

KW - Pipeline

KW - RNA-seq

KW - Snakemake

UR - http://www.scopus.com/inward/record.url?scp=85045378045&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85045378045&partnerID=8YFLogxK

U2 - 10.1186/s12859-018-2139-9

DO - 10.1186/s12859-018-2139-9

M3 - Article

C2 - 29649993

AN - SCOPUS:85045378045

VL - 19

JO - BMC Bioinformatics

JF - BMC Bioinformatics

SN - 1471-2105

IS - 1

M1 - 135

ER -