TSRexploreR facilitates processing and exploration of data generated by global transcription start site (TSS) mapping technologies such as CAGE and STRIPE-seq. TSSs can be extracted and aggregated from mapped reads in BAM/SAM format or pre-aggregated TSSs in GRanges, bigWig, bedGraph, or CTSS format can be imported. TSSs can then be normalized and clustered into transcription start regions (TSRs), or TSRs detected with other software such as CAGEr can be imported.
A strength of TSRexploreR is the flexibility and ease with which data can be visualized. Numerous plots can be generated for TSSs and TSRs such as density plots, heatmaps, sequence logos, gene tracks, correlation heatmaps, and many more. For most plots, we also provide the ability to filter, order, quantile, and group TSSs or TSRs by various metrics such as inter-quantile range, shape index, and score. TSRexploreR also facilitates detection of differential TSR usage with DESeq2 and edgeR.
We also introduce a new approach to discovering shifts in TSS distributions within merged TSRs from distinct samples. TSRexploreR calculates a signed version of earth mover’s distance, which we term earth mover’s score (EMS), which indicates both the magnitude and direction of a shift. EMS calculation is statistically controlled by a permutation test, which generates a p-value and FDR threshold.
TSRexploreR can be installed wholly from Github (with or without a conda environment) or run from within a Singularity container.
Sys.setenv("R_REMOTES_NO_ERRORS_FROM_WARNINGS"="true")
devtools::install_github("zentnerlab/TSRexploreR")
Conda can be optionally used before the above Github installation step to download most of the package dependencies. This also has the benefit of creating an R environment separate from your main software environment, which helps avoid dependency version conflicts and makes removing associated sofware easy if desired. Installation instructions for miniconda3 can be found here.
First, create the conda environment.
conda create -n TSRexploreR -y -c conda-forge -c bioconda \
qpdf r-base>=4.0.0 r-tidyverse r-data.table r-devtools r-ggseqlogo r-cowplot r-rcpp \
r-assertthat r-testthat r-cairo r-ggrastr r-pkgdown \
bioconductor-apeglm bioconductor-genomicranges bioconductor-genomicfeatures \
bioconductor-genomicalignments bioconductor-biostrings bioconductor-rsamtools \
bioconductor-chipseeker bioconductor-edger bioconductor-deseq2 bioconductor-clusterProfiler \
bioconductor-complexheatmap bioconductor-cager bioconductor-tsrchitect bioconductor-gviz \
bioconductor-rtracklayer bioconductor-biocgenerics bioconductor-plyranges \
bioconductor-pcatools bioconductor-genomeinfodb bioconductor-bsgenome bioconductor-bioccheck \
bioconductor-bsgenome.scerevisiae.ucsc.saccer3 bioconductor-txdb.scerevisiae.ucsc.saccer3.sgdgene
After the environment has been created, it must be activated prior to use.
conda activate TSRexploreR
TSRexploreR can now be installed into this environment, where it will be available upon activation of the environment.
Sys.setenv("R_REMOTES_NO_ERRORS_FROM_WARNINGS"="true")
devtools::install_github("zentnerlab/TSRexploreR")
The TSRexploreR Singularity container is akin to a box that contains TSRexploreR and all required dependencies. The advantage of containerized software is that it requires no installation (beyond the main container software), is self-contained, and provides stable and reproducible results over time. Singularity installation instructions can be found here.
After Singularity is installed, the container can be pulled and run in a few commands. First, create a working directory with all required files and run the following commands.
Download the singularity container.
singularity pull --arch amd64 library://zentlab/default/tsrexplorer:main
You can then run R from within the container housing TSRexploreR.
singularity exec -eCB `pwd` -H `pwd` tsrexplorer_main.sif R
We provide several vignettes to showcase the various functionalities of TSRexplorer: