retrieve_seqs.Rd
Retrieve Sequences
retrieve_seqs( experiment, data_type = c("tss", "tsr", "shift"), samples = "all", genome_assembly = NULL, threshold = NULL, dominant = FALSE, fixed_size = FALSE, extend_upstream = NULL, extend_downstream = NULL, return_format = "biostrings", output_dir = NULL )
experiment | TSRexploreR object. |
---|---|
data_type | Get sequences around TSSs ('tss'), TSRs ('tsr'), or TSS shifting results ('shift'). |
samples | A vector of sample names to analyze. |
genome_assembly | Genome assembly in FASTA or BSgenome format. |
threshold | TSSs or TSRs with a score below this value will not be considered. |
dominant | If TRUE, will only consider the highest-scoring TSS per gene, transcript, or TSR or highest-scoring TSR per gene or transcript. |
fixed_size | Whether all returned sequences should have the same fixed size starting from the center of the ranges. |
extend_upstream | Distance to extend ranges upstream. |
extend_downstream | Distance to extend ranges downstream. |
return_format | Either 'table' or 'biostrings'. |
output_dir | If set, the results will be saved to this directory. If return format is set to 'biostrings' the sequences will be saved as FASTA files. If return format is a table, it will be saved as a tab delimited table. |