Build an AnnotationFilter::AnnotationFilter for selecting transcripts
build_annotation_filter.Rd
This function helps create an AnnotationFilter::AnnotationFilterList that will combine multiple standard filters based on the flags.
Usage
build_annotation_filter(
standard.chroms = TRUE,
remove.na.rows = TRUE,
protein.coding = TRUE,
transcript.support = NULL,
canonical = FALSE,
gene_id = NULL,
tx_id = NULL,
symbol = NULL,
entrez_id = NULL,
add.filter = NULL
)
Arguments
- standard.chroms
Keep only standard chromosomes. Boolean.
- remove.na.rows
Remove transcripts with NA in the ID column. Boolean.
- protein.coding
Keep only protein coding genes. Boolean.
- transcript.support
Filter by transcript support level. Integer 1-4
- canonical
Keep only the ENSEMBL canonical transcript. Boolean.
- gene_id
Keep specific ENSEMBL ID(s). Input is a vector
- tx_id
Keep specific tx_id(s). Input is a vector
- symbol
Keep specific gene symbol(s). Input is a vector
- entrez_id
Keep specific ENTREZ ID(s). Input is a vector
- add.filter
Optional AnnotationFilter::AnnotationFilterList to include
Value
An AnnotationFilter::AnnotationFilterList object
Examples
# Default: standard chromosomes, removes na tx_ids, and keep protein coding
afl = SeedMatchR::build_annotation_filter()
#> Build AnnotationFilter for transcript features based on the following parameters:
#> Keep only standard chroms: TRUE
#> Remove rows with NA in transcript ID: TRUE
#> Keep only protein coding genes and transcripts: TRUE
#> Filtering for transcripts with support level: FALSE
#> Keep only the ENSEMBL canonical transcript: FALSE
#> Filtering for specific genes: FALSE
#> Filtering for specific transcripts: FALSE
#> Filtering for specific gene symbols: FALSE
#> Filtering for specific entrez id: FALSE
# Keep only transcripts with support level 1
afl = SeedMatchR::build_annotation_filter(transcript.support = 1)
#> Build AnnotationFilter for transcript features based on the following parameters:
#> Keep only standard chroms: TRUE
#> Remove rows with NA in transcript ID: TRUE
#> Keep only protein coding genes and transcripts: TRUE
#> Filtering for transcripts with support level: 1
#> Keep only the ENSEMBL canonical transcript: FALSE
#> Filtering for specific genes: FALSE
#> Filtering for specific transcripts: FALSE
#> Filtering for specific gene symbols: FALSE
#> Filtering for specific entrez id: FALSE
#> Warning: replacing previous import ‘S4Arrays::makeNindexFromArrayViewport’ by ‘DelayedArray::makeNindexFromArrayViewport’ when loading ‘SummarizedExperiment’