Skip to contents

This function helps create an AnnotationFilter::AnnotationFilterList that will combine multiple standard filters based on the flags.

Usage

build_annotation_filter(
  standard.chroms = TRUE,
  remove.na.rows = TRUE,
  protein.coding = TRUE,
  transcript.support = NULL,
  canonical = FALSE,
  gene_id = NULL,
  tx_id = NULL,
  symbol = NULL,
  entrez_id = NULL,
  add.filter = NULL
)

Arguments

standard.chroms

Keep only standard chromosomes. Boolean.

remove.na.rows

Remove transcripts with NA in the ID column. Boolean.

protein.coding

Keep only protein coding genes. Boolean.

transcript.support

Filter by transcript support level. Integer 1-4

canonical

Keep only the ENSEMBL canonical transcript. Boolean.

gene_id

Keep specific ENSEMBL ID(s). Input is a vector

tx_id

Keep specific tx_id(s). Input is a vector

symbol

Keep specific gene symbol(s). Input is a vector

entrez_id

Keep specific ENTREZ ID(s). Input is a vector

add.filter

Optional AnnotationFilter::AnnotationFilterList to include

Examples

# Default: standard chromosomes, removes na tx_ids, and keep protein coding
afl = SeedMatchR::build_annotation_filter()
#> Build AnnotationFilter for transcript features based on the following parameters: 
#> Keep only standard chroms: TRUE
#> Remove rows with NA in transcript ID: TRUE
#> Keep only protein coding genes and transcripts: TRUE
#> Filtering for transcripts with support level: FALSE
#> Keep only the ENSEMBL canonical transcript: FALSE
#> Filtering for specific genes: FALSE
#> Filtering for specific transcripts: FALSE
#> Filtering for specific gene symbols: FALSE
#> Filtering for specific entrez id: FALSE

# Keep only transcripts with support level 1
afl = SeedMatchR::build_annotation_filter(transcript.support = 1)
#> Build AnnotationFilter for transcript features based on the following parameters: 
#> Keep only standard chroms: TRUE
#> Remove rows with NA in transcript ID: TRUE
#> Keep only protein coding genes and transcripts: TRUE
#> Filtering for transcripts with support level: 1
#> Keep only the ENSEMBL canonical transcript: FALSE
#> Filtering for specific genes: FALSE
#> Filtering for specific transcripts: FALSE
#> Filtering for specific gene symbols: FALSE
#> Filtering for specific entrez id: FALSE
#> Warning: replacing previous import ‘S4Arrays::makeNindexFromArrayViewport’ by ‘DelayedArray::makeNindexFromArrayViewport’ when loading ‘SummarizedExperiment’