2.D: PANTHER & rbioapi


Introduction

Directly quoting the the paper published by PANTHER (Protein Analysis THrough Evolutionary Relationships) authors:

The PANTHER classification system (https://www.pantherdb.org) a comprehensive system that combines genomes, gene function , pathways and statistical analysis tools to enable to analyze large-scale genome-wide experimental data. The system (PANTHER v.14.0) covers 131 complete genomes organized gene families and subfamilies; evolutionary relationships between are represented in phylogenetic trees, multiple sequence and statistical models (hidden Markov models (HMMs)). The families and subfamilies are annotated with Gene Ontology (GO) terms, sequences are assigned to PANTHER pathways. A suite of tools has built to allow users to browse and query gene functions and analyze-scale experimental data with a number of statistical tests. is widely used by bench scientists, bioinformaticians, computer and systems biologists.

(source: Mi, Huaiyu, et al. “Protocol Update for large-scale genome and gene function analysis with the PANTHER classification system (v. 14.0).” Nature protocols 14.3 (2019): 703-721)

The available tools in PANTHER’s RESTful API services can be divided into 3 broad categories: Mapping genes, retrieving information, and research tools. Herein, we provide a very short introduction; you can always check functions’ manuals for detailed guides and examples.


Map genes

  • rba_panther_mapping(): map your gene-set to PANTHER database and retrieve attributes and annotations associated with your genes

  • rba_panther_ortholog(): Retrieve Orthologs of your genes

  • rba_panther_homolog(): Retrieve Homologs of your genes


Get information

  • rba_panther_info(): Retrieve a list of PANTHER’s supported organisms, datasets, families, or pathways

  • rba_panther_family(): Retrieve Orthologs, MSA, or Tree topology of a given PANTHER family.


Research tools

Gene List Analysis

rba_panther_enrich() is an equivalent to Gene List analysis tool’s webpage.. Here is a usage example:

## 1 We get the available annotation datasets in PANTHER (we need to select one of them to submit an enrichment request)
annots <- rba_panther_info(what = "datasets")
#> Retrieving available annotation datasets.
# Note that you should enter the "id" of the datasets, not its label (e.g. entering "biological_process" is incorrect, you should rather enter "GO:0008150").
## 2 We create a variable with our genes' IDs
genes <- c("p53", "BRCA1", "cdk2", "Q99835", "CDC42","CDK1","KIF23","PLK1",
           "RAC2","RACGAP1","RHOA","RHOB", "PHF14", "RBM3", "MSL1")
## 3 Now we can submit the enrichment request.
enriched <- rba_panther_enrich(genes = genes,
                               organism = 9606,
                               annot_dataset = "ANNOT_TYPE_ID_PANTHER_PATHWAY",
                               cutoff = 0.05)
#> Performing over-representation enrichment analysis of 15 input genes of organism 9606 against ANNOT_TYPE_ID_PANTHER_PATHWAY datasets.

Please Note: Other services supported by rbioapi also provide Over-representation analysis tools. Please see the vignette article Do with rbioapi: Over-Representation (Enrichment) Analysis in R (link to the documentation site) for an in-depth review.

Tree grafter

rba_panther_tree_grafter() is an equivalent to the “Graft sequence into PANTHER library of trees” tool.


How to Cite?

To cite PANTHER (Please see https://www.pantherdb.org/publications.jsp#HowToCitePANTHER):

  • Huaiyu Mi, Dustin Ebert, Anushya Muruganujan, Caitlin Mills, Laurent-Philippe Albou, Tremayne Mushayamaha, Paul D Thomas, PANTHER version 16: a revised family classification, tree-based classification tool, enhancer regions and extensive API, Nucleic Acids Research, Volume 49, Issue D1, 8 January 2021, Pages D394–D403, https://doi.org/10.1093/nar/gkaa1106

To cite rbioapi:

  • Moosa Rezwani, Ali Akbar Pourfathollah, Farshid Noorbakhsh, rbioapi: user-friendly R interface to biologic web services’ API, Bioinformatics, Volume 38, Issue 10, 15 May 2022, Pages 2952–2953, https://doi.org/10.1093/bioinformatics/btac172

Session info

#> R version 4.4.2 (2024-10-31)
#> Platform: x86_64-pc-linux-gnu
#> Running under: Ubuntu 24.04.1 LTS
#> 
#> Matrix products: default
#> BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3 
#> LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.26.so;  LAPACK version 3.12.0
#> 
#> locale:
#>  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
#>  [3] LC_TIME=en_US.UTF-8        LC_COLLATE=C              
#>  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
#>  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
#>  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
#> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
#> 
#> time zone: Etc/UTC
#> tzcode source: system (glibc)
#> 
#> attached base packages:
#> [1] stats     graphics  grDevices utils     datasets  methods   base     
#> 
#> other attached packages:
#> [1] rbioapi_0.8.0  rmarkdown_2.29
#> 
#> loaded via a namespace (and not attached):
#>  [1] digest_0.6.37     R6_2.5.1          fastmap_1.2.0     xfun_0.49        
#>  [5] magrittr_2.0.3    maketools_1.3.1   cachem_1.1.0      knitr_1.49       
#>  [9] htmltools_0.5.8.1 buildtools_1.0.0  lifecycle_1.0.4   DT_0.33          
#> [13] cli_3.6.3         sass_0.4.9        jquerylib_0.1.4   compiler_4.4.2   
#> [17] httr_1.4.7        sys_3.4.3         tools_4.4.2       curl_6.0.0       
#> [21] evaluate_1.0.1    bslib_0.8.0       yaml_2.3.10       htmlwidgets_1.6.4
#> [25] jsonlite_1.8.9    rlang_1.1.4       crosstalk_1.2.1