PUBLICATIONS
PHACTboost: A Phylogeny-aware Pathogenicity Predictor for the Missense Mutations via Boosting
Most algorithms used to predict the effects of variants rely on evolutionary conservation. However, most such techniques compute conservation solely from multiple sequence alignments, overlooking the evolutionary context of substitution events. In a previous study, we introduced PHACT, a scoring-based pathogenicity predictor for missense mutations that leverages phylogenetic trees. Building on this foundation, we now propose PHACTboost, a gradient boosting tree-based classifier that combines PHACT scores with information from multiple sequence alignments, phylogenetic trees, and ancestral reconstruction. Comprehensive experiments on carefully constructed sets of variants demonstrated that PHACTboost can outperform 40 prevalent pathogenicity predictors reported in dbNSFP, including conventional tools, meta-predictors, and deep learning-based approaches, as well as the state-of-the-art tools AlphaMissense, EVE, and CPT-1. The superiority of PHACTboost over these methods was particularly evident for hard variants, for which different pathogenicity predictors offered conflicting results. We provide predictions for 219 million missense variants across 20,191 proteins. PHACTboost can improve our understanding of genetic diseases and facilitate more accurate diagnoses.
- Related:
- Evolutionary history of Calcium-sensing receptors sheds light into hyper/hypocalcemia-causing mutations
- Structural Basis of Frizzled 7 Activation and Allosteric Regulation
- Cross-species investigation into the requirement of XPA for nucleotide excision repair
- Sibling rivalry among the ZBTB transcription factor family: homodimers versus heterodimers
- PHACT: Phylogeny-Aware Computing of Tolerance for Missense Mutations
- Common and selective signal transduction mechanisms of GPCRs
- Evolutionary association of receptor-wide amino acids with G protein-coupling selectivity in aminergic GPCRs
- Phylostat: a web-based tool to analyze paralogous clade divergence in phylogenetic trees
- The mutation profile of SARS-CoV-2 is primarily shaped by the host antiviral defense
- The utility of next-generation sequencing technologies in diagnosis of Mendelian mitochondrial diseases and reflections on clinical spectrum
- Phylogenetic analysis of SARS-CoV-2 genomes in Turkey
- Class III histidine kinases: a recently accessorized kinase domain in putative modulators of type IV pili based motility
- Cache domains are dominant extracellular sensors for signal transduction in prokaryotes
- Establishing the precise evolutionary history of a gene improves predicting disease-causing missense mutations