PUBLICATIONS
PHACE: Phylogeny-Aware Detection of Molecular Coevolution
The coevolution trends of amino acids within or between genes offer key insights into protein structure and function. Existing tools for uncovering coevolutionary signals primarily rely on multiple sequence alignments, often overlooking phylogenetic relatedness and shared evolutionary history. Here, we introduce PHACE, a phylogeny-aware coevolution algorithm that maps amino acid substitutions onto a phylogenetic tree to detect molecular coevolution. PHACE categorizes amino acids at each position into “tolerable” and “intolerable” groups, based on their independent recurrence across the tree, reflecting a position’s tolerance to specific substitutions. Gaps are treated as a third character type, with only phylogenetically independent gap changes considered. The method computes substitution scores per branch by traversing the tree and quantifying probability differences across adjacent nodes for each group. To avoid artifacts from alignment errors, we apply a multiple sequence alignment–masking procedure. Compared to phylogeny-based methods (CAPS, CoMap) and state-of-the-art multiple sequence alignment–based approaches (DCA, GaussDCA, PSICOV, mutual information), PHACE shows significantly superior accuracy in identifying coevolving residue pairs, as measured by statistical metrics including Matthews correlation coefficient, area under the ROC curve, and F1 score. This performance stems from PHACE’s explicit modeling of phylogenetic dependencies, often ignored in coevolution analyses.
- Related:
- Evolutionary history of Calcium-sensing receptors sheds light into hyper/hypocalcemia-causing mutations
- Structural Basis of Frizzled 7 Activation and Allosteric Regulation
- PHACTboost: A Phylogeny-aware Pathogenicity Predictor for the Missense Mutations via Boosting
- Cross-species investigation into the requirement of XPA for nucleotide excision repair
- Sibling rivalry among the ZBTB transcription factor family: homodimers versus heterodimers
- PHACT: Phylogeny-Aware Computing of Tolerance for Missense Mutations
- Common and selective signal transduction mechanisms of GPCRs
- Evolutionary association of receptor-wide amino acids with G protein-coupling selectivity in aminergic GPCRs
- Phylostat: a web-based tool to analyze paralogous clade divergence in phylogenetic trees
- The mutation profile of SARS-CoV-2 is primarily shaped by the host antiviral defense
- The utility of next-generation sequencing technologies in diagnosis of Mendelian mitochondrial diseases and reflections on clinical spectrum
- Phylogenetic analysis of SARS-CoV-2 genomes in Turkey
- Class III histidine kinases: a recently accessorized kinase domain in putative modulators of type IV pili based motility.
- Cache domains that are homologous to, but different from PAS domains comprise the largest superfamily of extracellular sensors in prokaryotes.
- Establishing the precise evolutionary history of a gene improves prediction of disease-causing missense mutations.