Moving in a crowded cellular environment, proteins have to recognize and bind to each other with high specificity. This specificity reflects in a combination of geometric and chemical complementarities at the core of interacting regions that ultimately influences binding stability. Exploiting such peculiar complementarity patterns, we recently developed CIRNet, a neural network architecture capable of identifying pairs of protein core interacting residues and assisting docking algorithms by rescaling the proposed poses. Here, we present a detailed analysis of the geometric and chemical descriptors utilized by CIRNet, investigating its decision-making process to gain deeper insights into the interactions governing protein-protein binding and their interdependence. Specifically, we quantitatively assess (i) the relative importance of chemical and physical features in network training and (ii) their interplay at protein interfaces. We show that shape and hydrophobic-hydrophilic complementarities contain the most predictive information about the classification outcome. Electrostatic complementarity alone does not achieve high classification accuracy but is required to boost learning. Ultimately, our findings suggest that identifying the most information-dense features may enhance our understanding of the mechanisms driving protein-protein interactions at core interfaces.

Exploring neural networks to uncover information-richer features for protein interaction prediction

Lorenzo Di Rienzo;
2025-01-01

Abstract

Moving in a crowded cellular environment, proteins have to recognize and bind to each other with high specificity. This specificity reflects in a combination of geometric and chemical complementarities at the core of interacting regions that ultimately influences binding stability. Exploiting such peculiar complementarity patterns, we recently developed CIRNet, a neural network architecture capable of identifying pairs of protein core interacting residues and assisting docking algorithms by rescaling the proposed poses. Here, we present a detailed analysis of the geometric and chemical descriptors utilized by CIRNet, investigating its decision-making process to gain deeper insights into the interactions governing protein-protein binding and their interdependence. Specifically, we quantitatively assess (i) the relative importance of chemical and physical features in network training and (ii) their interplay at protein interfaces. We show that shape and hydrophobic-hydrophilic complementarities contain the most predictive information about the classification outcome. Electrostatic complementarity alone does not achieve high classification accuracy but is required to boost learning. Ultimately, our findings suggest that identifying the most information-dense features may enhance our understanding of the mechanisms driving protein-protein interactions at core interfaces.
2025
protein-protein interactions
machine learning
computational biology
File in questo prodotto:
File Dimensione Formato  
Grassmann_Exploring-neural-networks_2025.pdf

non disponibili

Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 1.5 MB
Formato Adobe PDF
1.5 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14245/16970
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
social impact