A new SFB1551 paper is out! Congrats to Andrade, Dormann & Walther Labs!
Another SFB1551 paper is out! 

Congratulations to Eric Schumbera, Dorothee Dormann, Andreas Walther and Miguel Andrade on their recent publication in BMC Genomics: “Computational investigation of the sequence context of arginine/glycine-rich motifs in the human proteome”.

In this paper, the authors present a computational framework to systematically explore how RG (arginine/glycine)-rich motifs—commonly found in intrinsically disordered protein regions—differ between functional (RNA-binding, phase-separating) and nonfunctional proteins.

This publication is the fruit of a joint collaboration within the SFB1551 framework, specifically tied to Project R13: “Pronuclei as a Platform to Study Phase Behaviour and Biological Function of Arginine/Glycine-Rich Proteins”.That project explores how RG-rich proteins behave in constrained nuclear microenvironments (like pronuclei), and how their phase behaviours relate to biological function.

Abstract

Arginine-glycine (RG)-rich motifs are among the most prevalent RNA-binding elements within intrinsically disordered regions (IDRs) of proteins and play crucial roles in RNA metabolism, gene regulation, and the formation of membraneless organelles via liquid phase separation (LLPS). Despite their biological relevance and implication in neurological disorders and cancer, the sequence features and context dependencies that define functional RG motifs remain poorly characterized owing to their disordered nature and sequence variability. In this study, we present a computational framework to dissect the sequence and structural context of RG motifs across the human proteome. By contrasting a functionally defined positive dataset—enriched for RNA-binding and phase-separating proteins—with a negative dataset of RG motif proteins lacking these annotations, we identified distinct compositional and contextual signatures. RG motifs in the functionally defined positive dataset show increased enrichment of phenylalanine, tyrosine, aspartic acid, and asparagine, both within and around the motif, as well as nonrandom spatial relationships with structured RNA-binding domains. Notably, phenylalanine and tyrosine exhibit divergent positional and functional profiles, suggesting distinct mechanistic roles. Our analysis highlights the potential of sequence-based approaches to uncover functional determinants in disordered protein regions and further advances our understanding of the properties of RG motifs, offering a transferable framework for the study of other low-complexity motifs.

👉 Read the full article here: Computational investigation of the sequence context of arginine/glycine-rich motifs in the human proteome | BMC Genomics | Full Text

Many thanks to all collaborators, computational and experimental teams alike, for making this possible!