Hey everyone!
I’m Elisa Gómez de Lope, and I’m excited about joining the DeepChem community as a contributor, as part of the Google Summer of Code 2024 program.
The project: I’ll be working on integrating the pre-trained ESM-2 protein representations from Hugging Face transformers with DeepChem’s framework capabilities for predicting binding sites. The goal is to develop a straightforward way to predict protein binding sites in DeepChem, leveraging the strengths of both libraries, and documenting the process through a tutorial.
About me: My interests root in applying machine learning to biological data. I recently graduated from my PhD and am now a researcher at the university of Luxembourg, where I study graph representation learning methods for modeling omics data in the context of Parkinson’s disease. I’ve been following the developments in LLMs and am particularly fascinated by the potential of these models in predicting protein functions and properties.
I’m eager to contribute to the DeepChem open source library by developing a tutorial that guides researchers and enthusiasts to further exploit protein language models, not only for binding site prediction, but for other use cases and functionalities as well (e.g., generating peptide binders, predicting the effect of mutations, or protein-protein interactions), ultimately enabling larger adoption and accelerating drug discovery efforts.
This thread will be updated weekly with the latest updates on the project, so stay tuned if you’re interested in following up with my work over the next few months.
You can also find me on…
Github: elisagdelope
Twitter: @elisagdelope