DeepChem GSoC 2025 Project Ideas

Dear DeepChem Team,

My name is Alona Kharina, and I am currently in my 4th year at the National University of Pharmacy in Ukraine, specializing in Pharmaceutical Technologies Engineering. I have gained valuable experience, including participation in the iEngage Summer Student Research Programme at the Wellcome-Wolfson Institute for Experimental Medicine in the UK. Additionally, I attended the Machine Learning School in Health & Bio at the Mathematical Institute, University of Oxford, and I have completed online courses through Harvard Medical School.

I am very interested in applying for GSoC 2025 under DeepChem and have reviewed the available project ideas. Two specific project ideas that particularly caught my attention are:

  • Improving New Drug Modality Support
  • Improving Support for Drug Formulations

If you require any further information regarding my background, or if you would like to see my CV, motivation letter, or any additional materials, please feel free to contact me at alonakharina27@gmail.com.

Thank you for your time and consideration. I look forward to hearing from you.

Best regards,
Alona Kharina

Iā€™m a 7th-semester undergraduate student in Systems Engineering at UIS in Colombia. Iā€™m currently part of a research hotbed on Computer Vision and AI, where I contribute to building educational resources and open-source tools. Through this experience, Iā€™ve developed a strong interest in democratizing AI and Machine learning and enhancing the community of development on these fields.

Iā€™d love to join this project to deepen my practical understanding of layer-based design and help expand DeepChemā€™s tutorial ecosystem. While most of my experience is with PyTorch, Iā€™m comfortable working with modular architectures and eager to dive into the inner workings of DeepChemā€™s layer abstractions.

We also maintain a public-facing website where we share tutorials, results, and experiments, so I already have experience building technical content from scratch. Iā€™m excited about the chance to work with DeepChemā€™s team, contribute new tutorials, and help make ML tooling more approachable for everyone.

Hereā€™s a link to our site: HoCV

Hello, DeepChem Team,

Iā€™m a 7th-semester undergraduate student in Systems Engineering at UIS in Colombia. Iā€™m currently part of a research hotbed on Computer Vision and AI, where I contribute to building educational resources and open-source tools. Through this experience, Iā€™ve developed a strong interest in democratizing AI and Machine learning and enhancing the community of development on these fields.

Iā€™m interested in one particular idea: Layer Tutorials

Iā€™d love to join this project to deepen my practical understanding of layer-based design and help expand DeepChemā€™s tutorial ecosystem. While most of my experience is with PyTorch, Iā€™m comfortable working with modular architectures and eager to dive into the inner workings of DeepChemā€™s layer abstractions.

We also maintain a public-facing website where we share tutorials, results, and experiments, so I already have experience building technical content from scratch. Iā€™m excited about the chance to work with DeepChemā€™s team, contribute new tutorials, and help make ML tooling more approachable for everyone.

Hereā€™s a link to our site: HoCV

Thanks for reading, wish you the best,
Juan CalderĆ³n

Hi DeepChem Team,

I hope youā€™re doing well. My name is Dhandeep, and Iā€™m excited to express my interest in applying for GSoC 2025 under DeepChem. After carefully going through the list of project ideas, the ā€œConversion of SMILES to IUPAC and IUPAC to SMILESā€ project stood out to me as a perfect match for my skills, interests, and long-term goals.

Why this project?

As someone deeply passionate about the intersection of machine learning and molecular science, I find this project incredibly compelling. SMILES and IUPAC are essential representations in cheminformatics, and building robust tools for bidirectional conversion would not only enhance DeepChemā€™s utility but also enable smoother integrations with external datasets and tools. Working on this project would allow me to deepen my understanding of chemical data structures, graph representations, and algorithm optimization, while making a tangible contribution to the open-source scientific community.

Relevant Background

I have completed the Machine Learning and Deep Learning Specializations by DeepLearning.AI, and Iā€™ve been actively exploring AI applications in drug discovery and healthcare. Iā€™ve worked on hybrid deep learning models (e.g., combining CNNs with structured state-space models), and have experience designing and debugging complex architectures, particularly in PyTorch. My academic foundation is solid, and Iā€™m currently focused on bridging AI with meaningful real-world applications in the biotech and pharmaceutical sectors.

While Iā€™m new to contributing to DeepChem, I have been working on open-source projects and research-focused development, and Iā€™m confident in writing clean, maintainable code, performing in-depth debugging, and documenting my work effectively.

Iā€™d love your feedback on how I can best align my skills with this project and begin contributing meaningfully ahead of the proposal submission phase. Thank you for building such a welcoming and impactful open-source ecosystemā€”Iā€™m really excited about the possibility of contributing.

Looking forward to your suggestions and guidance!

Warm regards,
Dhandeep

Heloo DeepChem team,

I know I might be late while replying to the comment chain on the last day of the project proposal deadline as I have just come across the project and program but I would to still like to get my word across my deep interest in the project " Improve Equivariance Support" I am a masters student pursuing AI . I am looking for opportunities to learn grow and get more insights in the field of Machine Learning and AI I feel this would would help me achieve my goals towards that. Looking forward to contribute towards advancement of this project and learn together!
Github: https://github.com/manaswithasunkara
Email: manaswitha.sunkara@gmail.com

Hello DeepChem Team,
Iā€™m interest in the ā€œImprove Equivariance Supportā€ project idea and would love the opportunity to contribute. Through two machine learning internships, Iā€™ve gained hands-on experience working with complex ML systems, and Iā€™m eager to apply that knowledge to the development of equivariant models. My academic background includes coursework using TensorFlow and Keras, where I implemented transformer architectures and explored modeling techniques to quantify noise and uncertainty. What draws me to this project is the chance to work with the same principles that powered systems like AlphaFold2. And I see this as a great opportunity to contribute meaningfully to the open-source community while improving my understanding of how machine learning can impact the scientific field. Thank you for considering me!

Hi,

Iā€™m a third-year Computer Science student with a strong foundation in coding, and Iā€™m very interested in contributing to DeepChem through GSoC 2025. Specifically, Iā€™m excited about the following two beginner-friendly project ideas:

  1. Layer Tutorials ā€“ Iā€™d love to help expand and improve the documentation and tutorials for DeepChemā€™s layer collection. I enjoy technical writing and have experience creating clear Jupyter/Colab notebooks.
  2. Improving New Drug Modality Support ā€“ Iā€™m fascinated by the potential of emerging drug modalities and would be excited to help add tutorials and datasets for PROTACs, ADCs, and other innovative therapeutics.

I would really appreciate any guidance on how I can get started and connect with mentors or the community. Looking forward to contributing!

Best,
Aayush

Hi DeepChem Team,

Iā€™m Shubham Vishwakarma, a final-year undergrad from India, and Iā€™m excited about contributing to DeepChem through GSoC 2025. The project on SMILES ā†” IUPAC conversion closely aligns with my interests in computational chemistry, software development, and open-source contribution.

Iā€™ve submitted my proposal along with a working demo based on my initial idea for the project. I have experience with machine learning, NLP, and chemical data structures, and have previously worked on real-world applications of AI, including satellite image analysis and educational tools. My open-source contributions include work in GSSoC, C4GT, and various Chrome extension projects focused on accessibility.

Iā€™d really appreciate any feedback or suggestions on how I can improve or align better with the DeepChem ecosystem. Looking forward to the opportunity to contribute and learn.

Best regards,
Shubham

Github : https://github.com/vi-shub
Email : smsharma3121@gmail.com

Dear DeepChem Team,

My name is Alikhan, currently an undergraduate student majoring in Computer Science with a minor in Chemistry, and I am working as a research assistant in the Grzybowski Lab at Ulsan National Institute of Science and Technology and the Center for Algorithmic and Robotized Synthesis (CARS-IBS).

For the past year, Iā€™ve been working with cheminformatics pipelines. My work included processing, filtering, and clustering an open-source database of natural products, vectorization of 50.000 organic reactions, developing an algorithm to automate NMR spectra characterization, as well as working on QM and SQM calculations for small organic molecules.

In one of my projects, I was conducting a big data analysis of the collection of open natural products (COCONUT database), which included filtering and processing of over 400,000 organic molecules, as well as visualizing 10,000 of them on a 2D plot using the t-SNE method. While doing so, I learned data science techniques as well as useful medicinal chemistry methods for drug discovery, including the all-time classic Lipinskiā€™s Rule of Five for filtering drug candidates. So I am particularly interested in ā€œImproving support for drug formulationsā€ as well as in ā€œConversion of Smiles to IUPAC and IUPAC to Smilesā€ projects.

I welcome the chance to discuss how my skills can drive innovative solutions in your team and am looking forward to your response!

Sincerely,
Alikhan Smet