Hi !! Following up on the call !!
Here is a summary about making Deepchem more useful for bioinformatics
Datasets and loader (related with MolNet)
The addition of labeled and unlaballed data of proteomics and genomics databases could improve MoleculeNet.
Some examples are : Swissprot and TREMBL Database (unlabeled) or FireProt (labeled data).
Some loaders for formats such as .bam/.sam etc.
Featurization and Representations
Features extractors from protein sequences such as amino acid composition, amphiphilic pseudo amino acid composition descriptors and others.
Additionally, standard protein featurizations such as Multiple sequence alignment and MeanContactMaps
Some models for specific tasks
RNA structure prediction or modelling of protein surface interactions.
(New) Simulations and Molecular Modeling Support.
Based on “Adding a Simulations Modules” in the discussion, Making DeepChem a Better Framework for AI-Driven Science. Additional support for docking, visualization and molecular dynamics simulation set up and execution could improve deepchem capabilities.