Multimodal Representation Learning for Proteins
- Developing a multimodal (sequence, structure) representation learning pipeline for protein fitness (binding, catalysis) prediction
- Building an easy-to-use predictor of single- to multi-mutant protein fitness that incorporates mutation sequence context (coevolution, stability, and biochemical rules) across 11 diverse datasets