First Ascent by
Nathan Tran
The Molecular Glue Problem
Medicinal Chemistry / Targeted Protein Degradation
The Proposed Route
A route through the emerging landscape of molecular glue degraders (MGDs) โ small molecules that hijack the cell's protein disposal machinery. The climber navigates ~100 compounds from their own research, comparing Ridge regression on Morgan fingerprints to Chemprop (a message-passing neural network), predicting degradation efficiency (pDC50, Dmax) against both the target protein (ZBTB11) and an off-target (IKZF1). The route asks: can ML help design selective degraders?
๐ง The Crux
Nโ100 is small for Chemprop โ the MPNN may not have enough data to outperform simpler fingerprint methods. Noisy or incomplete pDC50 values could poison the model. The dual-target setup (ZBTB11 vs IKZF1) is the scientific hook but doubles the modeling complexity.
โ ๏ธ Pre-Climb Checklist
โ Dataset is from own research โ real, relevant, and available. โ Chemprop v2 with Optuna hyperparameter tuning is well-specified. โ ๏ธ With ~100 compounds, expect Chemprop to struggle โ that's a valid finding. โ ๏ธ AlphaFold3 stretch goal (ternary complex modeling) is ambitious โ complete core pipeline first.
Guidance
- The selectivity question (ZBTB11 vs IKZF1) is the real story โ lead with it
- If Ridge beats Chemprop, that's interesting and publishable for small-data regimes
- AF3 interface scores as features is a clever idea โ but validate on a few examples first
Source proposal: Tran_Nathan_finalProject_proposal.docx
CHEM 169/269 ยท Applied AI & Machine Learning for Biochemistry