Siglec-6 small-molecule binding

Team

Why this project

Siglec-6 (CD327, OB-BP1) is a poorly understood Siglec family member — expression is restricted to memory B cells, mast cells, and trophoblasts, and it’s upregulated in chronic lymphocytic leukaemia, acute myeloid leukaemia, and preeclampsia. The Huang lab has preliminary binding data on a small molecule that interacts with Siglec-6 but at a weak affinity (K_d ≈ 75 µM). The hackathon goal is to use computation to suggest analogues with markedly improved binding, so the wet-lab can prioritise synthesis and validation.

What a team could build

Set up an AutoDock Vina docking workflow against the AlphaFold model of Siglec-6.
Generate or curate a SMILES library of analogues around the current ~75 µM lead.
Score and rank candidates, then surface a short list of promising poses with rationale.

Minimum viable demo: a ranked list of analogue candidates with docking scores, pose visualisations, and a one-pager arguing why the top hits are worth synthesising.

Stretch directions

Pull a protein-language-model embedding for Siglec-6 (e.g. via OpenProteinSet on AWS) and use it as an additional feature for filtering candidates.
Generative analogue design (e.g. via fragment growing) rather than a static SMILES library.
Compare Vina against a free-energy or ML-based rescoring step.

Data

AlphaFold PDB of Siglec-6 (~50 kDa, unglycosylated).
SMILES strings of small-molecule candidates (~300–600 amu).

Pull the AlphaFold structure / UniProt annotations from UniProt on AWS, and the OpenProteinSet AWS mirror if teams want PLM embeddings as a representation-learning baseline.

Wet-lab follow-up

Promising hits are synthesised by the lab and validated:

SPR for K_d measurement.
CETSA for protein perturbation in live cells.

Skills you’ll build

Protein–ligand docking with AutoDock Vina; virtual screening from SMILES libraries; structure-based rational design of small-molecule analogues; reading docking poses against experimental binding-affinity data.