Team
Why this project
A clean immunology hypothesis with a clear directional question: if highly variable sites also bind MHC well, that suggests MHC pressure isn’t suppressing variability — and would constrain models of immune evasion. The tools and data are all public.
What a team could build in one day
- Pull SARS-CoV-2 sequences across waves (early variants through more recent ones).
- Run NetMHCPan (or UniPMT) to predict peptide affinities for each variant.
- Score per-position variability across the lineage.
- Correlate variability with predicted binding strength and visualize.
Minimum viable demo: a correlation plot for one HLA allele across at least two variant time points.
Stretch directions
- Multiple HLA alleles weighted by population frequency.
- Compare against influenza or other RNA viruses for a baseline.
- Consider T-cell vs. B-cell epitope contexts separately.
Tools and data
- Peptide–MHC predictors: NetMHCPan, UniPMT.
- Public SARS-CoV-2 sequence collections (GISAID, NCBI Virus).