Saurav Panigrahi
I’m a research engineer currently working on LLM post-training and alignment at Zoho Labs.
I am also actively involved in AI safety research. I have worked on self-preservation behaviors in LLMs with Robert McCarthy at UCL, and on value alignment with Jonathan Chang and Prof. Lionel Levine at Cornell. I also contribute to the training and evaluation of LLMs in high-stakes domains (medical reasoning) at MEDARC.
Updates
Wrapped up SPAR Fall '25 Fellowship. Technical report
SPAR was great; doing it again with Jonathan and Lionel.
Medmarks public release: the most comprehensive medical benchmark for LLMs yet. Twitter
Our SPAR work got accepted at ICML '26 Pluralistic Alignment.
Wrapped up SPAR Spring '26 Fellowship. Technical report
Medmarks got accepted into ICML '26 FM4LS.
Papers
Side Effects of Character Training: Quantifying Cross-Constitution Drift in LLMs
ICML '26 Pluralistic Alignment