I build ML and data engineering systems for drug discovery, currently focused on oncology. At SoulBio, I work closely with biotech teams to design and deploy end-to-end infrastructure—data pipelines, models, cloud systems, and internal tools—that help them move faster from raw data to usable biological insight.
I began in NLP research, but moving into biological data changed how I think about ML. In drug discovery, the data is messy, context matters, and models don’t live in isolation. A lot of the real work is in the pipelines, the assumptions, and the handoff between computation and biology. That’s where I tend to focus.
Alongside client work, I also build internal tools at SoulBio, including AI agents and bioinformatics workflows, especially around RNA-seq, to reduce repetitive analysis and help scientists move faster. I enjoy working on problems where good engineering choices make complex work feel simpler and more reliable.
My background is in applied machine learning and data engineering rather than deep biology, and I like building technology that simplifies complex workflows through automation and better tooling.