I'm a ML Engineer and AI Safety Research Fellow from Sydney, Australia. I think AI is important but we're a long way off having it be safe.
goals
- Meaningfully contribute toward ensuring AI is good for humanity
- Run the 7 World Marathon Majors (currently 0/7)
- Release an EP
- Learn the piano
things I like
- Songs that feel like a transitional time
- DJing & Producing Music
- Running
things I dislike
- Giving AI autonomy haphazardly
Experiences
- Supervised Program for Alignment Research (SPAR) Research Fellow — building interpretability agents
- Ampelic AI ML Engineer — building a POC for seed round
- Technical Alignment Research Accelerator (TARA) Participant
- Oxford AI Safety Initiative (OAISI) ARBOx Participant
- nVest Strategy Intern
- Produced the Australian Retail Investment Technology Report 2023
Research / Projects
- arm-ai.org (AI Risk Monitor) — Safety & risk index for frontier AI models (work in progress!)
- SpotTheSlop.com — Guess between AI vs Human media
- Investigated the properties of Subliminal Learning
- Recreated and scaled stage-wise model-diffing with crosscoders
- Experimented with model diffing between base and Deepseek distilled CoT models
writing that's inspired me
- Life is Short by Paul Graham
- Research Without Permission by Priyanka Bharadwaj
- Want What You Have by Isabel Unraveled
- Escaping Flatland by Alex Ong