The First Mechanistic Interpretability Frontier Lab — Myra Deng & Mark Bissell of Goodfire AI

Latent Space: The AI Engineer Podcast

Equivalence of activation steering and in‑context learning

Mark describes research that maps activation-steering strength to an equivalent amount of prompting and predicts how many in-context examples a jailbreak requires.

Play episode from 31:07
