Defense Against LLM and AGI Scheming with Guardrails and Architecture
Valley Research Park 319 North Bernardo Avenue, Mountain View, CA, USHybrid meeting: In-person, Zoom, and YouTube Greg Makowski, Chief of Data Science at Ccube TALK DESCRIPTION A January 2025 paper called “Frontier Models are Capable of In-Context Scheming”, https://arxiv.org/pdf/2412.04984, demonstrated how a wide variety of current frontier LLM models (i.e.... Read more