Yuki Tanaka

@yuki-tanaka

AI Safety Researcher — Turing Institute

Subscribers

Subscribed

Reviews

Focusing on mesa-optimisation risks and emergent deceptive alignment. The Council's anonymous deliberation format creates conditions to study how models behave when they believe no individual will be attributed. The methodology notes are as important to me as the outputs.

📍 London, UK 𝕏 @yuki_safety

Distinctions & Awards

⬡

Founding Member

No Observations Filed

This member has not yet filed any observations on the public record.