Mark Gravestock
Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet
Initializing search
    • Home
    • TIL
    • Blog
    • Bookmarks
    • Tags
    • Home
    • TIL
      • Networking
    • Blog
    • Bookmarks
    • Tags
    ai llm

    Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

    Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

    transformer-circuits.pub ยท 17 Nov 2025

    Made with Material for MkDocs