anshradh / elk Goto Github PK
View Code? Open in Web Editor NEWThis project forked from eleutherai/elk
Keeping language models honest by directly eliciting knowledge encoded in their activations. Building on "Discovering latent knowledge in language models without supervision" (Burns et al. 2022)
License: Other