Probing linguistic robustness in transformers: a quantum-inspired approach to AI interpretability
-
Updated
Mar 2, 2025 - Python
Probing linguistic robustness in transformers: a quantum-inspired approach to AI interpretability
I Asked It to Forget, but It Didn't — A Case of Miscommunication Between AI and Humans
📦 Redwood Research's transformer interpretability tools, conveniently packaged in a Docker container for simple and reproducible deployments.
Add a description, image, and links to the ai-interpretability topic page so that developers can more easily learn about it.
To associate your repository with the ai-interpretability topic, visit your repo's landing page and select "manage topics."