The Dark Matter of AI [Mechanistic Interpretability]
Welch Labs
New Book! The Welch Labs Illustrated Guide to AI is now available for pre-order: https://www.welchlabs.com/resources/ai-book
Video Description
Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: http://incogni.com/welchlabs

Welch Labs Imaginary Numbers Book! https://www.welchlabs.com/resources/i...
Welch Labs Posters: https://www.welchlabs.com/resources

Special Thanks to Patrons / welchlabs
Juan Benet, Ross Hanson, Yan Babitski, AJ Englehardt, Alvin Khaled, Eduardo Barraza, Hitoshi Yamauchi, Jaewon Jung, Mrgoodlight, Shinichi Hayashi, Sid Sarasvati, Dominic Beaumont, Shannon Prater, Ubiquity Ventures, Matias Forti, Brian Henry, Tim Palade, Petar Vecutin, Nicolas baumann, Jason Singh, Robert Riley, vornska, Barry Silverman

My Gemma walkthrough notebook: https://colab.research.google.com/dri...
Most animations made with Manim: https://github.com/3b1b/manim

References and Further Reading
Chris Olah’s original “Dark Matter of Neural Networks” post: https://transformer-circuits.pub/2024...
Great recent interview with Chris Olah: • Dario Amodei: Anthropic CEO on Claude, AGI...
Gemma Scope: https://arxiv.org/pdf/2408.05147
Experiment with SAEs yourself here! https://www.neuronpedia.org/
Relevant work from the Anthropic team:
https://transformer-circuits.pub/2022...
https://transformer-circuits.pub/2023...
https://transformer-circuits.pub/2024...
Excellent intro to Mechanistic Interpretability: https://arena3-chapter1-transformer-i...
Neel Nanda’s Mechanistic Interpretability Explainer: https://dynalist.io/d/n2ZWtnoYHrU1s4v...
Transformer Lens: https://github.com/TransformerLensOrg...
SAE Lens: https://jbloomaus.github.io/SAELens/

Technical Notes
1. There are more advanced and more meaningful ways to map mid-layer vectors to outputs; see: https://arxiv.org/pdf/2303.08112, https://neuralblog.github.io/logit-pr..., https://www.lesswrong.com/posts/AcKRB...
2. The 6x2304 matrix is actually 7x2304; we’re ignoring the /bos token.
3. Gemma also includes positional embeddings and lots and lots of normalization layers, which we didn’t really cover.
4. I’m conflating tokens and words sometimes. In this example each word is a token, so we don’t have to worry about it too much.
5. The “_” characters represent spaces in the token strings.
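
Notes 1 and 2 can be made concrete with a toy NumPy sketch. It is not the code from the walkthrough notebook. The random arrays, the toy vocabulary size, and the helper names `drop_bos` and `project_to_vocab` are all hypothetical, and it assumes the simple mapping is a plain multiplication by an unembedding matrix, skipping the normalization layers mentioned in note 3.

```python
import numpy as np

D_MODEL = 2304      # residual width quoted in note 2
TOY_VOCAB = 1000    # hypothetical toy vocabulary, far smaller than the real one


def drop_bos(residual: np.ndarray) -> np.ndarray:
    """Drop the first row, assumed to be the beginning-of-sequence position."""
    return residual[1:]


def project_to_vocab(vector: np.ndarray, unembed: np.ndarray) -> int:
    """Naively map one mid-layer vector to a token id (no normalization applied)."""
    logits = vector @ unembed          # shape: (TOY_VOCAB,)
    return int(np.argmax(logits))


rng = np.random.default_rng(0)
residual = rng.standard_normal((7, D_MODEL))        # 7 positions including BOS
unembed = rng.standard_normal((D_MODEL, TOY_VOCAB))  # stand-in unembedding matrix

trimmed = drop_bos(residual)
print(trimmed.shape)                                 # (6, 2304)
top_token_id = project_to_vocab(trimmed[-1], unembed)
print(top_token_id)
```

With real model weights, the token id from the last position would be looked up in the tokenizer to get a token string; the links in note 1 describe more careful versions of this projection.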