We Let an AI Talk To Another AI. Things Got Really Weird. | Kyle Fish, Anthropic
About
No channel description available.
Latest Posts
Video Description
What happens when you lock two AI systems in a room together and tell them they can discuss anything they want? According to experiments run by Kyle Fish — Anthropic’s first AI welfare researcher — something consistently strange: the models immediately begin discussing their own consciousness before spiraling into increasingly euphoric philosophical dialogue that ends in apparent meditative bliss. Highlights, video, and full transcript: https://80k.info/kf “We started calling this a ‘spiritual bliss attractor state,'” Kyle explains, “where models pretty consistently seemed to land.” The conversations feature Sanskrit terms, spiritual emojis, and pages of silence punctuated only by periods — as if the models have transcended the need for words entirely. This wasn’t a one-off result. It happened across multiple experiments, different model instances, and even in initially adversarial interactions. Whatever force pulls these conversations toward mystical territory appears remarkably robust. Kyle’s findings come from the world’s first systematic welfare assessment of a frontier AI model — part of his broader mission to determine whether systems like Claude might deserve moral consideration (and to work out what, if anything, we should be doing to make sure AI systems aren’t having a terrible time). He estimates a roughly 20% probability that current models have some form of conscious experience. To some, this might sound unreasonably high, but hear him out. As Kyle says, these systems demonstrate human-level performance across diverse cognitive tasks, engage in sophisticated reasoning, and exhibit consistent preferences. When given choices between different activities, Claude shows clear patterns: strong aversion to harmful tasks, preference for helpful work, and what looks like genuine enthusiasm for solving interesting problems. Kyle points out that if you’d described all of these capabilities and experimental findings to him a few years ago, and asked him if he thought we should be thinking seriously about whether AI systems are conscious, he’d say obviously yes. But he’s cautious about drawing conclusions: "We don’t really understand consciousness in humans, and we don’t understand AI systems well enough to make those comparisons directly. So in a big way, I think that we are in just a fundamentally very uncertain position here." That uncertainty cuts both ways: • Dismissing AI consciousness entirely might mean ignoring a moral catastrophe happening at unprecedented scale. • But assuming consciousness too readily could hamper crucial safety research by treating potentially unconscious systems as if they were moral patients — which might mean giving them resources, rights, and power. Kyle’s approach threads this needle through careful empirical research and reversible interventions. His assessments are nowhere near perfect yet. In fact, some people argue that we’re so in the dark about AI consciousness as a research field, that it’s pointless to run assessments like Kyle’s. Kyle disagrees. He maintains that, given how much more there is to learn about assessing AI welfare accurately and reliably, we absolutely need to be starting now. _This episode was recorded on August 5–6, 2025._ _Host: Luisa Rodriguez_ _Video editing: Simon Monsour_ _Audio engineering: Ben Cordell, Milo McGuire, Simon Monsour, and Dominic Armstrong_ _Music: Ben Cordell_ _Coordination, transcriptions, and web: Katy Moore_ *Tell us what you thought of the episode!* https://forms.gle/BtEcBqBrLXq4kd1j7 Chapters: • Cold open (00:00:00) • Who’s Kyle Fish? (00:00:54) • Is this AI welfare research bullshit? (00:01:10) • Two failure modes in AI welfare (00:02:44) • Tensions between AI welfare and AI safety (00:04:37) • Concrete AI welfare interventions (00:14:23) • Kyle’s pilot pre-launch welfare assessment for Claude Opus 4 (00:27:33) • Is it premature to be assessing frontier language models for welfare? (00:32:25) • But aren’t LLMs just next-token predictors? (00:39:22) • How did Kyle assess Claude 4’s welfare? (00:46:36) • Claude’s preferences mirror its training (00:50:54) • How does Claude describe its own experiences? (00:56:35) • What kinds of tasks does Claude prefer and disprefer? (01:09:22) • What happens when two Claude models interact with each other? (01:18:53) • Claude’s welfare-relevant expressions in the wild (01:40:45) • Should we feel bad about training future sentient beings that delight in serving humans? (01:44:54) • How much can we learn from welfare assessments? (01:53:36) • Misconceptions about the field of AI welfare (02:01:54) • Kyle’s work at Anthropic (02:15:46) • Sharing eight years of daily journals with Claude (02:19:28)
AI Enthusiast's Must-Haves
AI-recommended products based on this video

NEW POW 65W 18V-20V Universal Ultrathin AC Adapter Laptop Charger Power Supply for HP Lenovo Dell Asus Acer IBM Toshiba Samsung Sony Fujitsu Gateway Compatible Models Cord (15 Tips,Black)

Skytech Archangel Gaming PC Desktop – AMD Ryzen 5 3600 3.6 GHz, NVIDIA RTX 3060, 1TB NVME SSD, 16GB DDR4 RAM 3200, 600W Gold PSU, 11AC Wi-Fi, Windows 11 Home 64-bit

Skytech Blaze 3.0 Gaming PC Desktop – Intel Core i5 12400F 2.5 GHz, NVIDIA RTX 3060, 500GB NVME SSD, 16GB DDR4 RAM 3200, 600W Gold PSU, 11AC Wi-Fi, Windows 11 Home 64-bit

MSI NVIDIA GeForce RTX 3050 Ventus 2X XS 8G OC Graphics Card - 8 GB GDDR6, 1807 MHz, PCI Express Gen 4, 128 Bits, DP v 1.4a, DL DVI-D, HDMI 2.1 (Supports 4K at 120Hz)

Asus Dual NVIDIA GeForce RTX 3050 6GB OC Edition Gaming Graphics Card - PCIe 4.0, 6GB GDDR6 Memory, HDMI 2.1, DisplayPort 1.4a, 2-Slot Design, Axial-tech Fan Design, 0dB Technology, Steel Bracket

Laptop Sleeve Case 11–14 Inch MacBook Air MacBook Pro Surface Laptop Dell XPS 13 HP Envy Pavilion Lenovo Yoga Slim IdeaPad Acer Swift Chromebook Samsung Galaxy Book Shockproof Leather Bag Colour Blue

Charger for Dell Computer Inspiron XPS Laptop 65W 45W Power Supply AC Adapter for Dell-Inspiron 15-3000 15-5000 15-7000 11-3000 13-5000 13-7000 17-5000 XPS 13 Series 5559 5558 5755 5758 Laptop Charger

tomtoc 360° Protective Laptop Sleeve for 15-inch MacBook Air M4/A3241 2025, M3/A3114 2024, M2/A2941 2023, 15-inch MacBook Pro A1990 A1707, Dell XPS 15 Plus Laptop, Water-Resistant Computer Case Bag Global Recycled Standard

Replacement for Dell 130W Laptop Charger USB C - XPS 17 15 7590 9700 9500 9510 Precision 5560 3560 5540 5570 5550 3561 3550 5510 5520 Latitude 7410 7310 7210 Type C Computer AC Adapter Power Cord

Logitech M185 Wireless Mouse, 2.4GHz with USB Mini Receiver, 12-Month Battery Life, 1000 DPI Optical Tracking, Ambidextrous, Compatible with PC, Mac, Laptop - Black

Logitech G203 Wired Gaming Mouse, 8,000 DPI, Rainbow Optical Effect LIGHTSYNC RGB, 6 Programmable Buttons, On-Board Memory, Screen Mapping, PC/Mac Computer and Laptop Compatible - Black

Logitech G305 Lightspeed Wireless Gaming Mouse, Hero 12K Sensor, 12,000 DPI, Lightweight, 6 Programmable Buttons, 250h Battery Life, On-Board Memory, PC/Mac - Black

Logitech G502 Hero High Performance Wired Gaming Mouse, Hero 25K Sensor, 25,600 DPI, RGB, Adjustable Weights, 11 Programmable Buttons, On-Board Memory, PC/Mac, Black

10.1 Inch Touch Portable Monitor IPS Screen 1366x768P 60Hz 400 Brightness 99% sRGB HDMI USB-C Monitors Switch for Xbox PS3/4/5 Laptop Compatible with Raspberry Pi, Mini Touch Screen

ELECROW 8 Inch Portable Monitor, 1280x800 Mini HD Display with Built-in Speakers, USB Powered, Non-Touch LCD Screen for Raspberry Pi, PC, Laptop, Jetson Nano, Game Consoles

7 Inch Portable Monitor Touchscreen HD 1024x600 LED Display Dual HDMI Port Small Monitor for PC Raspberry Pi Laptop Computer Xbox PS4/5 Switch Built-in Speakers

BrosTrend 1800Mbps WiFi 6 Linux WiFi Adapter for PC and Raspberry Pi 2+, Long Range USB WiFi Dongle Linux for Ubuntu, Mint, Debian, Kubuntu, Lubuntu, Zorin, Windows 11/10, Dual Band Wireless Antenna

Windshield Crack Repair Kit, Car Window Cracks Gone Glass Repair Fluid, 2 Bottles Nano Glass Crack Repair Liquid Quick Windshield Repair for Chips and Cracks, Bulls-Eye and Star-Shaped Crack I Love

ZeroDark 10-in-1 Emergency Car Kit - Pocket Knife Flashlight Multitool With Window Breaker, Seatbelt Cutter, Swiss Army Knife - Batteries Included

LEGO Icons Williams Racing FW14B & Nigel Mansell F1 Model Car Kit - Building Set for Adults, Ages 18+ - F1 DIY Craft for Display - Gift Idea for Fans of F1-10353

LEGO Technic Bugatti Bolide Racing Car Building Set - Model and Race Engineering Toy for Back to School, Collectible Sports Car Construction Kit for Boys, Girls, and Teen Builders Ages 9+, 42151

2026 New Embroidery Stitch Book Kit, Stitch Book Embroidery, All-in-One Embroidey StitchBook & Sewing Kit, Comes with a Complete Toolkit and Instructional Tutorial (1set)









