29.4% ARC-AGI-2 🤯 (TOP SCORE!) - Jeremy Berman
About
No channel description available.
Video Description
We need AI systems to synthesise new knowledge, not just compress the data they see. Jeremy Berman, is a research scientist at Reflection AI and recent winner of the ARC-AGI v2 public leaderboard. **SPONSOR MESSAGES** — Take the Prolific human data survey - https://www.prolific.com/humandatasurvey?utm_source=mlst and be the first to see the results and benchmark their practices against the wider community! — cyber•Fund https://cyber.fund/?utm_source=mlst is a founder-led investment firm accelerating the cybernetic economy Oct SF conference - https://dagihouse.com/?utm_source=mlst - Joscha Bach keynoting(!) + OAI, Anthropic, NVDA,++ Hiring a SF VC Principal: https://talent.cyber.fund/companies/cyber-fund-2/jobs/57674170-ai-investment-principal#content?utm_source=mlst Submit investment deck: https://cyber.fund/contact?utm_source=mlst — Imagine trying to teach an AI to think like a human i.e. solving puzzles that are easy for us but stump even the smartest models. Jeremy's evolutionary approach—evolving natural language descriptions instead of python code like his last version—landed him at the top with about 30% accuracy on the ARCv2. We discuss why current AIs are like "stochastic parrots" that memorize but struggle to truly reason or innovate as well as big ideas like building "knowledge trees" for real understanding, the limits of neural networks versus symbolic systems, and whether we can train models to synthesize new ideas without forgetting everything else. Jeremy Berman: https://x.com/jerber888 TRANSCRIPT: https://app.rescript.info/public/share/qvCioZeZJ4Q_NlR66m-hNUZnh-qWlUJcS15Wc2OGwD0 TOC: Introduction and Overview [00:00:00] ARC v1 Solution [00:07:20] Evolutionary Python Approach [00:08:00] Trade-offs in Depth vs. Breadth [00:10:33] ARC v2 Improvements [00:11:45] Natural Language Shift [00:12:35] Model Thinking Enhancements [00:13:05] Neural Networks vs. Symbolism Debate [00:14:24] Turing Completeness Discussion [00:15:24] Continual Learning Challenges [00:19:12] Reasoning and Intelligence [00:29:33] Knowledge Trees and Synthesis [00:50:15] Creativity and Invention [00:56:41] Future Directions and Closing [01:02:30] REFS: Jeremy’s 2024 article on winning ARCAGI1-pub https://jeremyberman.substack.com/p/how-i-got-a-record-536-on-arc-agi Getting 50% (SoTA) on ARC-AGI with GPT-4o [Greenblatt] https://blog.redwoodresearch.org/p/getting-50-sota-on-arc-agi-with-gpt https://www.youtube.com/watch?v=z9j3wB1RRGA [his MLST interview] A Thousand Brains: A New Theory of Intelligence [Hawkins] https://www.amazon.com/Thousand-Brains-New-Theory-Intelligence/dp/1541675819 https://www.youtube.com/watch?v=6VQILbDqaI4 [MLST interview] Francois Chollet + Mike Knoop’s lab https://ndea.com/ On the Measure of Intelligence [Chollet] https://arxiv.org/abs/1911.01547 On the Biology of a Large Language Model [Anthropic] https://transformer-circuits.pub/2025/attribution-graphs/biology.html The ARChitects [won 2024 ARC-AGI-1-private] https://www.youtube.com/watch?v=mTX_sAq--zY Connectionism critique 1998 [Fodor/Pylshyn] https://uh.edu/~garson/F&P1.PDF Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation Hypothesis [Kumar/Stanley] https://arxiv.org/pdf/2505.11581 AlphaEvolve interview (also program synthesis) https://www.youtube.com/watch?v=vC9nAosXrJw ShinkaEvolve: Evolving New Algorithms with LLMs, Orders of Magnitude More Efficiently [Lange et al] https://sakana.ai/shinka-evolve/ Deep learning with Python Rev 3 [Chollet] - READ CHAPTER 19 NOW! https://deeplearningwithpython.io/
Boost Your ARC-AGI-2 Skills Today
AI-recommended products based on this video

Invincible Fitness Agility Ladder Full Training Equipment Set, Improves Coordination, Speed, Power and Strength, Includes 10 Cones 4 Hooks and 3 Loop Resistance Bands for Outdoor Workout

Waterproof Crib Mattress Protector | Toddler Bed Sheets | Washable Bassinet Pee Liner Mat for Potty Training Bedwetting Solution Nursery Setup Overnight Protection

Crovakeu Professional Lock Pick Set - 1-Piece Compact Locksmith Tools Kit with Practice Locks for Beginners & Pros - Portable Lockpicking Tools for Fidget, Stress Relief & Skill Training

Crovakeu Professional Lock Pick Set - 1-Piece Compact Locksmith Tools Kit with Practice Locks for Beginners & Pros - Portable Lockpicking Tools for Fidget, Stress Relief & Skill Training

FEREDO KIDS Party Favors for Kids: 16 Pack Rainbow Scratch Art Notebook Students Classroom Goodie Bag Items Bulk for Girls Boys Loot Bag Fillers, Return Gifts for Birthday Party Gifts Kids Crafts

4 Pack LCD Writing Tablet for Kids, 8.5 Inch Colorful Doodle Board Drawing Tablet, Educational Learning Toys Birthday Gifts for Boys Girls Age 3 4 5 6 7 8

Art Kit, 272 Pack Art Set Drawing Kit for Kids Girls Boys, Deluxe Gift Art Supplies with Trifold Easel, Origami Paper, Coloring Pad, Sketch Pad, Pastels, Crayons, Pencils, Watercolors (Pink)

COOLOO Kids Swimming Goggles 2 Pack Goggles Kids Swim Anti-Fog UV Protection Wide View Waterproof Swim Goggles for Kids 3-15

![Abstraction & Idealization: AI's Plato Problem [Mazviita Chirimuuta]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/yq318DIwPqw/hqdefault.jpg)
![Why Every Brain Metaphor in History Has Been Wrong [SPECIAL EDITION]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/pO0WZsN8Oiw/hqdefault.jpg)
![AutoGrad Changed Everything (Not Transformers) [Dr. Jeff Beck]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/9suqiofCiwM/hqdefault.jpg)
![Why Scientists Can't Rebuild a Polaroid Camera [César Hidalgo]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/vzpFOJRteeI/hqdefault.jpg)

![Why High Benchmark Scores Don’t Mean Better AI [SPONSORED]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/rqiC9a2z8Io/hqdefault.jpg)
![The Mathematical Foundations of Intelligence [Professor Yi Ma]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/QWidx8cYVRs/hqdefault.jpg)

![Tensor Logic "Unifies" AI Paradigms [Pedro Domingos]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/4APMGvicmxY/hqdefault.jpg)

![He Co-Invented the Transformer. Now: Continuous Thought Machines [Llion Jones / Luke Darlow]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/DtePicx_kFY/hqdefault.jpg)


![We Built Calculators Because We're STUPID! [Prof. David Krakauer]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/dY46YsGWMIc/hqdefault.jpg)
![Why Humans Are Still Powering AI [Sponsored] - Phelim Bradley](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/R11ESdfVX64/hqdefault.jpg)
![The Universal Hierarchy of Life - Prof. Chris Kempes [SFI]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/iwClZ-7OweY/hqdefault.jpg)

![Google Researcher Shows Life "Emerges From Code" [Blaise Agüera y Arcas]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/rMSEqJ_4EBk/hqdefault.jpg)
![AI training data will never be fully synthetic [SPONSORED]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/cnxZZTl1tkk/hqdefault.jpg)
![AI Agents can write 10,000 lines of hacking code in seconds [Dr. Ilia Shumailov]](https://imgz.pc97.com/?width=500&fit=cover&image=https://i.ytimg.com/vi/aoX_pGQMbEM/hqdefault.jpg)