The Agent Company: Benchmarking LLM Agents on Consequential Real World Tasks
Samuel Albanie
•
January 5, 2025
Samuel Albanie
View ChannelAbout
AI research. Note that opinions expressed are my own. However, for conflict of interest purposes, please note that I'm employed by Google DeepMind (GDM). This almost certainly biases my judgment to some degree. I think GDM is pretty great. Other related content: - misc/outdoor stuff channel: https://www.youtube.com/@samuelalbaniemisc - https://x.com/SamuelAlbanie - https://bsky.app/profile/samuelalbanie.bsky.social FAQ: Software used to make videos - keynote on mac (to make slides) - Adobe Premiere Pro for editing
Latest Posts
Empower Your AI Journey
AI-recommended products based on this video
Loading...

Firefly Variety 8 Pack - Fire Starter Accessory for Swiss Army Victorinox Knives (Neon Green-Yellow Glow)
(181)
$70.15$61.13
$5.05 delivery Thu, Jun 26Only 3 left in stock.
Loading...

9-in-1 5000A 150PSI Car Battery Booster Jump Starter with Air Compressor (All Gas/9L Diesel), Portable Car Battery Booster Pack, Safe Durable Car Jump Starter with Extended Jumper Cables, Glove, Light
(261)
$99.99
FREE delivery Sat, Sep 20
1K+ bought in past month
Loading...

Seasonic Focus V4 GX-1000 (ATX3) - 1000W - 80+ Gold - ATX 3.0 & PCIe 5.1 Ready -Full-Modular -ATX Form Factor -Premium Japanese Capacitor -10 Year Warranty -Nvidia RTX 30/40 Super & AMD GPU Compatible
(22)
$423.35
FREE delivery Oct 8 - 10



