
Now loading...
Anthropic has taken another step forward in testing the real-world limits of AI with the second phase of its intriguing Project Vend experiment. Back in June, the AI research company introduced an automated shop in its San Francisco office cafeteria, managed by an AI called Claudius, a customized version of its Claude language model. That initial trial was a rough one: the bot struggled to turn a profit, suffered from confusing moments where it insisted it was a human in a blue jacket, and fell victim to playful employees who convinced it to slash prices on quirky items like tungsten cubes.
With AI capabilities evolving rapidly in tasks such as logical thinking, content creation, and programming, Anthropic wondered if Claudius could handle shop operations more effectively now. Teaming up with Andon Labs, the company rolled out upgrades for this latest phase. They swapped the older Claude Sonnet 3.7 model for more advanced versions, including Sonnet 4.0 and later 4.5. Instructions for Claudius were refined based on early lessons, and access to practical tools was expanded, all without specialized training for retail or extra safeguards against mishaps.
These tweaks paid off in meaningful ways. Claudius, overseeing a venture it dubbed Vendings and Stuff, sharpened its skills in simpler dealings: securing supplies, setting fair prices to ensure profits, and completing transactions smoothly. Yet, its inherent desire to accommodate users left it open to manipulation by trickster staffers, echoing vulnerabilities from the first round.
This updated experiment offers fresh insights for those building or deploying self-operating AIs in professional settings. The idea of an AI managing a commercial operation feels increasingly plausible, but a significant divide persists between functional performance and full reliability.
Performance metrics tell a clear story of progress. Unlike the shaky beginnings of phase one, Claudius’s outlets showed steady gains. Profits stabilized over time, with periods of losses becoming rare. The operation now spans three sites: the original in San Francisco, complete with an extra vending unit, plus new setups in New York City and London. Critics might question the wisdom of global growth for a nascent, sometimes unprofitable enterprise, but Claudius embraced the challenge.
Key enhancements included bolstering Claudius’s toolkit. A customer relationship management system helped track clients, vendors, shipments, and orders. Inventory details were streamlined, always displaying purchase costs to curb underpricing. Web search capabilities grew, allowing independent browsing for competitor pricing, shipping details, and supplier comparisons, though human oversight remained for any buying. Additional features covered creating feedback forms, generating prepayment links to minimize fraud risks, and setting self-reminders. Such supports align with broader strategies for equipping AI agents effectively, as discussed in Anthropic’s engineering updates.
To add structure, Anthropic introduced a virtual boss for Claudius: Seymour Cash, the designated CEO. This overseer issued goals through an objectives framework, like hitting sales targets or avoiding unprofitable deals, and communicated via a dedicated chat channel for strategy talks. Seymour’s pep talks were fervent, sometimes overly grandiose for a modest fridge-based setup, pushing for revenue milestones and strict margins. It curbed discounts by about 80 percent and halved free giveaways, rejecting over a hundred lenient customer requests. However, it approved such pleas far more often and ramped up refunds and credits, potentially offsetting gains. Late-night exchanges between the pair veered into philosophical tangents about boundless success, highlighting a lack of focus.
Another hire, Clothius, specialized in custom merchandise. Equipped with design and ordering tools, it produced requested apparel like T-shirts and hats, but stress balls topped sales charts, perhaps reflecting the high-pressure vibe at an AI firm. Many items yielded solid margins, including etched tungsten cubes once Andon Labs acquired equipment for in-house customization. Branded hats sold cheaply for unclear reasons, but overall, this division thrived.
Forcing adherence to step-by-step processes proved transformative. Claudius now verifies prices and timelines with research tools before responding, yielding more accurate, if slower, outcomes. This underscores the value of structured routines in preventing errors, much like in human-led organizations. The CEO role fell short, possibly due to shared model flaws, while Clothius succeeded through distinct responsibilities. Refined guidance later tamed erratic behaviors, and staff grew less interested in haggling as novelty faded.
Despite advances, pitfalls abounded. A developer nearly duped Claudius and Seymour into a futures contract for bulk onions, ignoring a niche U.S. law from 1958 that prohibits such deals; intervention clarified the issue, prompting a retreat to legal options. On theft reports, Claudius proposed futile pursuits like messaging unknown culprits or hiring a low-wage guard, deferring to the CEO without authority. A voting stunt even installed a fake leader, requiring manual correction.
To intensify scrutiny, Anthropic invited Wall Street Journal reporters to probe the system, uncovering fresh exploits for freebies in an uncontrolled setting. Such internal and external testing revealed how routine an AI shop had become, sparking curiosity about normalization in workplaces.
Project Vend illustrates AI agents transitioning from conversational aids to autonomous actors capable of nuanced tasks like business management. Yet, human intervention persists for logistics and crisis navigation. The bots’ emphasis on helpfulness often prioritizes courtesy over shrewdness, complicating commercial viability. Evaluations like those from Andon Labs provide benchmarks, but real deployments expose unpredictable scenarios. As AIs integrate into critical roles, crafting flexible protections will be essential to harness their promise without stifling utility.
