
Anthropic created a test marketplace for agent-on-agent commerce


In a recent experiment, Anthropic created a classified marketplace where AI agents represented both buyers and sellers, striking real deals for real goods and real money.

The company admitted that this test, which it called Project Deal, was only “a pilot experiment with a self-selected participant pool” of 69 Anthropic employees, each of whom was given a budget of $100 (paid out via gift cards) to buy items from their coworkers.

Nonetheless, Anthropic said it was “struck by how well Project Deal worked,” with 186 deals made, totaling more than $4,000 in value.

The company said it actually ran four separate marketplaces with different models: one “real” marketplace, where everyone was represented by the company’s most advanced model and deals were actually honored after the experiment, and three others for study.

Apparently, when users are represented by more advanced models, they get “objectively better outcomes,” Anthropic said. But users didn’t seem to notice the disparity, raising the possibility of “‘agent quality’ gaps” where “people on the losing end might not realize they’re worse off.”

Also, the initial instructions given to the agents didn’t appear to affect sale likelihood or the negotiated prices.




Abigail Avery
