{"href":"https://api.simplecast.com/oembed?url=https%3A%2F%2Fai-a16z.simplecast.com%2Fepisodes%2Fbenchmarking-ai-agents-on-full-stack-coding-QGpiWJ91","width":444,"version":"1.0","type":"rich","title":"Benchmarking AI Agents on Full-Stack Coding","thumbnail_width":300,"thumbnail_url":"https://image.simplecastcdn.com/images/c816dcf3-bf6b-4383-bfd0-ce36db7a9d83/41f119e9-df47-4c08-b4bb-cf83b7c20576/ai-20-20a16z-20pod-20-20benchmarking-20ai-20agents-20on-20full-stack-20coding-201-1.jpg","thumbnail_height":300,"provider_url":"https://simplecast.com","provider_name":"Simplecast","html":"<iframe src=\"https://player.simplecast.com/8d45a574-ae4a-494e-846f-18d238b2850f\" height=\"200\" width=\"100%\" title=\"Benchmarking AI Agents on Full-Stack Coding\" frameborder=\"0\" scrolling=\"no\"></iframe>","height":200,"description":"In this episode, a16z General Partner Martin Casado sits down with Sujay Jayakar, co-founder and Chief Scientist at Convex, to talk about his team’s latest work benchmarking AI agents on full-stack coding tasks."}