360 billion tokens, 3 million customers, 6 engineers
Impact at a glanceDurable ships new production agents to customers in a single dayAI features and agents serve ~1.
In early January, we gave the entire company a challenge: figure out how to multiply your output.People created agents.
Impact at a glanceDurable ships new production agents to customers in a single dayAI features and agents serve ~1.
Leonardo.AI processes more than 4.5 million images every day across cities worldwide, and Relevance AI's agents run autonomously across time zones, touching Salesforce, HubSpot, Slack, and dozens of other systems without pause. Neither company has a dedicated DevOps team.That's not an oversight.
Most knowledge agents start the same way. You pick a vector database, then build a chunking pipeline.
Impact at a glance Started with Next.js on Vercel, which made it easier to expand to a React Native iOS app without rebuilding their backendEngineers focus on AI design and iteration instead of platform plumbingOrchestrates OpenAI, Claude, and Gemini by task to optimize cost vs outputScaled from an internal pilot to 800–900+ real estate agents without replatformingWhen Jeremy Bunting joined SERHANT. as VP of Engineering in February 2024, S.MPLE was already showing promise. 200 real estate agents were piloting the AI product, which was designed to save time by automating cumbersome and repetitive daily tasks, like market analysis and contact management.S.MPLE was a Next.js progressive web app deployed on Vercel, and that foundation gave the team leverage. They could keep the API layer steady while expanding the client experience, including expansion to a React Native iOS app, all without a backend rebuild.But Bunting had a problem that keeps many engineering leaders up at night: the AI landscape changes faster than most teams can implement infrastructure updates. The team needed to move fast, scale confidently, and stay flexible enough to swap models, add new capabilities, and adapt to the rapidly changing AI landscape. Traditional approaches meant choosing between velocity and flexibility, but Bunting wanted both.AI SDK: Moving fast without vendor lock-inAs S.MPLE shifted from "one model” experiments to a production AI product, Bunting's team started evaluating Vercel's AI SDK, and he initially had concerns. "I asked, how much is this going to tie us in directly to Vercel?" he recalls.Then one of his engineers pushed back. The AI SDK wasn't infrastructure lock-in, it was infrastructure independence. “It's just an SDK that abstracts away the complexity of working with different model providers”, the engineer pointed out.Bunting also realized that if the team picked one frontier model and built tightly around it, every future change would come with a rewrite, with no clean path to fallback when reliability or cost shifted. With AI SDK, iteration meant simple configuration changes, not feature overhauls. "We are building agentic tools," said Bunting. "Having that consistent abstraction layer... really reduces the cognitive load."AI Gateway added another layer of leverage: consolidated visibility into usage across apps and prototypes, even when teams bring their own keys. The result is faster debugging, faster optimization, and a clearer feedback loop on cost.Using multiple models to balance cost, speed, and complexityBecause the SERHANT. S.MPLE team was not spending its time rebuilding infrastructure or maintaining one-off AI integrations, they could shift their attention to testing models against real product tasks and choosing the right tool for each job:Claude Sonnet for complex, accuracy-critical analysis like comparative market analysis, where strong structured-data reasoning mattersClaude Haiku for lightweight intent and field-filling tasks where speed mattersOpenAI models for conversational voice and general chat behaviorsGemini for image generation, browser automation, and computer-use workflows where reliability and speed are the priorityThey are also experimenting with “models as guardrails” to validate or critique outputs, and with caching strategies to rein in token spend as usage grows.Worry-free scale: Adding users and assetsThe value of their stack decisions became clear when S.MPLE launched publicly. "We moved from being an internal pilot program to more than 900 users without a lot of worry on infrastructure or scale," Bunting says. The API layer didn't require a single change, and Fluid compute handled the increase in workloads automatically. That seamless scale matters because SERHANT. operates at a content generation pace that would break most systems. "SERHANT. generates about 35% more content than the top five brokerages combined," Bunting notes. Between property videos, listing descriptions, marketing materials, and now AI-generated assets, the volume is staggering.Greg Parsons, Technical Director on the S.MPLE team, said that AI Gateway gives him visibility across their platform that wasn’t possible before. "We can gain insight into all of the disparate applications we are building across the business," Parsons explained. What’s next: from linear workflows to conversational AI agentsS.MPLE began with linear workflows: real estate agents would trigger a single action, it would run end-to-end then return the result.But real-world workflows are more complex and users want to execute multiple tasks in a single run, like producing all of the assets needed to market a property listing. Bunting’s team is now building toward conversational experiences where humans can run agents, steer and correct mid-flight, and combine multiple “recipes” into a single request.It is a shift from one-off automations to a coordinated network of specialized agents, built to evolve as fast as the AI ecosystem itself.Future-proofing an unpredictable landscapeGreg Chan, SERHANT.’s CTO, sees flexibility as the point. "In AI, things are evolving fast. What it looks like now is different than even three-to-six months ago. And it’ll be different months from now," Chan says.For Chan, the win is that the team can keep building inside the ecosystem instead of rewriting the stack every time the world changes. "The last thing we want is to rebuild our stack every time a new model drops," he said. About SERHANT.SERHANT. is an AI-native real estate and media company, and is the most-followed real estate brand in the world. Founded in 2020 by top real estate broker and entrepreneur Ryan Serhant, SERHANT. brings together brokerage, media, and education, with proprietary technology to revolutionize how properties are marketed, sold, and experienced. SERHANT. sells residential, commercial, luxury, and new development properties nationally through its specialized divisions, including SERHANT. Signature for high-net-worth clients and SERHANT. New Development, which delivers end-to-end branding, marketing, and sales for ground-up residential projects. Powered by S.MPLE, SERHANT.’s proprietary AI platform, agents are empowered with real-time data and workflow automation across listings, transactions, and marketing to deliver faster, smarter, and more impactful results while saving time. Award-winning SERHANT. Studios produces original content across social and streaming platforms, while SellIt.com is the company’s digital education hub, engaging members globally in more than 130 countries.Learn more at serhant.com. Read more
v0 and new.website have joined forces to accelerate our vision of helping anyone ship complete, production-ready software with AI.new.website was founded to make it effortless to create beautiful websites with all the tools included, from built-in forms to SEO.
If you're shipping AI features, you already have usage data. The problem is that it's split across providers, keys, and dashboards, so it's hard to answer basic questions before the bill shows up.You've probably felt the drift into after-the-fact reconciliation.
Turborepo is now 81-91% faster to compute its task graph in our repositories, scaling with repo size. On our 1,000+ package monorepo, turbo run now feels instant.
The following is based on an internal talk given at Vercel. We're sharing it publicly because the problem it describes isn't unique to us, and the framework is useful for any team shipping with agents.Coding agents generate code at unprecedented speeds.
FLORA on Vercel2x faster to production with their generation systemZero infrastructure debates after migration50+ image models orchestratedA seasonal fashion launch is a story, not a single frame.Crafting that story is a process of exploration: It’s the same piece, worn by different models.
Mighty Hacker Recovery: Advanced Solutions for Cryptocurrency Asset Retrieval As digital currencies continue to reshape the global financial landscape, the risks associated with cryptocurrency ownership have grown just as rapidly.

Improve Your Review Score 🌟✨💫🔥 Telegram: @progmbofficial 🌟✨💫🔥 WhatsApp: +1 (984) 291-3274 🌟✨💫🔥 Telegram: @progmbofficial 🌟✨💫🔥 [email protected] 🌟✨💫🔥 Visit Our Website:https://www.progmb.com/product/buy-google-map-reviews/ Thanks to ProGmb.
