Building AI Agents for Enterprise Operations
Pablo Palafox and Luis Parra, co-founders of Happy Robot, discuss how they built a voice AI platform for enterprise logistics and supply chain operations, starting with freight broker use cases and expanding to serve major global enterprises. They explain how voice was the critical unlock for automating complex operational workflows, and how their forward-deployed engineering model helped them build a flexible platform that solves enterprise coordination problems across industries.
Summary
Happy Robot was founded by Pablo Palafox, Luis Parra, and Javi (Pablo's brother), who met during college and were drawn to complex technical challenges. The company originated from Javi's experience as CFO of a large olive oil distributor, where he had to hire interns just to call drivers and track shipments. Recognizing voice as the critical unlock for logistics operations, they built fine-tuned LLMs (Mistral and Llama) in late 2023 to power realistic, low-latency voice agents capable of negotiating freight rates and tracking shipments—capabilities that larger, slower models like GPT-4 couldn't deliver at the time.
The founders describe their core technical philosophy as always identifying and solving the 'limiting factor' rather than reinventing everything or simply wrapping existing APIs. A key early innovation was their approach to negotiation: rather than exposing maximum buy rates to the LLM and risking hallucination, they built external deterministic negotiation algorithms where the agent would 'ask permission' by calling a tool, mimicking how a human salesperson would check with their manager. This mix of probabilistic AI and deterministic guardrails was essential for winning enterprise trust with companies like C.H. Robinson and Uber Freight.
Happy Robot has scaled to serve nine of the top ten US freight brokers, seven of the top ten tracking companies, and two of the largest ocean carriers globally, including CMA-CGM. A highlighted use case with Kuehne+Nagel illustrates the depth of their orchestration: a simple 'where is my shipment?' query triggers a chain of agents that scrape airline websites, send emails, monitor SLA timelines, and place phone calls until the information is retrieved—far beyond simple knowledge-base customer support.
The company employs a forward-deployed engineering (FDE) model, where engineers embed with customers to discover workflows, seed the platform's context layer, and deploy agents tailored to each enterprise's specific operations. The founders emphasize that enterprises operate too differently for a one-size-fits-all approach, and that the real value is in building a flexible platform around a deployment lifecycle—scoping, building, testing, monitoring, and self-learning—rather than hardcoding specific tasks.
They introduce the concept of 'Twin,' a data layer that connects customer systems of record (CRM, ERP, TMS, Snowflake) with Happy Robot-native agent-generated records, arguing that executing work via agents progressively cleans and enriches enterprise data rather than requiring clean data as a prerequisite. This execution-first approach builds a compounding context layer that allows agents to climb what they call the 'pyramid of complexity'—from high-volume, repeatable tasks at the base to low-volume, high-value strategic decisions at the top.
The founders argue that the core problem they're solving is not supply chain-specific but an enterprise coordination problem applicable across industries. With DHL, they've deployed over 40 agents across 80 countries. They are now seeing pull from telecoms, utilities, insurance, and oil and gas—industries sharing the same pattern of fragmented information, complex multi-party coordination, and operationally messy workflows.
On voice AI specifically, Luis argues that end-of-turn detection and conversation flow management—knowing when to speak, when to wait, and when to pause and reason—are the real bottlenecks to deployment, not latency or voice realism. They have invested heavily in interruption handling, filler word detection, and background noise filtering. The founders believe the human-like quality of the experience is critical: even when users are told they're speaking to an AI, they quickly forget and engage naturally, which makes the technology effective in the real world.
Key Insights
- Luis argues that end-of-turn detection—knowing when to talk and when not to talk—is the primary bottleneck in voice AI deployment, not latency or voice realism, and that faster models can actually make conversation flow worse by increasing unwanted interruptions.
- Happy Robot designed their negotiation agents to never see the maximum buy rate in their context window; instead, agents call an external deterministic tool to 'ask permission' for higher rates, mimicking how a human would check with a manager and preventing hallucination of pricing.
- Pablo claims that the enterprise coordination problem they discovered in logistics—fragmented information across systems, teams, emails, and phone calls—is the same fundamental problem faced by utilities, telcos, and insurers, which is driving their cross-industry expansion.
- The founders argue that agents executing work actually clean enterprise data as a byproduct, rather than requiring clean data as a prerequisite, because AI is more diligent than humans at recording information consistently into the right systems.
- Luis describes sharing negotiation context across simultaneous inbound calls for the same freight load, so agents can signal to each other that a load is 'hot' and push harder—a capability that requires cross-agent context sharing that no general-purpose model provides out of the box.
- Pablo describes their 'pyramid of complexity' framework, arguing that the highest economic leverage for enterprises lives at the top in low-volume, highly contextualized strategic decisions, but that companies cannot reach those decisions without first capturing context from high-volume base-level operations.
- The founders assert that their forward-deployed engineers function as context seeders for the platform's state graph, and that the deployment lifecycle itself—not just the agents—is the core product, with FDEs serving as an extension of product development rather than a services team.
- Pablo argues that automating operationally tedious work like payment collection and shipment tracking allows human employees to redirect their time toward relationship-building, citing a DHL example where employees previously tied up scheduling deliveries by phone were freed to take customers to dinner.
Topics
Full transcript available for MurmurCast members
Sign Up to Access