
AI in Supply Chain - 8 of 10
Agent-to-Agent Negotiation — Your Digital Team Starts Talking
In our last few articles, we've built a powerful foundation. We gave our agents a nervous system with MCP (Part 5), a library card with RAG (Part 6), and a relational map of our entire operation with Graph RAG (Part 7). Our agents can now sense issues, understand context, and see the ripple effects of every disruption.
But so far, their collaboration has been like a well-drilled relay race: one agent runs its leg and passes the baton to the next in a sequence dictated by an orchestrator. The path is predetermined. This is efficient — but it's not resilient.
In my 20+ years in warehouse and supply chain operations, I learned that the most resilient warehouses aren't the ones with the most rigid processes. They're the ones where the receiving supervisor can have a quick, smart conversation with the outbound lead to solve a problem on the fly. Thirty seconds of negotiation between two experienced people can prevent hours of downstream chaos.
This is the final capability piece before we talk governance: moving from orchestrated collaboration to active Agent-to-Agent (A2A) Negotiation. We're not just connecting our agents — we're teaching them how to talk, bargain, and compromise to achieve the best outcome, all within your rules and without human intervention for routine exceptions.
From a Relay Race to a Rugby Team
Think of the progression this way. Multi-agent orchestration — what we have now — is a relay race. The Alert Agent runs its leg and passes the baton to the Reasoning Agent, who passes it to the Action Agent. The sequence is fixed.
A2A Negotiation is a rugby team. The players have specialised roles, but they can pass the ball to whoever is in the best position to advance it. They communicate constantly, adjusting strategy in real time based on what's happening on the field.
A2A negotiation allows agents to propose solutions, evaluate counter-offers, and converge on a plan that respects each other's constraints — labor capacity, equipment availability, carrier cutoffs, budget caps. It's the digital equivalent of your best supervisors huddling for 30 seconds to avert a crisis.
How Agents Actually Negotiate
A2A negotiation uses a structured pattern called a contract-net protocol — a proven approach adapted for warehouse operations. Here's how it flows:
- The Orchestrator broadcasts the problem, constraints, and deadline — this is the Call for Proposals.
- Relevant agents submit plans, each with a cost, time, risk estimate, and cited sources — these are the Proposals.
- The Orchestrator scores proposals against policy and selects the best — this is the Award.
- The winning agent executes; others stand down — this is the Commit and Execute step.
- The full negotiation is logged for compliance, learning, and refinement — this is the Audit.
Every proposal must include an action plan with steps and duration, a cost and time impact in real numbers, the risks and preconditions with data freshness timestamps, citations from RAG sources and Graph RAG paths, and a confidence score.
This isn't free-form chatting. It's structured, auditable, and bounded by your policies.
To make this intuitive, think of three building blocks. Intents are a clear statement of need — "I need 10 units of this SKU at staging lane D-07 by 16:30." Bids are offers to fulfil the intent with trade-offs — "I can cross-dock from the inbound truck. Cost: plus $12 labor. Risk: 30-minute putaway delay." And Utility Functions are a simple scoring system that aligns each agent's priorities with your business goals — for example, the Outbound Agent might weight on-time dispatch at 80%, labor cost at 10%, and freight cost at 10%. Think of these as negotiation tokens — labor hours, freight dollars, SLA minutes — that agents trade to find the least disruptive resolution.
The Late Shipment Scenario: A Live Negotiation
Let's revisit Shipment S-123, at risk due to a shortfall of SKU #ABC-123. In Part 6, a RAG-enabled agent proposed a substitution. In Part 7, Graph RAG traced the ripple effects across docks, carriers, and downstream shipments. Now let's see how a team of negotiating agents resolves it — faster and more optimally than any single agent could.
The participants, all connected via MCP, include a WMS Agent that knows pick status and cross-dock feasibility, a TMS Agent that knows carrier cutoffs and available capacity, a Labor Agent that knows who's free and expected pick times, a Finance Agent that enforces budget caps, a Customer Rules Agent that validates substitutions and labeling, and a Compliance Agent that regenerates shipping documents and notifies stakeholders.
The Orchestrator kicks things off at 15:30: "S-123 is at risk. Need a plan to fulfil 10 units by 16:30. Carrier cutoff at 17:00. Budget delta no more than $50. Customer allows substitution per their SOP. Proposals due in five minutes."
Two proposals come back. The Inbound Agent proposes cross-docking from an inbound truck arriving soon — cost is $12 in labor, but Graph RAG reveals this delays putaway for two other shipments on the same dock. The Inventory Agent proposes using the approved substitute SKU — cost is $45 in freight re-rate due to weight difference, but no ripple to any other shipment.
The Orchestrator evaluates and awards Proposal B — the substitution — because the $45 cost is known, contained, and within the budget cap, while the cross-dock risk is uncontained. The Inventory Agent allocates the substitute. The TMS Agent regenerates the bill of lading. The Finance Agent logs the cost variance. The Compliance Agent confirms labeling. The entire resolution is logged with full source citations.
The whole process took 47 seconds. No planner ping-pong. No last-minute panic calls. And every decision is traceable.
See the difference from Part 4, where we were still sending alerts and waiting for humans to mediate? That's the shift — from coordination toil to autonomous resolution.
Three More A2A Patterns That Pay Off Fast
The Late Shipment scenario is just the beginning. Here are three more high-friction workflows where A2A negotiation delivers immediate value.
Dock Rebooking on ETA Slips
Trigger: A carrier visibility feed moves an inbound ETA by 60 minutes or more. The TMS Agent proposes a new dock slot. The Yard Agent checks door availability and dwell exposure. The Labor Agent rebalances the unload team. The Finance Agent caps detention cost. Result: rebooked within minutes, dwell fees avoided, dock conflicts reduced.
I remember managing a large multi-site rollout where dock conflicts were resolved by phone calls between shift leads at different locations. Imagine if those systems could have negotiated directly — recovery would have been minutes, not shifts.
Wave Reprioritisation for Hot Orders
Trigger: A high-priority drop order lands during an active batch wave. The WMS Agent proposes a micro-wave injection. The Labor Agent bids crew availability and estimated completion time. The Packaging Agent confirms carton and label templates are compatible. Result: hot order injected without disrupting the main wave, SLAs met with minimal human intervention.
Recall and Hold Containment
Trigger: A supplier lot recall notification arrives. The Quality Agent requests a hold. The WMS Agent traces all affected inventory across picks, waves, and staged shipments using Graph RAG. The TMS Agent halts manifested but not yet shipped loads. The Customer Agent drafts notifications per SLA requirements. Result: containment in minutes instead of days. This is critical in food, pharma, and any regulated environment — the cost of a missed recall isn't just financial, it's reputational.
Building Your First A2A System
Just like we discussed in Part 3, you don't need perfect infrastructure to start.
Use MCP as your foundation. Register your negotiation tools in MCP: propose trade, counter offer, accept trade, rebook dock, reallocate labor, regenerate bill of lading, apply hold. All agents share the same tool registry, so negotiations stay consistent.
Ground every proposal with RAG for policies and contracts, Graph RAG for ripple analysis, and live APIs for real-time facts. No proposal should be awarded without cited sources and quantified impacts.
Add a Policy Engine using simple rules encoded in YAML or JSON to enforce constraints: budget caps with auto-approve thresholds and human approval above, SLA tiers where your top customers are never deprioritised, confidence thresholds below which the system escalates to a human, and round limits so a negotiation that can't resolve within three counter-offers or 60 seconds falls back to a safe default.
Enforce Separation of Duties: the agent that awards the contract — the Orchestrator — should never be one of the bidders. This prevents biased decisions and mirrors the financial controls your operations team already understands.
Manage Chatter by using topic-based subscriptions in MCP so only relevant agents participate. The Packaging Agent doesn't need to bid on a dock conflict. Rate-limit intents and whitelist eligible bidders by topic to keep things clean.
Your 30-60-90 Day Rollout Plan
Days 1 to 30 — Pilot One Bounded Negotiation
Start with dock door scheduling conflicts between inbound and outbound. Define a minimal proposal-and-award schema. Register two or three MCP tools. Simulate 10 to 20 historical conflicts with a human planner reviewing and approving every award. Measure planner decision time before and after — aim for a 50% reduction.
Days 31 to 60 — Add Cost Intelligence and Auto-Approve
Move to labor reallocation for late shipments. Implement utility functions and policy gates. Allow auto-approvals for decisions within your low-risk thresholds. Integrate the Finance Agent for budget validation. Add Graph RAG to flag knock-on conflicts. Target 40 to 50 percent of minor conflicts resolved autonomously with MTTR improving alongside.
Days 61 to 90 — Expand to Cross-System Negotiation
Extend to WMS and TMS carrier slot negotiation — the WMS Agent detects an early-ready shipment and the TMS Agent bids with new pickup options and costs. Harden response time targets so floor-facing decisions land in under 60 seconds. Add chaos tests for conflicting proposals and stale ETAs. Measure planner time saved per shift, rework rate, on-time dispatch improvement, and prevented detention fees.
When NOT to Use A2A
Not every problem needs a negotiation. If a single system can handle a deterministic action, just call the API — don't negotiate what you can execute directly. For low-stakes lookups like "where is this pallet?", a copilot answer is enough. And if you don't have clear policies or budget boundaries yet, write the rules first. Agents can't negotiate without boundaries.
What to Watch Out For
Deadlocks happen when agents can't converge. Set maximum round counts and wall-clock limits. If no agreement is reached, fall back to the lowest-risk compliant plan or escalate to a human.
Conflicting local goals are a common trap. Make sure each agent's utility function aligns with your global KPIs. The Orchestrator's job is to enforce the big picture — service level first, cost second, efficiency third.
Stale data turns proposals into liabilities. Every bid must include timestamps on its data sources. A proposal based on a two-hour-old ETA is a risk, not a solution.
Trust gaps are real early on. Start with human-gated decisions. Let the audit data build confidence before expanding autonomy.
And in multi-client environments, enforce access controls in MCP. An agent serving Client A must never see Client B's data. Test for leakage explicitly.
Final Thoughts: Doing More with Less
We've now connected the full arc: alerts (Part 4), shared tools and context (Part 5), facts and policies (Part 6), network effects (Part 7), and now negotiations (Part 8).
This is where your warehouse starts to feel self-healing. Attention is the scarcest resource on the floor. Every minute a planner spends mediating between systems is a minute not spent on strategic improvements. A2A eliminates most of that coordination toil by compressing decision loops into seconds, within your cost and SLA boundaries.
And here's the compounding effect: every resolved exception becomes a reusable pattern. The next time a similar conflict arises, the system already knows the playbook. Your agents don't just execute — they learn.
Question for you: If two agents in your operation could start negotiating tomorrow, which pair would save your team the most time — WMS and TMS for dock conflicts, or WMS and Labor for hot picks?