Merchant-fed price intelligence network with an MCP-native API — the infrastructure layer that makes merchant catalogues discoverable by AI shopping agents. Built for Southeast Asia. Merchants who join get indexed. Merchants who don't become invisible to AI shoppers.
6-layer pipeline from merchant data sources to AI agent consumers
How merchant catalogues flow in — from 5 different source platforms
Merchants should be able to connect their store in under 5 minutes with zero technical knowledge. Every source connector meets them where they already are — their existing platform. No new tools to learn. The more friction-free the on-ramp, the faster the merchant count compounds.
Priority channel — cleanest integration
Seller API — authenticated pull
WordPress + REST API plugin
Many merchants already have this — zero extra work
Fallback for any platform — including Lazada CSV export
Why raw merchant data is useless without normalisation, and how to solve it
Merchant A lists "Nike Air Max 270 Men UK10 Black". Merchant B lists "Nike running shoe men sz10". Merchant C lists "AM270-BLK-10". These are the same product. Without normalisation, cross-merchant price comparison is impossible — you're just comparing noise.
Collector receives raw product record. Name, price, description, category (if any), images. Stored as-is in staging table.
LLM reads raw product name + description. Extracts: brand, product_line, model, variant (colour, size, material), category_l1/l2. Outputs structured JSON.
Normalised product name + extracted attributes → text-embedding-3-small (or local model). 1536-dim vector stored in pgvector.
New product embedding queried against existing catalogue. If cosine similarity >0.92 → same product entity. Linked to existing entity ID. If <0.75 → new entity created.
0.92–1.0 = auto-merge. 0.75–0.92 = human review queue. <0.75 = new entity. Edge cases flagged for merchant confirmation.
Merchant + entity ID + price + timestamp written to Price History DB. Entity now has N merchant listings attached to it.
Four specialised data stores — each optimised for its access pattern
Master entity registry — one row per unique product
Time-series price per merchant per product
Auth, consent, tier, and integration config
Enables natural language product search
Four AI-powered engines that turn raw price data into actionable intelligence
Market positioning intelligence for merchants
Velocity signals before consumers see the trend
Real-time competitive intelligence pushed to merchants
The bridge between human language and structured product data
search_products MCP tool — callable by any AI agentA bank underwriting an SME loan wants to know: is this merchant's pricing competitive? Are they losing market share? Is their category growing or declining? The Intelligence Layer produces exactly these signals — and no bank in Malaysia currently has access to cross-platform, cross-merchant price intelligence. That's a data licensing deal at RM50k–500k/year per buyer.
Model Context Protocol — the standard that makes you natively queryable by every AI agent
MCP (Model Context Protocol) is the emerging open standard for AI agents to talk to external tools. Anthropic released it, OpenAI adopted it, every major LLM provider is implementing it. It's the USB-C of AI integrations — build once, work everywhere. A merchant indexed in your MCP server is reachable from ChatGPT Operator, Claude, Gemini, Jarvis, and any future AI agent. No custom integration per platform. You become the single source of truth for product search in SEA.
User says: "Buy me the cheapest running shoes under RM200, size 10, deliver by Friday". Agent decides it needs to search for products.
Agent calls your search_products MCP tool with structured parameters: category, max_price, size, delivery_deadline.
Parameters passed to Intelligence Layer. Vector search + SQL filters applied. Top 5 matching products returned with full details.
Returns: product name, merchant name, price, stock status, checkout URL, estimated delivery. Agent picks the best match.
purchase_product MCP tool. Agent passes checkout URL, user payment token. Transaction completes through your platform. You earn GMV commission.
Three interfaces serving three different consumer types
Standard HTTPS · OpenAPI 3.0
GET /products — search catalogueGET /products/:id/prices — all merchant pricesGET /categories/:id/benchmark — market statsPOST /merchants/connect — onboardingGET /merchants/me/dashboard — SaaS dataAI agent protocol · the future interface
search_products, compare_pricespurchase_product, track_orderPush-based · event-driven
Three phases — each unlocking a new revenue stream as the data pool grows
| Stream | Who Pays | Unit Economics | Month 6 Est. | Month 24 Est. |
|---|---|---|---|---|
| Merchant SaaS | Merchants (direct) | RM99–999/mo per merchant | RM15k MRR | RM120k MRR |
| API per-query | AI platforms / developers | RM0.01–0.05 / query | – | RM20k MRR |
| Data licensing | Banks, FMCG, PE | RM50k–5M / annual contract | – | RM500k ARR |
| Consumer affiliate | Merchants (CPC) | 2–8% per click-through purchase | – | RM30k MRR |
| GMV commission | Merchants (transaction) | 0.5–2% of AI-driven GMV | – | RM80k MRR |
10 weeks to a working MVP. 12 months to a fundable business.
No well-funded player has built a merchant-fed, MCP-native commerce index for Southeast Asia yet. OpenAI Operator and Perplexity Shopping are US-centric. The regional incumbents (Lazada, Shopee) have their own data but no cross-platform aggregation and no MCP layer. You have roughly 18 months before a funded competitor notices this gap. The moat compounds fast once you have 200+ merchants — switching to a new platform means re-integrating all their sources. Move now.