China embodied AI funding has surpassed $4 billion in early 2026 — and the biggest winners aren’t the robot makers, they’re the companies selling training data.
While Robots Learn to Walk, Data Infrastructure Plays Are Already Cashing In
China’s embodied intelligence sector has raised over RMB 30 billion (~$4.1B) in early 2026 alone, with 20+ companies now valued above RMB 10 billion (~$1.4B).
But here’s the twist:
Most of that capital isn’t flowing to robot manufacturers.
It’s going to the companies selling the high-quality, real-world training data.
On April 16, two major players unveiled competing data infrastructure plays on the same day:
- JD.com: Launched a full-chain embodied AI data platform covering collection, storage, labeling, training, evaluation, simulation, and testing
- Mifeng Tech (spun out of Agibot): Debuted a “one-stop physical AI data service” targeting the same bottleneck
This isn’t coincidence. It’s confirmation:
The industry’s constraint has shifted from hardware (“can it move?”) to data (“can it learn?”).

The Data Economy: Pricing, Types, and Scale
Not all robot data is equal. The market has segmented into three tiers:
| Data Type | Description | Price (RMB/hour) | Use Case |
|---|---|---|---|
| Simulation data | Generated in virtual environments | Not transparent | Edge-case training, rare scenarios |
| Wearable/no-body data | Human motion captured via sensors (no robot) | ~RMB 150–300 (~$22–44) | Pre-training, coarse behavior learning |
| Real robot data | Actual robot executing tasks in real settings | ~RMB 500–1,000 (~$70–140) | Fine-tuning, production deployment |
At current pricing:
- 1 million hours of real robot data = RMB 1 billion (~$140M) market
- 10 million hours = RMB 10 billion (~$1.4B)
And demand is insatiable:
“We’re in a ‘buy whatever you can get’ market,” Yao confirmed.
The Crowdsourcing Playbook: JD vs. Mifeng
Both JD and Mifeng are scaling via asset-light, crowdsourced collection—but with different leverage:
JD.com: The Workforce Advantage
- Deploying JoyEgoCam, a wearable HD capture device
- Targeting 600,000 collectors from its 900,000+ employee base (couriers, warehouse staff, delivery riders)
- Goal: 10 million hours of human-scene video data within 2 years

📌 China Embodied AI Funding – Investment Takeaway:
For investors, the signal is clear:
The companies building robots may take years to monetize.
The companies selling them data are earning revenue today.
Key implications:
- Near-term winners: Data infrastructure providers (JD, Mifeng, and emerging specialists)
- Mid-term leverage: Hardware vendors that embed data collection into their sales model (Agibot via Mifeng)
- Long-term moat: Standardization, quality control, and cross-platform data interoperability
Risks to watch:
- Fragmentation: No industry-wide data standards yet—limiting reuse across platforms
- Quality variance: Crowdsourced collection risks inconsistent labeling and context
- Regulatory uncertainty: Physical interaction data may face new privacy and safety rules


