Small models, serious accuracy.
Seed AutoArch just hit 94.42% on the official Banking77 test set — under a strict full-train protocol, no leakage — at ~68.4 MiB and ~225 ms on CPU. That is the shape humanoid robotics actually needs on-board.
Built for teams shipping
02 · The Frontier
Pick the shape you ship.
The Seed AutoArch frontier is a family of models, not a single checkpoint. Slide between top accuracy, balanced, and most-efficient to see the tradeoffs on the Banking77 benchmark.
Seed AutoArch · High-Accuracy
When the ceiling matters. Push Banking77 quality higher than the published Test SOTA while staying small enough to run on CPU without apology.
Accuracy
94.42%
+0.59pp over Test SOTA
Inference
~225 ms
end-to-end
Footprint
~68.4 MiB
502,170 heads-only
Efficiency
Pareto
20–30× smaller · 50–100× faster on CPU
03 · Benchmark
The receipts.
Every row below is a published result or a SeedFrontier-measured number on the same official Banking77 test set, with matching protocol. No cherry-picked splits. No hand-waving.
SPACE
Current Main SOTA
Seed AutoArch · High-Accuracy
SeedFrontier
Test SOTA (CUD)
Published baseline
Seed AutoArch · Efficient
SeedFrontier
04 · Robotics
Humanoids don’t need bigger models.
They need the right shape — small, fast, accurate enough to run on-robot without apology. A 70B parameter model does not fit in a torso powered by a battery. A 68 MiB model in 225 ms does.
Compute is trapped on-robot
Humanoid platforms run on embedded accelerators with thermal and power budgets measured in watts. Cloud offload is not an option when control loops run at kilohertz rates.
Latency is standing vs. falling
Balance, locomotion, and manipulation policies run anywhere from 30 Hz to 1 kHz. Every millisecond of inference is a direct physical constraint on what the robot can do.
Battery budgets punish bloat
Every joule spent on inference is a joule not spent on actuators. Smaller, faster models extend runtime and make concurrent on-robot skills possible.
05 · The control stack
Every layer is a deployment constraint.
A humanoid runs many models at once, at wildly different frequencies and sizes. Large general models cannot fill these layers. Small, specialized, production-shaped models can.
06 · Get in touch
Ship the right shape.
If your platform has a thermal budget, a battery, and a control loop, the shape of your models is the shape of your product. Let’s talk about what Seed can build for you.
Talk to us about
- On-robot perception and control models
- Custom frontier tuning for your hardware target
- Banking77-class benchmarks for your domain
- Deployment profiles under strict latency budgets