Realtime AI avatar and video agent interface
Wan Streamer logoWan StreamerTrack API

Wan-Streamer v0.1 research tracker

Wan Streamer API Tracker

Follow Alibaba's realtime video AI model, API availability, Hugging Face updates, and production-ready alternatives for interactive AI avatars and video agents.

Public API

Not available yet

Open weights

Not released

Hugging Face

Paper only

Best current path

Avatar API + realtime LLM

API status

Realtime video AI is early, but the stack is already forming.

Wan-Streamer is different from normal text-to-video tools: the paper describes a model that can listen, watch, reason, speak, and generate video responses in a realtime loop. Public access has not launched, so this page tracks what is usable now and what changes.

Wan-Streamer API

Pending

No public API endpoint yet.

Model weights

Pending

No official downloadable weights.

Research

Available

Project page, arXiv paper, HF paper.

Alternatives

Products you can build with today

Most production systems combine a realtime LLM, speech, avatar rendering, and WebRTC instead of using one end-to-end model.

Tavus CVI
Realtime video agent API
Available
HeyGen Interactive Avatar
Realtime avatar rendering
Available
D-ID Streaming
Talking avatar streams
Available
Simli
Developer avatar API
Available
Wan-Streamer v0.1
End-to-end realtime audio-video model
Research demo

Realtime AI avatar

Track the stack for low-latency face-to-face video agents, from speech and LLMs to avatar streaming.

AI video support

Compare APIs for website sales agents, product explainers, onboarding guides, and customer support.

Live AI host

Follow tools that can power interactive AI anchors for education, shopping, livestreams, and role play.

Wan Streamer API FAQ

Is there a public Wan Streamer API?

Not yet. Wan-Streamer v0.1 is currently a research project and demo, not a public commercial API or downloadable model.

Is Wan Streamer the same as Wan 2.7 video generation?

No. Wan 2.7 APIs generate or edit videos asynchronously. Wan-Streamer is about realtime audio-video interaction.

What can I use before Wan Streamer ships?

Teams usually combine OpenAI Realtime or another realtime LLM with Tavus, HeyGen, D-ID, Simli, Azure Avatar, LiveKit, or Pipecat.

What will this site track?

Public API status, Hugging Face updates, papers, GitHub releases, alternatives, latency claims, pricing, and integration notes.