What StepFun does

StepFun (Jieyue Xingchen) is a China-based AI company that develops foundation models and AI tools. Headquartered in Shanghai, it builds large language models and multimodal systems spanning language, vision, video, and audio, and offers applications and an AI assistant built on those models.

Key capabilities

StepFun has released a series of Step foundation models, including a large Mixture-of-Experts language model and multimodal models. It has open-sourced models such as Step-Video for video generation and Step-Audio for speech interaction. Its products cover knowledge-base question answering, image creation, and multimodal reasoning.

Who it's for

StepFun serves developers and organizations building AI applications that require language understanding, multimodal generation, and reasoning. As a provider of foundation models and AI agents, it supports use cases ranging from content creation to interactive assistants. It is one of the more prominent Chinese startups developing foundation models.