Release Notes
1.17.0
Introduced advanced scheduling with Volcano integrated for vGPU and MIG support, enhanced infrastructure visibility with cluster resource health checks, and added deployment pending status and runtime environment tracking. The 1.17.0 update also upgrades the Agent & Skills Hub with multi-sync capabilities, improves security and authentication, and brings significant enhancements to storage and Git performance (optimized for 300k+ file repositories), reporting, system stability, and multiple bug fixes including AMD GPU inference operations. Github
Introduced Agent Memory Service for persistent context, vGPU and AMD GPU support for inference and finetuning, multi-host inference with vLLM and SGLang, PVC persistent storage for Spaces, Text-or-Image-to-Video (TI2V) multimodal capabilities, and Gradio SDK 6.2.0 upgrade, along with broad improvements to mirror syncing, content moderation, and architectural cleanup. Github
Support for local deployment of sensitive word detection services to meet enterprise compliance and data security requirements. Added License deletion capability and fixed issues related to License restrictions. Optimized JSON editing experience in the system configuration module to improve backend configuration efficiency.【EE Exclusive】
1.15.0
Introduced advanced AI Gateway capabilities (TTS and Function Calling) and significantly improved Application Spaces, community experience, resource management, and observability, with broad performance and stability enhancements. Github
Inference services now support gradual rollout of new model versions on running instances, with both Canary and Blue-Green deployment strategies available for safe and seamless upgrades. (Enterprise Edition only)
1.14.0
Migrated the core deployment scheduler to Temporal Workflow, greatly improving reliability and scalability, while expanding support for MCP, Jupyter environments, and finetuning workflows. Github
Introduce XNet Smart Trunk Accelerator, significantly enhance storage efficiency and development experience. (Enterprise Edition only)
1.12.0
Strengthened core consistency and scalability with atomic repository creation, automatic runner discovery and cluster auto-scaling, and a new streamable protocol for MCP Space execution. Github
1.11.0
Fully refactored the Runner service for secure, flexible operation both inside and outside Kubernetes, while improving error handling, i18n notifications, and workflow stability. Github