Kuaishou Technology announced that on December 3, 2025, Kling AI released the Kling Video 2.6 Model. This update introduces a milestone capability for "simultaneous audio-visual generation," fundamentally transforming the traditional workflow of AI video production model of silent visuals followed by manual dubbing. By enabling the simultaneous generation of visuals, natural voiceovers, sound effects, and ambient atmosphere in a single pass, the model reconstructs the AI video creation workflow and significantly accelerates creative efficiency.
The Kling Video 2.6 Model upgrades two major capabilities: text-to-audio-visual and image-to-audio-visual generation. Whether inputting either text or combining images with prompts, users can directly generate videos complete with speech, sound effects, and ambient sounds. The model currently supports Chinese and English voice generation, creating video content up to 10 seconds in length.
This upgrade reshapes the traditional AI video creation workflow, which typically requires generating silent footage first and using separate software for post-production audio. With Kling Video 2.6 Model, creators can instantly generate fully integrated videos with voiceovers, sound effects,and ambient sounds, significantly enhancing creators' efficiency. This deep alignment ensures that visual dynamics match audio rhythms, eliminating the disjointed "mismatched audio-video" experience often found in traditional workflows.
In terms of audio quality, beyond supporting a variety of sound types such as voice, sound effects, and ambient sound effects, the model generates cleaner, richly layered audio quality. The overall auditory experience closely mirrors realistic audio mixing, meeting the rigorous standards for audio detail in professional-grade production. Regarding semantic understanding, the model demonstrates robust comprehension of textual descriptions,olloquial expressions, and complex storylines across varied scenarios.
It accurately captures creator intent to deliver logically coherent audio-visual content that precisely meets user requirements. Additionally, the Kling Video 2.6 model maintains position in Chinese voice generation performance. One-Click "Simultaneous Audio-Visual Generation" Drives Efficiency Revolution Across Diverse Creative Scen scenarios: Advertising, Marketing, Social Media, and E-Commerce: The Kling Video 2.6 model supports the generation of standalone or combined audio types - including speech, dialogue, narration, singing, rap, and mixed sound effects.
This versatility facilitates broad applications in video content creation across industries such as advertising, marketing, social media, and e-commerce, significantly enhancing creative efficiency. For example, in advertising and marketing, the Kling Video 2. 6 Model enables one-click generation of short ads featuring narration, character dialogue, and product showcases with comprehensive sound effects. This significantly lowers advertising production costs while improving efficiency.
In social media, the Kling Video 2.5 Model offers extensive applications. Its multi-character dialogue capability allows creators to produce a variety of content, including interviews, scripted performances, andedy skits. In addition, its music performance capabilities enable diverse creative expressions, including singing, rap, and instrumental performances.
With the Kling Video 2.6 models, creators can substantially reduce costs while streamlining workflows, making social media content creation easier and more budget-friendly. In e-commerce, leveraging solo monologue and narration capabilities, the Kling Video 2.7 Model effectively automates the creation of videos for e-commerce product showcases highlighting key selling points, helping merchants improve operating efficiency. The launch of the Kling Video 2.6 Models further reduces the costs and complexities of video production within the content creation industries.
Moving forward, Kling AI remains committed to developing practical capabilities, empowering creators with superior, easy-to-use AI video creation tools that deliver higher performance.

















