icon
icon
icon
icon
$300 Off
$300 Off

News /

Articles /

Chinese Video Generation Models Revolutionize AI Landscape with Creative Advancements

Word on the StreetSaturday, Oct 5, 2024 9:00 pm ET
1min read

In recent times, there has been a surge in the development and public release of Chinese video generation models. Industry experts view this technology as a significant focus within the AI sector, rapidly advancing and poised to make a substantial impact on fields such as film production and advertisement design.

Recently, Volcano Engine, a subsidiary of ByteDance, unveiled the Doubao video generation model. This model is noted for its capability to generate consistent multi-shot scenes, dynamic camera movements, and support for 3D animation. The team highlighted its innovative diffusion model training approach, which resolves the challenge of maintaining consistency across multi-shot transitions without compromising the subject, style, or atmosphere.

Another notable development is the release of a video generation model by Tongyi Wanxiang, capable of producing detailed animations from textual descriptions. This model improves upon challenges such as motion generation and physical simulation, offering realistic portrayals that can be utilized in film creation, animation design, and advertising.

The rise of these video generation models has drawn considerable attention within the global AI industry. Companies like Kuaishou, Shensu Technology, and Zhipu AI are swiftly launching their products, showcasing the industry's momentum.

According to Deng Daozheng, Deputy Director of Saizhi Industry Research Institute, these developments are expected to significantly influence industries such as media, advertising, education, and the metaverse by reducing costs and production times in short video, live streaming, and film production.

However, while many models have emerged, experts emphasize the need for evolution from quantity to quality. Tang Jiayu, Co-Founder and CEO of Shensu Technology, points out a common issue—insufficient controllability and consistency, particularly with maintaining subject coherence in complex interactions.

Despite significant technical progress, Deng notes that video quality and continuity remain areas for improvement. Models struggle with complex scenes, often resulting in disjointed or defective visuals. Additionally, their understanding of natural language prompts is limited, leading to random and sometimes incoherent outputs.

In response, companies are accelerating model iterations. For instance, Vidu, developed by Shensu Technology in collaboration with Tsinghua University, updates its "subject reference" feature, allowing users to maintain subject consistency by uploading a single image of the subject. This enhanced capability offers seamless scene transitions driven by descriptive inputs.

Looking forward, Deng suggests fostering innovation and collaboration among enterprises, universities, and research institutions. Investing in core algorithmic advancements, building comprehensive datasets, and expanding application scenarios are crucial for elevating the quality of video generation and ensuring its broad adoption and commercialization.

Comments

Add a public comment...
Post
No Comment Yet
Disclaimer: the above is a summary showing certain market information. AInvest is not responsible for any data errors, omissions or other information that may be displayed incorrectly as the data is derived from a third party source. Communications displaying market prices, data and other information available in this post are meant for informational purposes only and are not intended as an offer or solicitation for the purchase or sale of any security. Please do your own research when investing. All investments involve risk and the past performance of a security, or financial product does not guarantee future results or returns. Keep in mind that while diversification may help spread risk, it does not assure a profit, or protect against loss in a down market.
You Can Understand News Better with AI.
Whats the News impact on stock market?
Its impact is
fork
logo
AInvest
Aime Coplilot
Invest Smarter With AI Power.
Open App