China's home-grown video-generating AI tools go creative, productive

XINHUA
Published 8 hours ago • Zhou Zhou, Zhang Manzi, quanxiaoshu(yidu), Fang Zhe
A staff member introduces the use of AIGC technology in ancient book restoration to a visitor during the 2024 World AI Conference in Shanghai, east China, July 6, 2024. (Xinhua/Fang Zhe)

BEIJING, Nov. 19 (Xinhua) -- China's generative AI tools are carving out a unique niche, offering a blend of entertainment and practical benefits, while also playing a key role in preserving cultural heritage.

Among them, an image-to-video tool called Vidu-1.5, launched last week by a Beijing-based AI startup, has been billed as a multimodal model that supports multi-entity consistency.

In practice, this means the AI can generate a video from as few as three input images. For example, in a video shared by the company, the inputs -- "A man, a futuristic mecha suit, and a bustling cityscape at night" -- are seamlessly blended into a cohesive montage, all within just 30 seconds.

Understanding and controlling multiple entities -- such as the person, attire and environment -- has always been the biggest challenge in AI-generated video technology.

Ever since OpenAI, the developer of ChatGPT, introduced its pioneering Sora, multiple Chinese tech firms have swiftly stepped up to the plate, rolling out offerings that boast unique characteristics. ShengShu Technology's Vidu is one popular example.

"Look how consistent is that suit," Stefano Rivera, an AI product aficionado, wrote admiringly in a tweet, calling himself a "super-fan" of Vidu "from day 1."

This AI-generated content (AIGC) tool has already ignited a surge of creative enthusiasm among individual creators worldwide, leading to playful and imaginative clips like Leonardo DiCaprio showcasing haute couture on the runway, Elon Musk cruising on an electric scooter in a flamboyant Chinese jacket, and a series of Japanese anime scenes.

Vidu's greatest breakthrough is establishing logical relationships among multiple user-specified objects within a scene, said Tang Jiayu, the CEO of ShengShu Technology, in a written response to Xinhua.

With previous text-to-video tools, generating scenes like "a boy holding the cake in a crystal setting" would yield different images of the boy, cake, and crystal each time, much like opening a blind box. Now, with multi-subject consistency, the identities of the boy, cake and crystal can be preserved throughout the video, maintaining true-to-nature continuity, said Tang.

Chinese entrepreneurs like Tang, along with global investors with substantial capital, are rapidly pouring into the AIGC sector, expanding their market footprint in China.

In August, Zhipu AI launched its large video generation model product Ying. This month, Kuaishou, a leading video platform in China, rolled out its KLING AI app on the Apple and Android app stores, featuring a continuation function that allows users to extend their generated videos by up to about three minutes.

Last week, Forbes China's top 50 innovative companies list featured eight large-scale model companies, which constituted the highest proportion among the selected firms.

China has registered and launched more than 180 generative AI models that can provide services to the public, said an official from the Cyberspace Administration of China in August.

Out of over 1,300 AI large language models (LLMs) globally, China accounts for more than 30 percent, making it the second-largest contributor after the United States, according to a white paper on the global digital economy released in July by the China Academy of Information and Communications Technology.

Generative AI is set to add an estimated 7 trillion U.S. dollars to the global economy, with China expected to contribute nearly a third of this amount, or approximately 2 trillion dollars, according to a McKinsey report.

AI FOR CULTURAL PROTECTION

Beyond facilitating entertainment creation for online users, AIGC tools are being increasingly applied across diverse scenarios in China. The preservation and promotion of cultural heritage is one of them.

A home-grown generative AI tool called Jimeng, developed by ByteDance, has been employed to craft a fully AI-generated sci-fi short drama aimed at promoting ancient Chinese culture, the first of its kind in the country.

"Sanxingdui: Future Apocalypse," published in July, follows a near-future narrative in which protagonists venture into a digitally reconstructed ancient Shu kingdom dating back over 3,000 years to avert an impending civilization crisis.

The 12-episode series employed multiple generative technologies, including AI script-writing, concept and storyboard design, image-to-video conversion, video editing and media content enhancement.

In a separate cultural heritage project, ShengShu's AI engineers used the company's proprietary multimodal large-scale model to analyze extensive collections of ancient mural data from the Yongle Palace, the largest Taoist temple in China.

The 800-year-old temple's murals suffer from color fading, dust accumulation and deterioration, and their grand scale, distinctive style and rich intricacy significantly complicate restoration efforts.

The engineers have trained the AI with Chinese mural art data, allowing it to comprehend and replicate the distinctive style of those murals, from color to brush technique.

This enabled automated restoration tasks like digital coloring and filling in missing details, and the AI can mimic the brushwork of mural painters to redraw the facial features of deities in the murals, said Tang. ■
