Chinese AI startup takes aim at OpenAI’s Sora with image-to-video tool launch

Pictured here is an AI-generated clip from Vidu’s website. The tool can create videos from text or image prompts.

Evelyn Cheng | CNBC

BEIJING - Beijing-based Shengshu Technology on Wednesday said that its artificial intelligence-powered text-to-video tool Vidu will now be able to generate videos by combining images.

Vidu already allows users worldwide to create 8-second clips based on written prompts. While OpenAI, the maker of ChatGPT, revealed in February that its AI model Sora could generate one-minute videos from text, it has yet to release that model publicly.

Vidu’s new AI feature can combine three pictures, such as a shirt, a person and a moped, into a video of the person wearing the shirt and riding the moped through a scene, Shengshu said.

Other platforms claim they can turn text or images into videos using AI, but the quality of output varies. The breakthrough that Shengshu claims is the ability to take three distinct images and integrate them with visual consistency into an AI-generated video.

“Very early on we pinpointed [visual consistency] as the problem, and wanted to solve it well,” Fan Bao, chief technology officer at Shengshu, said in Mandarin, translated by CNBC.

Vidu launched in April, and its ability to turn two profile photos into lifelike videos of people hugging went viral on TikTok.

The AI video generator is already making money from advertisers, animators and other businesses, Shengshu co-founder and CEO Jiayu Tang said in Mandarin, according to a CNBC translation. He said monthly usage fees per customer can range from 100,000 yuan to 1 million yuan ($13,871 to $138,711).

To address copyright issues, Tang said a company could sign a deal with an artist that allows the AI to mimic the artist’s style of painting for an advertisement. He said he hadn’t seen significant legal cases around clients’ use of images.

Tang added that Vidu doesn’t allow the public to generate content using images of celebrities or “sensitive” individuals. He said the AI tool also bans nudes and violent images. As for personal photos, Tang said Vidu destroys the data in accordance with the General Data Protection Regulation, a global benchmark.

Shengshu was founded last year with backers including Baidu Ventures, Alibaba-affiliate Ant Group, Chinese startup Zhipu AI, Qiming Venture Partners and Beijing city, according to PitchBook.

Tang said Vidu’s AI runs off rented cloud servers in China and abroad.
