日日噜噜噜夜夜爽爽狠狠22_中文字幕在线不卡_久久久伦理_久久综合激情网_曰批免费视频播放免费_狠狠做五月爱婷婷综合

position: EnglishChannel  > News> Chinese AI Model Emu3 Handles Text, Image, Video Seamlessly

Chinese AI Model Emu3 Handles Text, Image, Video Seamlessly

Source: Science and Technology Daily | 2024-10-24 17:41:46 | Author: Gong Qian

Emu3 text-to-image cases. (COURTESY PHOTO)

By GONG Qian

On October 21, the Beijing Academy of Artificial Intelligence (BAAI), a Chinese non-profit organization engaged in AI R&D, released Emu3, a multimodal AI model that seamlessly integrates text, image, and video modalities into a single, unified framework.

The BAAI research team said Emu3 is expected to be used in scenario applications such as robot brains, autonomous driving, multimodal dialogue and inference.

Emu3, based solely on next-token prediction, proves that next-token prediction can be a powerful paradigm for multimodal models.

The existing multimodal AI models are mostly designed for specific tasks. Each has its corresponding architecture and methods. For instance, in the field of video generation, many developers use the diffusion in time (DiT) architecture, as referenced by Sora. Other models such as Stable Diffusion are used for text-to-image synthesis, Sora for text-to-video conversion, and GPT-4V for image-to-text generation.

In contrast to these models, which have a combination of isolated skills rather than an inherently unified ability, Emu3, eliminates the need for diffusion or compositional approaches. By tokenizing images, text, and videos into a discrete space, BAAI has developed a single transformer from scratch.

Emu3 outperforms several well-established task-specific models in both generation and perception tasks, surpassing flagship models such as SDXL and LLaVA.

In September, BAAI open-sourced the key technologies and models of Emu3 including the chat model and generation model after supervised fine-tuning.

Emu3 has been receiving rave reviews from overseas developers. "For researchers, a new opportunity has emerged to explore multimodality through a unified architecture, eliminating the need to combine complex diffusion models with large language models. This approach is akin to the transformative impact of transformers in vision-related tasks," AI consultant Muhammad Umair said on social media platform Meta.

While next-token prediction is considered a promising path towards artificial general intelligence, it struggled to excel in multimodal tasks, which were dominated by diffusion models such as Stable Diffusion and compositional approaches like CLIP combined with large language models.

Raphael Mansuy, co-founder of QuantaLogic, an AI agent platform, thinks that Em3 has significant implications for Al development. Mansuy wrote on X that Em3's success suggests several key insights: Next-token prediction as a viable path to general multimodal Al; potential for simplified and more scalable model architectures; challenge to the dominance of diffusion and compositional approaches.

Editor:GONG Qian

Top News

Large Unmanned Cargo Aircraft Makes its Debut

China's domestically developed tonne-class large unmanned transport aircraft recently completed its maiden flight in Shandong province, marking a significant advancement in the field of high-end unmanned aviation equipment.

Open Scientific Infrastructure: Catalyst for Intl. Sci-tech Cooperation

It is necessary to promote the opening up and sharing of scientific research infrastructure, make good use of multilateral mechanisms, and establish and improve international open sharing platforms, Chen Jiachang, China’s vice minister of science and technology, said at the Open Science International Forum, part of the 2025 Zhongguancun Forum Annual Conference, on March 28.

抱歉,您使用的瀏覽器版本過低或開啟了瀏覽器兼容模式,這會影響您正常瀏覽本網頁

您可以進行以下操作:

1.將瀏覽器切換回極速模式

2.點擊下面圖標升級或更換您的瀏覽器

3.暫不升級,繼續瀏覽

繼續瀏覽
主站蜘蛛池模板: 麻豆专媒体一区二区 | 中文成人无码精品久久久 | 久久精品一区二区免费播放 | 品色堂永远免费论坛 | 最新中文字幕av无码专区不 | 日日躁你夜夜躁你av蜜 | 免费男人和女人牲交视频全黄 | 国产a∨国片精品青草视频 精品人妻无码一区二区三区毛片 | 久久精品人人槡人妻人人玩AV | 久久99精品久久久久久久清纯 | 日韩人妻无码一区二区三区久久 | 亚洲欧美日韩Aⅴ在线观看 亚洲AV成人无码久久精品老人 | h文纯肉教室啪啪 | 日本x视频| 国产精品久久自在自线青柠 | 国产成人无码A区视频在线观看 | 精品国精品国产自在久国产不卡 | 在线观看免费91 | 黄色影院国产 | 牛和人交xxxx欧美 | 日韩小片 | 四虎www4hv | 亚洲精品精华液一区二区 | 国产男女性潮高清免费网站 | 国产免费看又黄又大又污的胸 | 精品日韩欧美一区二区 | 亚洲熟妇久久国产精品 | 无码国产精品人妻一区二区 | av无码天堂一区二区三区, | 亚洲另类无码专区丝袜 | 无码少妇一区二区三区免费 | 欧美午夜一区二区福利视频 | 国产精品99久久99久久久动漫 | 久久久国产精品午夜一区ai换脸 | 377人体裸体露私图片 | 无码国产69精品久久久久孕妇 | 久久一及片 | 免费无码又爽又刺激a片涩涩软件 | 爆乳啪啪无码成人二区亚洲欧美 | 国产成年无码久久久免费 | 日韩av无码一区二区三区不卡毛片 |