Gartner predicts 40% of GenAI solutions will be ‘multimodal’ by 2027

Artificial Intelligence Published 8th October 2024

By 2027, 40% of generative AI (GenAI) solutions will be ‘multimodal’, i.e. able to produce a mix of text, image, audio and video, as a way to improve human interaction with the technology, according to industry analyst firm Gartner.


► GenAI solutions will generate a mix of text, images, audio and video to enhance human-AI interaction

► Opportunity for GenAI-enabled offerings to be differentiated


According to the firm, generating these multimodal responses will make GenAI solutions better at supporting people on tasks in more environments. Early adoption could bring a notable competitive advantage and time-to-market benefits, said the firm. In 2023, it noted, just 1% of GenAI solutions were multimodal. The technique will be used to create new features and functionality in apps.

The other key technology likely to have the highest impact potential within the next five years is open-source large language models (LLMs), said the firm. These will enable easier access for developers, and reduce both costs and the potential for vendor lock-in.

It also predicted that domain-specific GenAI models, designed for specific industries, business functions or tasks, will become more prominent, as will autonomous agents that are built to achieve defined goals without human intervention.