Microsoft Launches First In-House AI Image Generator, MAI-Image-1
Microsoft today launched MAI-Image-1, its first internally developed AI model for generating images, and integrated it into Bing Image Creator and Copilot Audio Expressions.
The company first announced the model in October. Microsoft AI chief Mustafa Suleyman wrote on X that it will be “coming soon” to the EU, and noted that the model “really excels at” creating images of food, nature, artistic lighting, and photorealistic detail. MAI-Image-1 is now listed alongside DALL-E 3 and GPT-4o, both from OpenAI, as one of the available models on Bing’s image creator website and app.
According to Microsoft, MAI-Image-1 excels at generating photorealistic imagery, including lighting effects and landscapes, particularly when compared with many larger, slower models. The company says this combination of speed and quality lets users get their ideas on screen faster and iterate on them quickly. The model will also power AI-generated artwork accompanying audio stories created within the “story mode” of Copilot’s text-to-speech platform, Copilot Audio Expressions.
The introduction of MAI-Image-1 signals Microsoft’s growing investment in in-house AI development, which could reduce its reliance on external partners such as OpenAI. Suleyman stated the company intends to continue expanding access to the model, and further refinements are expected based on user feedback.
Microsoft’s first in-house AI image generator, MAI-Image-1, is now available in two products, Bing Image Creator and Copilot Audio Expressions. The company announced the model in October. Microsoft AI chief Mustafa Suleyman wrote in a post on X that the text-to-image model will be “coming soon” to the EU.
Suleyman added that the model “really excels at” generating images of food and nature scenes, as well as artsy lighting and photorealistic detail.
Microsoft previously posted more details on its blog: “MAI-Image-1 excels at generating photorealistic imagery, like lighting (e.g., bounce light, reflections), landscapes, and much more. This is particularly so when compared to many larger, slower models. Its combination of speed and quality means users can get their ideas on screen faster, iterate through them quickly, and then transfer their work to other tools to continue refining.”
Microsoft’s MAI-Image-1 will also create AI-generated art to accompany AI-generated audio stories in the “story mode” of Copilot’s text-to-speech platform, Copilot Audio Expressions.
MAI-Image-1 is listed as one of the three AI models available on Bing’s image creator website and app. The other two models, DALL-E 3 and GPT-4o, are from OpenAI.