Microsoft’s new AI agent can control software and robots

February 20, 2025

On Wednesday, Microsoft Research introduced Magma, an integrated AI foundation model that combines visual and language processing to control software interfaces and robotic systems. If the results hold up outside of Microsoft’s internal testing, it could mark a meaningful step forward for an all-purpose multimodal AI that can operate interactively in both real and digital spaces.

Microsoft claims that Magma is the first AI model that not only processes multimodal data (like text, images, and video) but can also natively act upon it—whether that’s navigating a user interface or manipulating physical objects. The project is a collaboration between researchers at Microsoft, KAIST, the University of Maryland, the University of

→ Continue reading at Ars Technica

Comments

Get Ready, Foodies! Newport’s 2025 Seafood & Wine Festival Begins Today

Fast-growing Variational AI raises US$5.5M to bring technology to market

Microsoft’s new AI agent can control software and robots

Related articles

Comments

Share article

Latest articles

This Vegan Fast-Food Restaurant in a Former McDonald’s Is Out to Change the World

Statistics Canada reports $1.5B trade deficit for February as exports fell

Prime Minister Mark Carney says Canada will match U.S. auto tariffs

Where to Easter Brunch (and Tea, and Dinner) in Seattle

What Happened at Mount Baker’s Oculis Lodge?

Bodega cats make New Yorkers’ hearts purr, even if they violate state regulations