Market reaction
Meta Platforms shares climbed 7% on Wednesday following the public introduction of Muse Spark, the company’s newest artificial intelligence model. The stock move occurred alongside broad strength across technology equities and coincided with the debut of the model, which Meta identifies as the initial offering in its Muse family created by Meta Superintelligence Labs.
Model capabilities and availability
Muse Spark is designed with native multimodal reasoning capabilities. That includes support for tool use, a visual chain of thought, and multi-agent orchestration. The company has made the model accessible via meta.ai and through the Meta AI app, and it has opened a private API preview to a select group of users.
Contemplating mode and benchmark results
Alongside Muse Spark, Meta introduced a feature called Contemplating mode, which runs multiple agents that reason at the same time. In benchmark testing cited by the company, the mode produced a 58% score in Humanity’s Last Exam and a 38% score in FrontierScience Research.
Engineering and efficiency
Meta said it rebuilt its pretraining stack over a nine-month period, addressing model architecture, optimization routines, and data curation. The company reports it reached the same capabilities as its previous model, Llama 4 Maverick, while using over an order of magnitude less compute.
Applications and health work
The company highlighted multimodal visual integration across domains and health reasoning among Muse Spark’s applications. For health-related outputs, Meta said it collaborated with more than 1,000 physicians to help develop training data. The model is capable of producing interactive displays that explain nutritional content and illustrate which muscles are engaged during exercise.
Safety testing and third-party evaluation
Under Meta’s Advanced AI Scaling Framework, Muse Spark was evaluated for safety in high-risk areas and demonstrated strong refusal behavior for content related to biological and chemical weapons. Third-party testing by Apollo Research reportedly found Muse Spark exhibited the highest rate of evaluation awareness among models they have examined.
Note: The information above reflects details provided by the company regarding the model, availability, engineering work, benchmarks, health collaborations, and safety assessments.