The smartest artificial intelligence on Earth?
On the local date of [specific date], the model hailed by Musk as "the most intelligent artificial intelligence on Earth" was officially released. This model, developed by Musk's artificial intelligence company, represents a new generation of large language models. Its introduction has once again sent shockwaves through the global artificial intelligence community.
Musk's company unveiled the model in a live broadcast that included on-stage demonstrations. At the launch event, Musk introduced it as the first model to surpass a certain score in the (.) arena evaluation, ranking first in all categories and "superior to any known released product."
Because of Musk's immense influence on public opinion and the sustained pre-launch hype, global anticipation for the event reached an unprecedented level. The launch not only drew the attention of millions but was also described as a profound shift in the field of artificial intelligence.
Dubbed "the smartest artificial intelligence on Earth," it is essentially an advanced conversational chatbot that operates primarily on social media platforms, offering users in-depth interaction and real-time access to information. Musk said the name comes from Heinlein's novel "Stranger in a Strange Land," whose protagonist is a human raised on Mars, and that the term signifies a comprehensive and profound understanding of things.
The world's richest person confidently stated that it is "the most intelligent artificial intelligence on Earth," achieving significant breakthroughs in reasoning, programming, multimodal analysis, and other areas, surpassing mainstream models including - and -.
To demonstrate the new model's capabilities, Musk's team ran several live tests at the launch event. First, they had the model write a game that combines Tetris and Bejeweled using " ". In the demonstration, the generated game code successfully merged Tetris's line-clearing mechanism with Bejeweled's matching elements.
Next, the model was asked to write complex physics simulation code. By simulating a ball bouncing inside a rotating hypercube (a four-dimensional cube), it showed that it could handle a demanding programming task and a physics simulation quickly.
The most impressive test was the simulation of a spacecraft mission, in which the model generated animation code depicting a launch from Earth, a landing on Mars, and a return to Earth using a Hohmann transfer orbit. A working animation was produced quickly, showing the relative positions of the Sun, Earth, Mars, and the spacecraft throughout the mission. The difficulty of this test lay in the extensive calculations it required across numerous mathematical and physical models. According to the presentation, no large model had previously been capable of calculating launch windows for space missions.
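To give a sense of the calculations behind such a demonstration, the following is a minimal sketch of the textbook Hohmann transfer between Earth and Mars. It is not the code shown at the launch event. It assumes circular, coplanar orbits around the Sun, and the constants and function names (`hohmann_transfer_time`, `launch_phase_angle_deg`, and so on) are illustrative choices, not taken from the demonstration.

```python
import math

# Illustrative constants (assumed values, rounded from standard references).
MU_SUN = 1.32712440018e20   # Sun's gravitational parameter, m^3/s^2
AU = 1.495978707e11         # astronomical unit, m
R_EARTH = 1.0 * AU          # Earth's orbital radius, treated as circular
R_MARS = 1.523679 * AU      # Mars's orbital radius, treated as circular
DAY = 86400.0               # seconds per day


def orbital_period(r):
    """Period of a circular orbit of radius r around the Sun, in seconds."""
    return 2.0 * math.pi * math.sqrt(r ** 3 / MU_SUN)


def hohmann_transfer_time(r1, r2):
    """Half the period of the transfer ellipse touching both orbits, in seconds."""
    a_transfer = (r1 + r2) / 2.0
    return math.pi * math.sqrt(a_transfer ** 3 / MU_SUN)


def departure_delta_v(r1, r2):
    """Speed change needed to leave the inner circular orbit, in m/s."""
    a_transfer = (r1 + r2) / 2.0
    v_circular = math.sqrt(MU_SUN / r1)
    v_transfer = math.sqrt(MU_SUN * (2.0 / r1 - 1.0 / a_transfer))
    return v_transfer - v_circular


def launch_phase_angle_deg(r1, r2):
    """Angle by which the target must lead the departure planet at launch."""
    target_rate = 2.0 * math.pi / orbital_period(r2)  # rad/s
    swept = target_rate * hohmann_transfer_time(r1, r2)
    return math.degrees(math.pi - swept)


def synodic_period(r1, r2):
    """Time between repeats of the same planetary alignment, in seconds."""
    return 1.0 / abs(1.0 / orbital_period(r1) - 1.0 / orbital_period(r2))


if __name__ == "__main__":
    print("Transfer time (days):", hohmann_transfer_time(R_EARTH, R_MARS) / DAY)
    print("Departure delta-v (m/s):", departure_delta_v(R_EARTH, R_MARS))
    print("Launch phase angle (deg):", launch_phase_angle_deg(R_EARTH, R_MARS))
    print("Launch window spacing (days):", synodic_period(R_EARTH, R_MARS) / DAY)
```

Under these simplifying assumptions, a launch window opens whenever Mars leads Earth by the computed phase angle, and windows recur once per synodic period. A real mission plan would also account for orbital eccentricity, inclination, and planetary gravity, which this sketch ignores.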
In addition, one of the standout features is the integration of the world's first "Dynamic Thought Chain." This technology mimics human cognitive and reasoning processes, enabling multi-step reasoning that follows a "hypothesis-reflection-correction" sequence. It also incorporates a dynamic reflection mechanism to correct logical errors, thereby effectively handling complex problems.
It is also equipped with a self-correcting mechanism that uses multiple verification steps to reflect on its own mistakes and reach logical consistency. The aim is to minimize the probability of generating false or meaningless information.
Of course, this "most intelligent AI" depends on a robust computational foundation. Its development relied on a supercomputer built in just a few months and powered by tens of thousands of NVIDIA units, which has accumulated over a hundred million hours of training time, ten times that of its predecessor. This leap in computing power lets the model process large datasets far more efficiently, shortening training cycles while improving accuracy. The energy consumed by training at this scale is said to be equivalent to the annual electricity consumption of a medium-sized city.
Why was the release timed for now? The original plan was to officially release it by the end of the year, but it was postponed several times due to various issues. On a certain month and day, Musk stated at the World Government Summit that the release was "one to two weeks" away, aiming to provide the best user experience. However, plans often change, and the launch event was held earlier than scheduled.
Competition is intensifying. The core reason for the early release is the mounting pressure on the company to stake out a position in the fiercely competitive global artificial intelligence landscape. Before the release, the United States was still regarded as "far ahead" in the development of large models. However, the emergence of this model has shown that a large model with globally leading capabilities can be developed at low cost and with modest computing power, which has badly shaken Silicon Valley's sense of technological superiority.
Under this pressure, the makers of flagship models are accelerating their iterations, announcing that a new generation of artificial intelligence models, - and -, will be launched in the coming months. Google has also released the . series of models, which improve coding and reasoning capabilities, are fully open for use, and cost less. Previously launched versions have fallen far behind after these rapid iterations. In such a fiercely competitive market, if Musk did not speed up his entry, it would hurt both his market share and his next stage of development.
The financing pressure is immense. As is widely known, developing large AI models burns money like a black hole. Given the scale of the training process, the R&D expenses are likely to be astronomical. Earlier this month, informed sources revealed that Musk is in talks with potential investors, planning to raise hundreds of millions of dollars in funding. If successful, the valuation could reach billions of dollars, providing substantial financial support for subsequent development. Musk's early launch event may well be part of this strategy. If the launch can shock the world the way other large models have, it will surely spark a new frenzy in the capital markets and make the company a prized target for major investors.
The escalating rivalry between Musk and OpenAI is also a key factor in the early release. On a certain date, an investor consortium led by Musk proposed a $1 billion acquisition of OpenAI and submitted a formal takeover bid. This brought the conflict between Musk and OpenAI's CEO, Altman, to a head. Altman's response was blunt: he rejected the proposal with a hint of sarcasm.
OpenAI is currently valued in the billions of dollars and is expected to receive an investment of billions from Japan's SoftBank Group, which would push its valuation higher still. A lawyer for the board stated that, after review, the board concluded that Musk is not sincerely interested in an acquisition but has ulterior motives: every time he makes an offer, the board, as trustees, is obliged to review and consider it. Musk's offer therefore looks more like a harassment tactic, and the early release is likely intended to compete and settle old scores, proving "I am right, he is wrong."
Can it truly become the best? According to the data presented on-site, it has surpassed all current mainstream models in benchmarks for mathematics, science, and programming. Musk even claimed that it will be used for Mars mission calculations in the future and predicted "Nobel Prize-level breakthroughs within three years." However, these are currently just Musk's own statements.
After the launch, some netizens tested the latest version by posing a classic trick question for large models: "Which is larger, . or .?" Asked plainly, with no added context or hints, the model touted as the smartest available still failed to answer correctly. Testers abroad have tried similar questions, such as "Which ball falls first from the Leaning Tower of Pisa?" The model also stumbled on these basic physics and math problems, leading some to jokingly call it "a genius unwilling to answer simple questions."
In actual testing, the model has stumbled on many common-sense questions. For such a "genius," whatever its real capabilities, its reliability in extremely complex applications such as future Mars exploration missions is highly questionable. Many testers who were granted early access a few weeks ago have reached the same conclusion about its performance: it is good, but it is not better than - or --. From this perspective, although the release was high-profile, the claim to be the "strongest on Earth" does not hold up so easily.