OpenAI Introduces GPT-5.4: A Model That Can Control Your Computer

OpenAI has announced the launch of GPT-5.4, the latest version of its flagship artificial intelligence model. The main feature of the release, presented on March 5, 2026, is built-in (native) computer control capabilities. This means the model can independently perform tasks on a user's device: work with applications, manage a browser, execute actions using keyboard and mouse commands, all while analyzing screen screenshots. The model is already available in the OpenAI API, in the Codex AI programming tool, and its special reasoning version, GPT-5.4 Thinking, is being integrated into ChatGPT.

This step marks a transition from chatbots to autonomous agents—a goal pursued by all leading AI companies. Last year, the market already saw a surge in similar tools, including OpenAI's ChatGPT Agent, capable of, for example, searching for and purchasing goods online. GPT-5.4 lays the foundation for a future where a network of AI agents will operate in the background, performing complex multi-step operations across various programs and online services without constant human supervision.

From a technical standpoint, GPT-5.4 combines improvements in several key areas. The model handles professional tasks related to documents, spreadsheets, and presentations better. It uses tools and external APIs more accurately and efficiently. Particular attention is paid to web search: the model can conduct 'persistent' searches across multiple sources to answer complex 'needle-in-a-haystack' questions and synthesize information into a structured response. OpenAI claims that GPT-5.4 is its 'most factually accurate model to date': the likelihood of false statements in its responses is 33% lower than in GPT-5.2.

In the ChatGPT interface for Plus, Team, and Pro subscribers, the GPT-5.4 Thinking model is being integrated. Its distinctive feature is the demonstration of 'thought processes' when solving complex queries: the model shows a work plan, and the user can make adjustments directly during the response generation without restarting the conversation. This feature is already working in the web application and on Android, with an iOS release expected soon. For corporate and educational users (ChatGPT Enterprise, Edu) and via the API, the GPT-5.4 Pro version is offered, optimized for maximum performance on complex tasks.

For the industry and end users, the release of GPT-5.4 means an acceleration in the automation of routine digital tasks. In the future, this could lead to the emergence of personal AI assistants that will independently handle accounting, prepare reports, manage orders, or plan trips, directly interacting with software. For developers, new opportunities are opening up to create complex agent applications via the API. However, this also raises acute questions about cybersecurity, privacy, and the level of trust in systems that gain direct access to a user's computer.

The immediate prospects will be related to debugging and scaling this technology, as well as forming ethical and technical standards for AI agents. The question of how widely and quickly such agents will enter everyday life remains open. The success of GPT-5.4 as an agent platform will depend on its reliability, cost of use, and the ability of partner companies to build an ecosystem of useful and safe applications around it. This release brings us closer to a future where AI becomes not just a conversational partner, but an active digital executor.

OpenAI Introduces GPT-5.4: A Model That Can Control Your Computer

Discussion 0

Related Articles