OpenAI releases 1 million token coding model GPT 4.1, available immediately via API

2 months ago

OpenAI has released GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano to its API suite, phasing retired GPT-4.5 Preview portion advancing codification generation, acquisition following, and long-context processing capabilities.

Essentially signaling the nonaccomplishment of GPT-4.5, the new  4.1 models present discourse windows of up to 1 cardinal tokens, enabling autochthonal handling of afloat repositories, extended documents, and analyzable multi-turn cause workflows wrong a azygous call.

While researching this article, I was capable to usage GPT-4.1 ‘vibe code,’ a elemental Python-based dungeon crawler, successful 5 minutes and 5 prompts. The exemplary made nary errors successful its code, with the lone issues related to identifying applicable sprites successful the plus atlas I imported.

Dungeon crawler demo built with GPT-4.1Dungeon crawler demo built with GPT-4.1

Due to its ample discourse window, it was besides capable to successfully place the functionality of a ample codification repo wrong a fewer prompts.

Model Capabilities and Transition Path

Per OpenAI, GPT-4.1 achieves a 54.6% people connected SWE-bench Verified, reflecting the improved quality to nutrient runnable codification patches that resoluteness real-world repository issues. This outpaces GPT-4o’s 33.2% and GPT-4.5’s 38% nether the aforesaid benchmark. The exemplary besides executes codification diffs much precisely, with 53% accuracy connected Aider’s polyglot benchmark successful diff format, much than doubling GPT-4o’s 18%.

Instruction-following fidelity is besides refined. On Scale’s MultiChallenge, GPT-4.1 reaches 38.3% accuracy, compared to 27.8% for GPT-4o. These improvements see adhering to strict output formats, complying with constraints, and pursuing nested oregon contradictory instructions.

According to the AI coding level Windsurf, interior evaluations amusement that GPT-4.1 produces cleaner diffs and is much aligned with structured developer workflows.

The models’ quality to process agelong contexts includes 1 cardinal token support, surpassing the erstwhile 128K token window.

To validate this, OpenAI released MRCR, an open-source valuation that tests a model’s quality to retrieve circumstantial details from wrong dense, distractor-heavy discourse blocks. GPT-4.1 scored 72% connected the long-video, no-subtitles class of the Video-MME benchmark, mounting a caller high.

Efficiency gains crossed the bid and cause use

The GPT-4.1 mini exemplary provides latency and outgo reductions portion maintaining comparable performance. OpenAI stated that GPT-4.1 mini reduces inference latency by astir 50% and outgo by 83% comparative to GPT-4o, with adjacent oregon superior scores connected aggregate quality evaluations.

Meanwhile, GPT-4.1 nano, optimized for low-latency tasks, achieves 80.1% connected MMLU, 50.3% connected GPQA, and 9.8% connected Aider’s polyglot coding. These gains transcend GPT-4o mini successful captious areas and presumption nano for usage successful classification, autocomplete, and reactive agentic systems.

There is nary added outgo for long-context usage crossed the GPT-4.1 series. Token usage follows modular API pricing, allowing developers to standard applications involving ample papers retrieval, repository comprehension, oregon implicit task editing without premium pricing tiers.

Improvements successful acquisition fidelity and discourse retention bolster the exemplary family’s viability for agentic applications. With OpenAI’s Responses API, developers tin deploy GPT-4.1-based systems to autonomously execute chained operations specified arsenic resolving lawsuit tickets, mining documents for insights, oregon operating crossed multi-step task environments.

GPT-4.5 Preview, launched arsenic a research-focused offering, volition beryllium sunset connected July 14. According to OpenAI, feedback from 4.5’s investigating signifier informed fine-tuning and deployment configurations, which are present embodied successful GPT-4.1. As such, GPT-4.1 is positioned arsenic a replacement way for developers utilizing 4.5 successful the API.

ChatGPT users volition proceed interacting with GPT-4o, wherever OpenAI is incrementally integrating instruction-following improvements. GPT-4.1 models, however, are API-exclusive.

Technical implications for code-first developers

The determination to standard the token discourse to 1 cardinal is apt a effect to Google’s Gemini 2.1 Pro model. It impacts developers managing ample monorepos, documentation-heavy domains, oregon multi-file dependency chains.

In summation to accepted inference, GPT-4.1’s upgraded token output limit, present up to 32,768 tokens, enables single-call afloat record rewrites, removing the request for post-processing oregon fragment merging.

Adherence to structured formats allows developers to optimize workflows astir minimal output procreation for codification diffs, cutting token costs and expanding strategy responsiveness.

According to OpenAI’s interior tests, GPT-4.1 has already demonstrated improved accumulation results crossed frontend development, ineligible parsing, and backend automation.

In comparative evaluations, paid graders preferred GPT-4.1-generated websites implicit GPT-4o results successful 80% of trial cases, citing superior functionality and clarity successful HTML, CSS, and JavaScript output.

GPT-4.1 mini and nano models widen these benefits to low-resource environments and latency-critical settings. The instauration of nano provides a fast-reacting, low-cost LLM susceptible of replacing larger models successful accelerated iteration pipelines, chat interfaces, oregon embedded dev tools.

Developers utilizing GPT-4.5 oregon GPT-4o mini are advised to measure migration paths now, arsenic GPT-4.1’s show and token economics favour its adoption successful astir deployment configurations. Model access, prompting guides, and updated benchmarks are disposable done the OpenAI developer platform.

Per OpenAI, GPT-4o and GPT-4o mini volition proceed to beryllium supported successful the API for the foreseeable future, but accent is being placed connected the GPT-4.1 enactment arsenic the preferred upgrade path.

The station OpenAI releases 1 cardinal token coding exemplary GPT 4.1, disposable instantly via API appeared archetypal connected CryptoSlate.

View source