https://openai.com/index/gpt-4-1/
OpenAI's new model update is a response to Google's new Gemini models, which all have a 1 million token context window and follow instructions more accurately.
What interests us most is that, according to OpenAI's own tests, GPT-4.1 has become better at code generation. In other words, if 4o produced decent code for one request in three, 4.1 will manage it for every second one 😉.
https://aider.chat/docs/leaderboards/
The article compares the model only with OpenAI's own models. For a broader comparison, see the Aider LLM Leaderboards, where it scores 52.4%, while Gemini 2.5 Pro Preview 03-25 scores 72.9%.
In Cursor, gpt-4.1 is now available in the model settings.
This update matters most for GitHub Copilot (GPT-4.1 is already available there), because its agent and chat were originally tied to OpenAI's GPT-4 model, and the free plan still offers Claude Sonnet 3.5 rather than 3.7.
VS Code Live: Agent Mode Day takes place tomorrow, and I expect they will share more details there.
#newllmmodel #githubcopilot