OpenAI offers GPT-4o mini to slash the cost of applications

4 months ago 25
BOOK THIS SPACE FOR AD
ARTICLE AD
OpenAI on phone
Jaque Silva/SOPA Images/LightRocket via Getty Images

OpenAI on Thursday unveiled a stripped-down version of its GPT-4o large language model, GPT-4o mini, which the startup said has better accuracy than GPT-4, and costs dramatically less than GPT-3.5 Turbo when used by developers, for whom it can boost the development of applications that use the AI model extensively.

Also: Two ways you can build custom AI assistants with GPT-4o - and one is free!

The startup touts the new AI model as "the most cost-efficient small model in the market," although, as with most OpenAI releases, no technical details are available about GPT-4o mini (such as the number of parameters), so it's unclear what "small" means in this case.

(An "AI model" is the part of an AI program that contains numerous neural net parameters and activation functions that are the key elements for how an AI program functions.)

GPT-4o mini "is priced at 15 cents per million input tokens and 60 cents per million output tokens, an order of magnitude more affordable than previous frontier models and more than 60% cheaper than GPT-3.5 Turbo," OpenAI wrote in a blog post emailed to ZDNET. 

Also: I put GPT-4o through my coding tests and it aced them - except for one weird result

That reduction in cost, the startup said, will aid the development of applications that are affected by volume of activity. 

For example, applications that must make multiple application programming interface (API) calls, or that use larger "context windows" to retrieve materials (say, to retrieve an entire code-base when developing an app), or that have to interact frequently with the user, such as a help desk support bot, will benefit from the reduction in per-transaction cost, OpenAI said. 

The model outperforms the standard GPT-4 model when used as a chatbot, based on crowdsourced tests by the Lmsys leaderboard. It also "surpasses GPT-3.5 Turbo and other small models on academic benchmarks across both textual intelligence and multimodal reasoning," and supports as many languages as the standard GPT-4o model.

Also: How to use ChatGPT to digitize your handwritten notes for free

The new model is available immediately to developers via the Assistants API, Chat Completions API, and Batch API, and can be used instead of GPT-3.5 Turbo in ChatGPT's free, Plus, and Team accounts. 

GPT-4o mini offers only text and image support at the moment, with audio and video to be added at an unspecified date. The GPT-4o mini context window is 128,000 tokens, and its training data is current through October 2023. 

Editorial standards
Read Entire Article