Date: July 19, 2024
OpenAI has launched a smaller and cheaper version of its flagship GPT-4o for web and mobile app consumers, including developers.
OpenAI has launched GPT-4o mini, a smaller and cheaper version of GPT-4o. The small AI model outperforms existing cutting-edge AI models. The mini version was released yesterday for developers, web, and mobile application users of ChatGPT. The AI model will be released for Enterprise users by next week.
For tasks that involve text and vision reasoning, GPT-4o mini outperforms Gemini 1.5 Flash, Llama 3 (70B), NeMo, GPT-3.5 Turbo, Reka Edge, and many more competitive smaller models. An independent Artificial Analysis shows a glaring difference in the performance results between GPT-4o and other available small AI models. GPT-40 mini scored 82% on MMLU, a benchmark to measure reasoning, compared to 79% for Gemini 1.5 Flash and 75% for Claude 3 Haiku.
“For every corner of the world to be empowered by AI, we need to make the models much more affordable. I think GPT-4o mini is a really big step forward in that direction”
- OpenAI’s head of Product API
GPT-4o mini will also replace GPT-3.5 Turbo as the smallest AI model offered by the company. It is still unclear if the existing users of Turbo will be shifted to GPT-4o mini or not. The company claims that its offering is much more affordable and consumes less power. Mini comes with text and vision capabilities, but the company says that video and audio capabilities will be added soon.
“Relative to comparable models, GPT-4o mini is very fast, with a median output speed of 202 tokens per second. This is more than 2X faster than GPT-4o and GPT-3.5 Turbo and represents a compelling offering for speed-dependent use-cases including many consumer applications and agentic approaches to using LLMs,” said George Cameron, Co-Founder at Artificial Analysis, in an email to a tech media house.
For developers, GPT-4o mini is priced at 15 cents per million input tokens and 60 cents per million output tokens. The model comes with a context window of 128,000 tokens that transforms roughly to the length of a book. OpenAI has not revealed how big the GPT-4o mini actually is but claims that it is similar to other small AI models.
By Arpit Dubey
Arpit is a dreamer, wanderer, and tech nerd who loves to jot down tech musings and updates. Armed with a Bachelor's in Business Administration and a knack for crafting compelling narratives and a sharp specialization in everything from Predictive Analytics to FinTech—and let’s not forget SaaS, healthcare, and more. Arpit crafts content that’s as strategic as it is compelling. With a Logician mind, he is always chasing sunrises and tech advancements while secretly preparing for the robot uprising.
Reddit Unveils AI-Powered Search Tool for Smarter Results
Reddit launched Reddit Answers, an AI-powered search tool that curates and summarizes discussions to enhance user experience and reduce reliance on Google.
OpenAI Scraps o3 Model, Pushes for Unified GPT-5 in a Major AI Overhaul
OpenAI is canceling its o3 AI model and merging it into GPT-5 for a simpler, more powerful system. A big move to stay ahead in the AI race.
Virtual Reality in Healthcare: Revolutionizing Patient Care
Experience the power of virtual reality in healthcare as it transforms medical training, patient care, and treatment methods with immersive technology for better accuracy, efficiency, and improved outcomes.
Google I/O 2025: Dates Announced for the Tech Giant’s Biggest Event of the Year
Google I/O 2025 is set for May 20-21! Expect big AI reveals, Android 16 updates, and more. Registrations are open for keynotes, demos, and game-changing tech innovations!