What happened
OpenAI rolled out new embedding models (text-embedding-3-small and text-embedding-3-large), preview updates to GPT-4 Turbo (gpt-4-0125-preview), a refreshed GPT-3.5 Turbo variant (gpt-3.5-turbo-0125), and a moderation model (text-moderation-007). The embeddings show big quality gains on MIRACL (small: 31.4% → 44.0%; large: 54.9% vs the Dec‐2022 text-embedding-ada-002 baseline) and the small model is about 5× cheaper; GPT-3.5 input price fell 50% and output price 25%. The GPT-4 Turbo preview fixes UTF‐8/non‐English encoding bugs and improves code completion and format compliance.
Why this matters
Market impact — cost and performance shift. Lower-cost, higher-quality embeddings and cheaper GPT-3.5 Turbo change the economics of retrieval‐heavy and high‐throughput applications (RAG systems, vector databases, chat copilots). Improved multilingual handling and UTF‐8 fixes reduce friction for global developers. Better format adherence (JSON, function calls) and larger context handling in GPT‐4 Turbo previews support more complex, production-oriented workflows. At the same time, fragmentation across model variants, potential brittleness at higher embedding dimensionality, and moderation trade‐offs (false positives/negatives across languages) are practical risks for teams migrating pipelines. OpenAI also reports that 70% of GPT‐4 API requests have moved to GPT‐4 Turbo since its release, indicating rapid adoption of the Turbo family.
Practical next steps for engineering and product teams: benchmark new models against existing workloads (latency, cost, multilingual accuracy), plan vector store compatibility for varied embedding dimensions, and test moderation behavior in target languages and domains.
Sources
- OpenAI — announcement: New embedding models and API updates (as cited in the article) — https://openai.com/index/new-embedding-models-and-api-updates/
- Infoworld — coverage of GPT Turbo updates and fixes — https://www.infoworld.com/article/2335949/openai-unveils-new-embedding-models-gpt-turbo-updates.html
- GizChina — coverage of new embedding models and pricing — https://www.gizchina.com/tech/openai-models-api-upgrade
(Article text was provided to the assistant for this brief.)