OpenAI’s first-ever developer conference, DevDay, unveiled some of the most powerful advancements in the field of Artificial Intelligence.
At Haptik, we’re excited about the features announced at OpenAI’s inaugural developer conference and here’s what the new features will mean for enterprises.
GPT-4 as we knew it was one of the most powerful models so far. However, it was very expensive and many enterprises chose to limit their usage to GPT-3.5 Turbo to ensure cost-effectiveness and scalability of their applications. This is poised to change now.
GPT-4 Turbo, a new iteration of the GPT-4 model, was preview-launched as a more capable and knowledgeable version updated to April 2023. GPT-4 Turbo is priced 3x cheaper than GPT-4. This allows all businesses to be able to really improve their application by using a much better performing model without putting a dent on the budgets.
In conjunction with GPT-4 Turbo, OpenAI has reduced the costs of its entire API range, further incentivizing businesses to integrate these powerful tools into their systems.
GPT-4 Turbo got a few technical upgrades especially beneficial for enterprise grade assistants which run at scale:
- The new model introduces "seed" for reproducible outcomes. This allows the enterprise assistants to be more consistent and repeatable which is important to deliver a good experience at scale.
- The APIs also now give probabilities of outcomes, giving the developer insights into accuracies of the replies. This makes the bots more transparent and allows for better analytics and reporting.
- The function-calling feature now boasts of more powerful API capabilities which was a hindrance before to be able to call multiple APIs, a common occurrence for enterprises.
Assistants API and Retrieval
Perhaps the most exciting of the new features is the Assistants API. It empowers businesses to create bespoke Assistants with minimal configuration and coding effort. These assistants are not only capable of digesting documents and executing API calls but can also write and execute code. This innovation is poised to democratize bot creation, enabling the rapid development of simple yet potent bots.
For enterprises, Assistants API could be a boon for automating mundane internal workflows, leading to unprecedented productivity gains. However, it's important to note that while the Assistants API excels in crafting straightforward bots, given the complex nature of customer experience workflows that require calling of multiple prompts in an order, the solution is not effective enough for large-scale, customer experience automation workflows.
Dawn of Multi-Modality - Images, Text-to-Speech & More
Text-only Assistants are a thing of the past, thanks to GPT-4 Vision. GPT-4 Vision has ushered in a new paradigm where bots can not only interpret but also generate images. This multi-modal capability significantly improves the user experience, allowing enterprise bots to not just view images, but also perform more sophisticated tasks involving image analysis and creation.
Complementing this visual prowess, OpenAI has also advanced text-to-speech and speech-to-text technologies, which are set to empower the next wave of voice applications for enterprises. These advancements are not just incremental improvements; they represent the maturation of technology necessary to create truly effective voice applications.
Looking Ahead with OptimismThe announcements offer more than a glimpse into the vision of OpenAI in making AI a versatile enabler while keeping costs under control. In his keynote at DevDay 2023, Sam Altman, OpenAI’s CEO said: “We believe that AI is going to be a technological and societal revolution. It will change the world in many ways, and we’re happy to get to work on something that will empower you to build so much for all of us.”
Overall, we at Haptik firmly believe that these enhancements will enable us to create even more potent enterprise Assistants and deliver markedly improved experiences to end users.