OpenAI has released GPT-4, an advanced multimodal model with improved performance on human-like tasks and a significantly expanded token processing capacity, now immediately available to ChatGPT Plus users and soon to be accessible via API.
Abstract
The latest iteration from OpenAI, GPT-4, has been launched, marking a significant update to the GPT series. This model introduces multimodal capabilities, allowing it to handle both text and image inputs and outputs. It is a substantial leap forward from its predecessors, with a default token processing limit of 8,000 tokens, which can be extended to 32,000 tokens under certain conditions. This enables GPT-4 to tackle more extensive and complex tasks, such as writing lengthy documents or engaging with large datasets. The model has demonstrated superior performance on standardized tests, scoring in the top 10% across various subjects, including the GRE and LSAT. GPT-4's enhanced abilities have been made available first to ChatGPT Plus subscribers, with a broader rollout expected following the launch of its API, which will facilitate integration into various products and services.
Opinions
Early adopters of ChatGPT Plus are seen as receiving significant value for their subscription fee due to immediate access to GPT-4.
The multimodal nature of GPT-4 is considered a major advancement, with its ability to process images suggesting a broader range of applications beyond text-based tasks.
OpenAI's decision to provide API access to developers is anticipated to lead to a rapid adoption of GPT-4 in a variety of products and services.
The substantial increase in token processing capability is viewed as a breakthrough that will enable more sophisticated uses of the AI model, such as processing large documents or datasets.
GPT-4's improved performance on human-like tasks, including standardized tests, is seen as a testament to its advanced understanding and problem-solving capabilities.
5 Crucial Things to Know About GPT-4
ChatGPT is about to get way more powerful
Screenshots via the author
After a long wait, OpenAI finally released the new GPT-4 model, a major update to the basic framework underlying their wildly popular ChatGPT system, as well as their GPT-3.5 API.
There’s a lot to unpack in the new GPT-4 release. Based on early examples, API documentation and code samples provided by OpenAI, here are five crucial things to know about the new model.
If you were one of the early users to fork over $20 to join OpenAI’s ChatGPT Plus service, you’re about to get a major reward — and some serious return on your investment.
ChatGPT Plus users are the first people with access to the new GPT-4 platform. If you subscribe to ChatGPT Plus, you can head over and begin using GPT-4 right away.
Lots of people asked why someone would pay money to subscribe to the Plus service when ChatGPT is free. Apparently, so that you can get fast access to new model updates!
It’s Multimodal
Before GPT-4 came out, there was wild speculation about whether it would remain a text-only model like ChatGPT, or would become a multimodal model.
Multimodal models are able to work with a wide range of media types as both their output and input — everything from text to images and ultimately video.
At the moment, GPT-4 appears to support images for both its inputs and outputs. Initially, this capability is available only to a single third-party company which is helping OpenAI test image processing.
Once the system is faster, images as input will become available to more users.
But OpenAI has some examples of how it could ultimately play out. One example includes a photo of eggs and flour, with a cooking-related query.
GPT-4 recommends recipes that could be made with the ingredients shown in the picture.
The model could also be used to caption images, or write amazing alt text for images on websites. Video isn’t available yet, but it’s likely on the way now that GPT-4 is multimodal.
There’s an API
OpenAI will provide API access to the new model almost immediately. The company released a waitlist upon launching GPT-4, and said that some developers would get access on the day of the launch.
Expect many companies to start integrating GPT-4 into their products almost immediately. Many are already integrated with existing APIs from OpenAI, so the switch to GPT-4 isn’t hard to implement.
It Can Process Way More Data
The original ChatGPT could process 4,096 tokens. That’s about 5,000 words worth of text.
That limitation included both the text in the prompt given to ChatGPT and the system’s output. The limits prevented the system from analyzing big documents, or writing long blog posts — or even books.
The limitation was likely a cost/computation based limitation. The more tokens you add to a large language model, the more computation the model needs to perform in order to process the text, and thus the more expensive it is to run.
GPT-4 expands these limitations dramatically. By default, GPT-4 can handle 8,000 tokens right out of the box. It can also support up to 32,000 tokens. That’s about 50 pages of text.
Processing more data will allow the system to process far more instructions, writer longer articles, and perhaps even write very long documents or full literary works.
It’s Better at Human-Like Tasks and Tests
ChatGPT running GPT3.5 could pass human-oriented exams like the bar exam — but only just.
The system tended to get a score on the lower end of passing — a C- or thereabouts. GPT-4 has been trained to perform these human-like tests and tasks much better.
The model now performs as well as a top student on many standard exams. AP Environmental Science exams, GRE exams, and even the LSAT — GPT-4 can easily score in the top 10% of them all.
It’s still terrible at English lit. But tasks like math exams — where GPT-3.5 stuggled — are much improved with GPT-4.
Next Steps
If you want to use GPT-4, there are a few ways you can do so:
Sign up for ChatGPT Plus. GPT-4 should be available through the paid ChatGPT already.
Sign up for the waitlist for access to the GPT-4 API. Developers should get access over the next few days/weeks.
Want more coverage of cutting-edge generative AI? Subscribe to me on Medium and follow The Generator.