https://www.techtarget.com/whatis/definition/GPT-4
GPT-4 is OpenAI's large multimodal language model that generates text from textual and visual input. Open AI is the American AI research company behind Dall-E, ChatGPT and GPT-4's predecessor GPT-3.
GPT-4 can handle more complex tasks than previous GPT models. The model exhibits human-level performance on many professional and academic benchmarks, including the Uniform Bar Exam. It was developed to improve alignment and scalability for large models of its kind.
GPT-4 stands for Generative Pre-Trained Transformer 4.
GPTs are machine learning algorithms that respond to input with human-like text. They have the following characteristics:
GPTs were introduced by OpenAI in a 2018 paper titled "Improving Language Understanding by Generative Pre-Training." This paper described GPT's semi-supervised learning model, which contrasted against other natural language processing models that used supervised learning and labeled data.
GPT processing power scales with the number of parameters the model has. Each new GPT model has more parameters than the previous one. GPT-1 has 0.12 billion parameters and GPT-2 has 1.5 billion parameters, whereas GPT-3 has more than 175 billion parameters. The exact number of parameters in GPT-4 is unknown but is rumored to be more than 1 trillion parameters.
GPT is the first large multimodal model of its kind. It is sometimes referred to as a next-gen model. GPT-4 Vision can turn image inputs into text.
In fall 2023, OpenAI rolled out GPT-4 Turbo, which provides answers with context up to April 2023. The previous knowledge cutoff for GPT-4 was January 2022. The release also increased the model’s context window and decreased pricing for developers. Developers with an OpenAI API account can access GPT-4 Turbo.
In May 2024, OpenAI introduced GPT-4 Omni (GPT-4o) with improvements including faster response times and advanced multimodal capabilities to recognize audio, image and text. Users can engage in real-time conversations with ChatGPT, and the GPT-4o can recognize screens and photos and ask questions about them while conversing with the user. The GPT-4o model will be available on consumer and developer products and will be free to all users.
Open AI has released relatively little information about the technical specifications of GPT-4. There is little information about the data used to train the system, the model size, the energy costs of the system, the hardware it runs on or the methods used to create it. OpenAI acknowledged this in the GPT-4 technical paper, which said they wouldn't release this information because of safety reasons and the highly competitive market. OpenAI did acknowledge that GPT-4 was trained on both publicly available data and data licensed from third parties.
GPT-4 -- and other GPTs -- are trained using reinforcement learning from human feedback. Models are rewarded for desired behavior or when they follow a set of rules. GPT-4 gets an extra safety reward during training to reduce harmful outputs. OpenAI tested GPT-4's accuracy on adversarial questions with the help of constitutional AI company Anthropic. A few example rules from Anthropic's constitution include the following:
As powerful generative AI models like OpenAI's are released to the public and companies restructure around them, conversations have emerged about AI alignment, ethics and regulation. Here are some other stories to read:
Reasons for and effects of Microsoft cutting AI ethics unit
Federal report focuses on AI diversity and ethics
Implications of AI art lawsuits for copyright laws
The accelerating use of generative AI may prompt U.S. action
OpenAI has released several demos that show GPT-4's capabilities. Some specific notable capabilities include the following:
Like any language model, GPT-4 still hallucinates information, gives wrong answers and produces buggy code in some instances. It may also still be susceptible to racial and gender bias.
GPT-3 is large language model, which means it performs language processing exclusively. GPT-4 is a large multimodal model that can process image and text inputs. OpenAI emphasizes the goal of GPT-4 was to scale up deep learning.
Some other ways the two models differ include the following:
GPT-4 was released March 14, 2023. In an ABC news interview days after its release, OpenAI CEO Sam Altman said, "We've got to be cautious here, and also, it doesn't work to do all of this in a lab. You've got to get all of these products out into the world and make contact with reality, make our mistakes while the stakes are low. All of that said, I think people should be happy that we're a little bit scared of this."
The newest version of GPT-4 -- GPT-4o -- was announced in May 2024.
There are two main ways of accessing GPT-4 as of this writing:
Developers can also use the API on a pay-per-use basis.
Users can also evaluate the model. Open AI CEO Sam Altman tweeted on March 14, 2023 that the company is open sourcing an evaluation framework.
GPT-4 was not free. However, with the new GPT-4o model, OpenAI announced it will be free to ChatGPT users, so no subscription is required for ChatGPT Plus. Other features included in the original subscription to GPT-4 -- such as memory and web browsing -- are also free to consumers. There is a fee for developers to use the API of $5 per 1 million tokens for input and $15 per 1 million tokens for output.
GPT-4 was a milestone AI release that came at the beginning of 2023. Check out these 10 AI trends to prepare for what the rest of the year might bring.
14 May 2024