Updated June 26, 2024 to reflect recent updates.
For most of the early 21st century, Google was just about unanimously considered the king of AI. However, with OpenAI’s release of ChatGPT to the public at the end of 2022 and the whirlwind of AI innovation that followed, Google has taken the backseat. Due to the impressive 1M context window and competitive pricing, Gemini could be their chance to take the lead.
Google Gemini
Google’s generative AI offerings have been hard to follow. In December 2023, not even one year after Bard (Google’s former and short-lived flagship LLM) was announced, Gemini was announced as its “largest and most capable model” and since then, there have been rapid and significant developments.
February 2024, in particular, was a busy month for updates and rebranding efforts. The biggest of which was the rebranding of Bard to Gemini. Gemini builds upon and surpasses Bard in terms of functionalities and scalability. February also saw the consolidation of Duet AI for Workspace, the AI tools used to integrate with Google productivity tools (i.e. Gmail, Docs, and Sheets), and Duet AI for Developers into the Gemini framework.
Gemini is accessible via Google AI Studio or Google Cloud Vertex AI.
Google Gemini Models
Model Family | Functionalities | Max Tokens |
---|---|---|
Gemini 1.0 Pro | Text generation, translation, Q&A, code completion. | 32k tokens |
Gemini 1.0 Ultra | Highly complex tasks, including reasoning, problem-solving, and multimodal interactions. | 8k tokens |
Gemini 1.0 Ultra Vision | Gemini 1.0 Ultra functionalities, processing and understanding visual data. | 8k tokens |
Gemini 1.5 Pro | Comprehensive language and code functionalities, with increased focus on reasoning and adaptation. | 1M tokens |
Gemini 1.5 Flash | Gemini 1.5 Pro functionalities, with quicker response times. | 1M tokens |
Azure OpenAI GPT
OpenAI almost needs no introduction. Since the release of ChatGPT to the public in November 2022, OpenAI GPT models have seen widespread adoption across various sectors. Its impact on the AI landscape has been substantial, catalyzing advances in research and development and fostering increased awareness, with many people referring to AI chatbots and ChatGPT synonymously.
Azure OpenAI is a partnership between Azure and OpenAI that enables Azure users to use OpenAI (including OpenAI GPT models) via an API, Python SDK, or their web-based interface, while authenticated with their Azure cloud credentials. Azure OpenAI distinguishes itself from OpenAI by offering co-developed APIs, enhanced security, and private networking. Throughout this article, “GPT models” refers exclusively to Azure OpenAI GPT models for the sake of brevity.
Azure OpenAI GPT Models
Model | Stated Use Cases | Max Tokens |
---|---|---|
GPT-3.5 Turbo | Advanced complex reasoning and chat, code understanding and generation, and traditional completions tasks. | 4k and 16k tokens |
GPT-4 | Advanced complex reasoning and chat, advanced problem solving, code understanding and generation, and traditional completions tasks. | 8k and 32k tokens |
GPT-4 Turbo | GPT-4 capabilities, instruction following, and parallel function calling. | 128k tokens |
GPT-4 Turbo with Vision | GPT-4 Turbo capabilities, image analysis, and Q&A. | 128k tokens |
GPT-4o | GPT-4 Turbo with Vision capabilities. Faster responses. | 128k tokens |
Google Gemini vs Azure OpenAI GPT Models Functionality
Gemini has been met with skepticism from users, due in part to Bard falling short of expectations. Gemini 1.0 has received mixed reviews from users, with some saying it falls short of GPT-4 and some saying they prefer it over GPT-4. However, the highly anticipated Gemini 1.5 and its massive context window are receiving glowing accolades from early adopters, with predictions that it is poised to surpass GPT-4. Aside from comparing benchmarks of the two, we can compare them at a service and model level.
Google AI vs Azure AI Service Comparison
As far as service offerings, Google and Azure provide varying levels of support.
- Documentation/Community: Based on anecdotal assessments, the documentation for both services is abysmal. Users of both are complaining about missing information and instructions that are hard to follow. Some users are even complaining that Google’s documentation is incorrect. This is likely because both services and the models within the services are so new and constantly changing.
- Accessibility: Both services can be accessed through APIs and cloud-based studios. Additionally, Azure OpenAI offers an SDK for developers. Both studios provide user-friendly interfaces, which can be particularly helpful for users with limited technical experience.
-
Fine-Tuning Models: Supervised fine-tuning for Gemini 1.0 Pro is in preview. With Azure, you can fine-tune GPT-3.5 Turbo.
-
Context Caching: Certain Gemini models support context caching for a separate charge, enabling you to cache repetitive input/output results easily and possibly save money.
-
Data Use: Neither Google nor Azure uses customer data to train their AI models.
Gemini vs GPT Model Comparison
As far as the actual models go, there are several quantitative factors to consider.
- Max Tokens: The max token amount varies per model, but both Gemini and GPT models offer similar options, ranging from 8k to 128k tokens. However, Gemini 1.5 takes the overwhelming lead with an unprecedented 1M token context window, providing users with an exceptional capacity for processing large amounts of data. To put it into perspective, 1M tokens correspond to an immense amount of data, equating to approximately “1 hour of video, 11 hours of audio, codebases with over 30,000 lines of code, or over 700,000 words.”
- Supported Regions: Availability may be model and feature-specific. Check Google and Azure to see if your region is supported.
- Supported Languages: Gemini can be used in over 40 different languages, all of which are clearly listed here. It is less clear which languages are supported for the GPT models. OpenAI has stated you can use the GPT models with multiple languages, however, they are optimized for English.
- Training Data Date: Gemini models have a knowledge cutoff of November 2023. The training data date is model-specific for GPT models, GPT-3.5 Turbo and GPT-4 were trained up to September 2021 while the GPT-4 Turbo versions were trained until April 2023.
Google Gemini Models Pricing
Pay-As-You-Go
Charges vary for different model types and input types.
Model | Price per 1000 Input Tokens | Price per 1000 Output Tokens |
---|---|---|
Gemini 1.0 Pro | 0.0005 | 0.0015 |
Gemini 1.0 Ultra | Unknown | Unknown |
Gemini 1.0 Ultra Vision | Unknown | Unknown |
Gemini 1.5 Pro | $0.0035 (for prompts up to 128K tokens) $0.007 (for prompts over 128K tokens) |
$0.0105 (for prompts up to 128K tokens) $0.021 (for prompts over 128K tokens) |
Gemini 1.5 Flash | $0.00035 (for prompts up to 128K tokens) $0.0007 (for prompts over 128K tokens) |
$0.00105 (for prompts up to 128K tokens) $0.00210 (for prompts over 128K tokens) |
Azure OpenAI GPT Models Pricing
Charges for GPT models are fairly simple. It is a pay-as-you-go, with no commitment. There are additional customization charges. Price varies per region and is shown for the US East region.
Pay-As-You-Go
Charges vary for different model types and context.
Model | Context | Price per 1000 Input Tokens | Price per 1000 Output Tokens |
---|---|---|---|
GPT-3.5 Turbo Instruct | 4k | $0.0015 | $0.002 |
GPT-3.5 Turbo | 16k | $0.0005 | $0.0015 |
GPT-4 | 8k | $0.03 | $0.06 |
GPT-4 | 32k | $0.06 | $0.12 |
GPT-4 Turbo | 128k | $0.01 | $0.03 |
GPT-4 Turbo With Vision | 128k | $0.01 | $0.03 |
GPT-4o | 128k | $0.005 | $0.015 |
Fine-Tuning
Fine-tuning charges are based on training tokens and hosting time.
Model | Context | Training per 1000 tokens | Price for Hosting per Hour |
---|---|---|---|
GPT-3.5 Turbo | 4k | $0.008 | $3 |
GPT-3.5 Turbo | 16k | $0.008 | $3 |
Pricing Comparison Gemini vs GPT Models
Based on benchmarks, user feedback, and use cases the most comparable models are:
Gemini 1.5 Pro vs GPT 4o Pricing
Gemini 1.5 Pro is the cost-effective choice compared to GPT-4o. Both the input and output tokens are priced 30% lower than GPT-4o’s.
Model | 1000 Input Tokens | 1000 Output Tokens |
---|---|---|
Gemini 1.5 Pro | $0.0035 (for prompts up to 128K tokens) | $0.0105 (for prompts up to 128K tokens) |
GPT-4o | $0.005 | $0.015 |
Gemini 1.0 Pro vs GPT-4 Pricing
Gemini 1.0 Pro is significantly less expensive than GPT-4. Its input tokens are priced 99.17% lower than GPT-4’s and its output tokens are 98.75% lower. This dramatic price difference makes Gemini 1.0 Pro a good option for users looking to optimize costs.
Model | 1000 Input Tokens | 1000 Output Tokens |
---|---|---|
Gemini 1.0 Pro | 0.0005 | 0.0015 |
GPT-4 | $0.06 | $0.12 |
Conclusion
The GPT models are still widely considered to be the most powerful AI models available on the market. However, based on the extraordinarily competitive pricing of Gemini, as well as the 1M context window with Gemini 1.5, Google has positioned itself as a serious competitor in the realm of AI.
Monitor your Google & Azure costs.