Google Gemini: Everything you need to know about the generative AI apps and models
What is Gemini?
Gemini is Google's highly anticipated next-generation family of generative AI models, developed through a collaboration between DeepMind and Google Research. It's designed to be versatile, coming in various sizes to cater to different needs:
- Gemini Ultra: A powerhouse model, designed for the most complex tasks.
- Gemini Pro: A robust model, with the latest version, Gemini 2.0 Pro, being Google's current flagship.
- Gemini Flash: A faster, streamlined version of Pro, perfect for quick tasks.
- Gemini Flash-Lite: Even smaller and faster than Flash, it's built for efficiency.
- Gemini Flash Thinking: A specialized version with enhanced reasoning capabilities.
- Gemini Nano: Consists of two compact models, Nano-1 and Nano-2, the latter capable of running offline.
One of the key features of Gemini is its multimodal nature. Unlike earlier models like Google's LaMDA, which were limited to text, Gemini models have been trained on a diverse dataset including audio, images, videos, code, and text in multiple languages. This allows them to not only process but also generate various types of content, setting them apart in the AI landscape.
However, it's worth noting the ethical and legal concerns surrounding the use of public data for training these models. Google offers an AI indemnification policy, but it's not a blanket protection, so if you're considering using Gemini for commercial purposes, tread carefully.
What’s the difference between the Gemini apps and Gemini models?
The Gemini models are the brains behind the operation, while the Gemini apps serve as user-friendly interfaces to access these models. These apps, available on web and mobile platforms (formerly known as Bard), act as front ends similar to ChatGPT or Anthropic's Claude. They offer a chatbot-like experience, allowing users to interact with Gemini's capabilities through a familiar interface.

Image Credits: Google
On Android, the Gemini app has taken over from the Google Assistant, and on iOS, it's integrated into the Google and Google Search apps. Android users can even summon a Gemini overlay to interact with content on their screens, such as YouTube videos, by pressing the power button or using voice commands.
The apps support a range of inputs, including images, voice commands, and text, and can even generate images. Conversations are synced across devices if you're signed into the same Google Account.
Gemini Advanced
Beyond the basic apps, Gemini Advanced offers enhanced features for a monthly fee of $20 as part of the Google One AI Premium Plan. This plan integrates Gemini into Google Workspace apps like Gmail, Docs, Maps, and more, allowing for advanced tasks like email composition, document editing, and even generating slides.

Image Credits: Google
Gemini Advanced users enjoy perks like priority access to new features, the ability to run and edit Python code directly in the app, and increased limits for tools like NotebookLM. A recent addition, the memory feature, helps Gemini remember user preferences and past conversations, enhancing the user experience. One standout feature, Deep Research, uses advanced reasoning to create detailed briefs on complex topics.
Gemini in Gmail, Docs, Chrome, dev tools, and more
Gemini's integration extends to various Google services. In Gmail and Docs, it offers side panels for tasks like email composition and document refinement. In Slides, it generates custom images and slides, while in Sheets, it helps with data organization and formula creation.

Image Credits: Google
Gemini also enhances Google Maps with personalized recommendations and aggregates reviews. In Drive, it can summarize files and provide quick insights. In Chrome, it acts as an AI writing tool, adapting to the context of the webpage you're on. Gemini's influence reaches into Google's security and development tools, as well as apps like Photos, YouTube, and Meet, where it supports natural language searches and translations.
Gemini extensions and Gems
For Gemini Advanced users, the ability to create Gems is a unique feature. These are custom chatbots powered by Gemini models, which can be tailored to specific tasks like creating a daily running plan. Gems can be shared or kept private, adding a personal touch to AI interactions.

Image Credits: Google
Gemini apps also leverage "Gemini extensions" to integrate with Google services like Drive, Gmail, and YouTube, allowing for seamless interaction and information retrieval across platforms.
Gemini Live in-depth voice chats
Gemini Live offers a unique experience for voice interactions, available in the Gemini apps on mobile and the Pixel Buds Pro 2. It allows for real-time, adaptive conversations, where you can interrupt Gemini to ask questions or seek clarification. This feature is designed to help with tasks like job interview preparation and public speaking practice.

Image Credits: Google
Gemini for teens
Google has also introduced a teen-focused version of Gemini, designed for students. It includes additional safety measures and an AI literacy guide but otherwise offers a similar experience to the standard version, including the "double-check" feature for accuracy.
What can the Gemini models do?
Given their multimodal capabilities, Gemini models can handle a variety of tasks, from speech transcription to real-time image and video captioning. Google is constantly expanding these capabilities, promising even more in the future.
However, like all generative AI, Gemini isn't without its challenges, such as biases and the potential to generate inaccurate information. It's important to be aware of these limitations when using or considering paying for Gemini services.
Gemini Pro’s capabilities
The latest iteration, Gemini 2.0 Pro, excels in coding and handling complex prompts, outperforming its predecessor in various benchmarks. Developers can customize it through Google's Vertex AI platform, tailoring it to specific contexts and integrating it with third-party data or APIs. Google's AI Studio also offers tools for creating structured prompts and adjusting safety settings.
Gemini Flash is lightweight, while Gemini Flash Thinking adds reasoning
Gemini 2.0 Flash, designed for efficiency, is ideal for tasks like summarization and data extraction, while Gemini 2.0 Flash-Lite offers even better performance at the same price point. The "thinking" version of Gemini 2.0 Flash enhances reliability by taking time to reason through problems before responding.
Gemini Nano can run on your phone
Gemini Nano is designed to run directly on devices, enhancing privacy and offline functionality. It powers features like Summarize in Recorder and Smart Reply in Gboard on devices like the Pixel 8 series and Samsung Galaxy S24. Future versions of Android will use Nano for scam detection during calls, and it's already enhancing weather reports and accessibility features.

Image Credits: Google
Gemini Ultra, MIA for now
While Gemini Ultra hasn't been in the spotlight recently, it remains a part of Google's plans, potentially returning with new capabilities in the future.
How much do the Gemini models cost?
The pricing for Gemini models through the Gemini API is structured as follows:
- Gemini 1.5 Pro: $1.25/$2.50 per million input tokens and $5/$10 per million output tokens, depending on prompt length.
- Gemini 1.5 Flash: 7.5/15 cents per million input tokens and 30/60 cents per million output tokens, depending on prompt length.
- Gemini 2.0 Flash: 10 cents per million input tokens and 40 cents per million output tokens, with audio input at 70 cents per million tokens.
- Gemini 2.0 Flash-Lite: 7.5 cents per million input tokens and 30 cents per million output tokens.
Pricing for Gemini 2.0 Pro and Nano has yet to be announced.
Is Gemini coming to the iPhone?
There's potential for Gemini to make its way to the iPhone. Apple has expressed interest in integrating Gemini and other third-party models into its Apple Intelligence suite, though specifics are still under wraps following discussions at WWDC 2024.
This post was originally published on February 16, 2024, and is regularly updated to reflect the latest developments.
Related article
Imagen 4:谷歌最新AI圖像生成器
Google近日發表最新圖像生成AI模型「Imagen 4」,宣稱將為用戶帶來比前代Imagen 3更出色的視覺體驗。本週稍早在Google I/O 2025大會亮相的這款新模型,被譽為在畫質與多樣性方面取得重大突破。Google表示,Imagen 4特別擅長處理織物質感、水珠反光與動物毛髮等精細紋理,同時能輕鬆駕馭寫實與抽象風格。其輸出解析度最高可達2K,
谷歌Gemini代碼助手強化AI編程代理功能
Google旗下AI程式開發助手Gemini Code Assist近期推出全新「代理模式」功能,目前開放預覽體驗。在最新Cloud Next大會上,Google展示這些AI代理如何突破性處理複雜編程任務——從Google文件規格書直接生成完整應用程式,或輕鬆實現跨語言程式碼轉換。更令人驚豔的是,開發者現可在Android Studio等整合開發環境中直接啟
谷歌的人工智慧未來基金可能需要謹慎行事
Google 的新 AI 投資計劃:監管審查下的戰略轉變Google 最近宣布設立 AI 未來基金(AI Futures Fund),這標誌著這家科技巨頭在其塑造人工智慧未來的征程中邁出了大膽的一步。該計劃旨在為初創公司提供急需的資金、早期接觸仍在開發中的尖端人工智慧模型,以及來自 Google 內部專家的指導。儘管這不是 Google 第一次涉足初創企業生
Comments (15)
0/200
FrankMartínez
April 24, 2025 at 12:00:00 AM GMT
Google Gemini is pretty cool, but it's a bit overwhelming with all the different models! I like the Gemini Ultra for its power, but I wish there was a simpler version for everyday use. Still, it's impressive what it can do! 🤯
0
JackMartin
April 25, 2025 at 12:00:00 AM GMT
Google Geminiはすごく面白いけど、モデルがたくさんあって少し混乱するね!Gemini Ultraのパワーは好きだけど、日常的に使えるもっとシンプルなバージョンが欲しいな。でも、できることがすごい!🤯
0
StevenAllen
April 25, 2025 at 12:00:00 AM GMT
Google Gemini는 꽤 멋지지만, 다양한 모델 때문에 조금 혼란스러워요! Gemini Ultra의 강력함은 좋지만, 일상적으로 사용할 수 있는 더 간단한 버전이 있었으면 좋겠어요. 그래도 할 수 있는 일이 대단해요! 🤯
0
WilliamMiller
April 24, 2025 at 12:00:00 AM GMT
Google Gemini é bem legal, mas é um pouco confuso com todos esses modelos diferentes! Gosto do Gemini Ultra pela sua potência, mas gostaria que houvesse uma versão mais simples para uso diário. Ainda assim, é impressionante o que ele pode fazer! 🤯
0
StevenGreen
April 25, 2025 at 12:00:00 AM GMT
Google Gemini es bastante genial, pero es un poco abrumador con todos los diferentes modelos. Me gusta el Gemini Ultra por su potencia, pero desearía que hubiera una versión más simple para el uso diario. Aún así, es impresionante lo que puede hacer! 🤯
0
WalterSanchez
April 24, 2025 at 12:00:00 AM GMT
Google Gemini is pretty cool! It's like having a super smart AI buddy that can handle all sorts of tasks. The different sizes are awesome, but I wish the Ultra version was more accessible. Still, it's a game-changer for sure! 🤓
0
What is Gemini?
Gemini is Google's highly anticipated next-generation family of generative AI models, developed through a collaboration between DeepMind and Google Research. It's designed to be versatile, coming in various sizes to cater to different needs:
- Gemini Ultra: A powerhouse model, designed for the most complex tasks.
- Gemini Pro: A robust model, with the latest version, Gemini 2.0 Pro, being Google's current flagship.
- Gemini Flash: A faster, streamlined version of Pro, perfect for quick tasks.
- Gemini Flash-Lite: Even smaller and faster than Flash, it's built for efficiency.
- Gemini Flash Thinking: A specialized version with enhanced reasoning capabilities.
- Gemini Nano: Consists of two compact models, Nano-1 and Nano-2, the latter capable of running offline.
One of the key features of Gemini is its multimodal nature. Unlike earlier models like Google's LaMDA, which were limited to text, Gemini models have been trained on a diverse dataset including audio, images, videos, code, and text in multiple languages. This allows them to not only process but also generate various types of content, setting them apart in the AI landscape.
However, it's worth noting the ethical and legal concerns surrounding the use of public data for training these models. Google offers an AI indemnification policy, but it's not a blanket protection, so if you're considering using Gemini for commercial purposes, tread carefully.
What’s the difference between the Gemini apps and Gemini models?
The Gemini models are the brains behind the operation, while the Gemini apps serve as user-friendly interfaces to access these models. These apps, available on web and mobile platforms (formerly known as Bard), act as front ends similar to ChatGPT or Anthropic's Claude. They offer a chatbot-like experience, allowing users to interact with Gemini's capabilities through a familiar interface.
On Android, the Gemini app has taken over from the Google Assistant, and on iOS, it's integrated into the Google and Google Search apps. Android users can even summon a Gemini overlay to interact with content on their screens, such as YouTube videos, by pressing the power button or using voice commands.
The apps support a range of inputs, including images, voice commands, and text, and can even generate images. Conversations are synced across devices if you're signed into the same Google Account.
Gemini Advanced
Beyond the basic apps, Gemini Advanced offers enhanced features for a monthly fee of $20 as part of the Google One AI Premium Plan. This plan integrates Gemini into Google Workspace apps like Gmail, Docs, Maps, and more, allowing for advanced tasks like email composition, document editing, and even generating slides.
Gemini Advanced users enjoy perks like priority access to new features, the ability to run and edit Python code directly in the app, and increased limits for tools like NotebookLM. A recent addition, the memory feature, helps Gemini remember user preferences and past conversations, enhancing the user experience. One standout feature, Deep Research, uses advanced reasoning to create detailed briefs on complex topics.
Gemini in Gmail, Docs, Chrome, dev tools, and more
Gemini's integration extends to various Google services. In Gmail and Docs, it offers side panels for tasks like email composition and document refinement. In Slides, it generates custom images and slides, while in Sheets, it helps with data organization and formula creation.
Gemini also enhances Google Maps with personalized recommendations and aggregates reviews. In Drive, it can summarize files and provide quick insights. In Chrome, it acts as an AI writing tool, adapting to the context of the webpage you're on. Gemini's influence reaches into Google's security and development tools, as well as apps like Photos, YouTube, and Meet, where it supports natural language searches and translations.
Gemini extensions and Gems
For Gemini Advanced users, the ability to create Gems is a unique feature. These are custom chatbots powered by Gemini models, which can be tailored to specific tasks like creating a daily running plan. Gems can be shared or kept private, adding a personal touch to AI interactions.
Gemini apps also leverage "Gemini extensions" to integrate with Google services like Drive, Gmail, and YouTube, allowing for seamless interaction and information retrieval across platforms.
Gemini Live in-depth voice chats
Gemini Live offers a unique experience for voice interactions, available in the Gemini apps on mobile and the Pixel Buds Pro 2. It allows for real-time, adaptive conversations, where you can interrupt Gemini to ask questions or seek clarification. This feature is designed to help with tasks like job interview preparation and public speaking practice.
Gemini for teens
Google has also introduced a teen-focused version of Gemini, designed for students. It includes additional safety measures and an AI literacy guide but otherwise offers a similar experience to the standard version, including the "double-check" feature for accuracy.
What can the Gemini models do?
Given their multimodal capabilities, Gemini models can handle a variety of tasks, from speech transcription to real-time image and video captioning. Google is constantly expanding these capabilities, promising even more in the future.
However, like all generative AI, Gemini isn't without its challenges, such as biases and the potential to generate inaccurate information. It's important to be aware of these limitations when using or considering paying for Gemini services.
Gemini Pro’s capabilities
The latest iteration, Gemini 2.0 Pro, excels in coding and handling complex prompts, outperforming its predecessor in various benchmarks. Developers can customize it through Google's Vertex AI platform, tailoring it to specific contexts and integrating it with third-party data or APIs. Google's AI Studio also offers tools for creating structured prompts and adjusting safety settings.
Gemini Flash is lightweight, while Gemini Flash Thinking adds reasoning
Gemini 2.0 Flash, designed for efficiency, is ideal for tasks like summarization and data extraction, while Gemini 2.0 Flash-Lite offers even better performance at the same price point. The "thinking" version of Gemini 2.0 Flash enhances reliability by taking time to reason through problems before responding.
Gemini Nano can run on your phone
Gemini Nano is designed to run directly on devices, enhancing privacy and offline functionality. It powers features like Summarize in Recorder and Smart Reply in Gboard on devices like the Pixel 8 series and Samsung Galaxy S24. Future versions of Android will use Nano for scam detection during calls, and it's already enhancing weather reports and accessibility features.
Gemini Ultra, MIA for now
While Gemini Ultra hasn't been in the spotlight recently, it remains a part of Google's plans, potentially returning with new capabilities in the future.
How much do the Gemini models cost?
The pricing for Gemini models through the Gemini API is structured as follows:
- Gemini 1.5 Pro: $1.25/$2.50 per million input tokens and $5/$10 per million output tokens, depending on prompt length.
- Gemini 1.5 Flash: 7.5/15 cents per million input tokens and 30/60 cents per million output tokens, depending on prompt length.
- Gemini 2.0 Flash: 10 cents per million input tokens and 40 cents per million output tokens, with audio input at 70 cents per million tokens.
- Gemini 2.0 Flash-Lite: 7.5 cents per million input tokens and 30 cents per million output tokens.
Pricing for Gemini 2.0 Pro and Nano has yet to be announced.
Is Gemini coming to the iPhone?
There's potential for Gemini to make its way to the iPhone. Apple has expressed interest in integrating Gemini and other third-party models into its Apple Intelligence suite, though specifics are still under wraps following discussions at WWDC 2024.
This post was originally published on February 16, 2024, and is regularly updated to reflect the latest developments.



Google Gemini is pretty cool, but it's a bit overwhelming with all the different models! I like the Gemini Ultra for its power, but I wish there was a simpler version for everyday use. Still, it's impressive what it can do! 🤯




Google Geminiはすごく面白いけど、モデルがたくさんあって少し混乱するね!Gemini Ultraのパワーは好きだけど、日常的に使えるもっとシンプルなバージョンが欲しいな。でも、できることがすごい!🤯




Google Gemini는 꽤 멋지지만, 다양한 모델 때문에 조금 혼란스러워요! Gemini Ultra의 강력함은 좋지만, 일상적으로 사용할 수 있는 더 간단한 버전이 있었으면 좋겠어요. 그래도 할 수 있는 일이 대단해요! 🤯




Google Gemini é bem legal, mas é um pouco confuso com todos esses modelos diferentes! Gosto do Gemini Ultra pela sua potência, mas gostaria que houvesse uma versão mais simples para uso diário. Ainda assim, é impressionante o que ele pode fazer! 🤯




Google Gemini es bastante genial, pero es un poco abrumador con todos los diferentes modelos. Me gusta el Gemini Ultra por su potencia, pero desearía que hubiera una versión más simple para el uso diario. Aún así, es impresionante lo que puede hacer! 🤯




Google Gemini is pretty cool! It's like having a super smart AI buddy that can handle all sorts of tasks. The different sizes are awesome, but I wish the Ultra version was more accessible. Still, it's a game-changer for sure! 🤓












