More transparency and control over Gemini API costs

More transparency and control over Gemini API costs

Usage levels: Less friction and more transparency as you scale

We’ve completely revamped our Usage Tiers to give you higher capacity faster. While we rely on these tiers to manage overall load and help ensure fair API access, your progression through them is now automated and transparent. Here’s what’s changing:

  • Lower consumption qualifications: To make it easier for users with a strong payment history to get higher quotas, we’re also reducing the spending qualifications for higher tiers.
  • Automatic and faster upgrades: The system now automatically upgrades you to the next level as your spending grows and your payment history matures. You get access to higher rate limits and increased monthly fees as soon as the criteria are met.
  • Level limit for billing account: Each spend level will now have a maximum monthly spend limit ($) enforced across your entire billing account (similar to other platforms in the industry). This system-defined cap increases automatically as you upgrade to higher levels, and works independently of any custom project spending caps you set yourself.

You can see the usage level limits along with the new criteria in our docs and discover how different levels affect your speed limits directly in Google AI Studio.

Improved invoicing flow with improved observability and control

Over the past few months, we’ve rolled out a series of updates to Google AI Studio to improve our billing experience, observability, and cost management, with the goal of providing developers with an easier and more transparent experience with our paid services. Here’s what’s new:

  • New billing setup directly in Google AI Studio: You can now configure your billing profile and link it to your projects directly from the settings, ensuring you can scale your application more seamlessly as your needs grow. No more jumping between 3 different windows and tabs.
  • New rate limit dashboard: The dashboard gives you a clear overview of your progress towards speed limits for each project imported into Google AI Studio. You can monitor usage against three key metrics: Requests Per Minute (RPM), Tokens Per Minute (TPM) and Requests Per Day (RPD), view and filter graphs of these metrics to identify traffic spikes and explore speed limits across different models.
  • New price dashboard: To help you manage your budget, we’ve also launched a daily cost graph in the billing dashboard. This tool provides a transparent overview of your consumption, so you can track costs per project over different time frames – from the last 7 days to the whole month and filter by model.
  • New usage dashboard: An extended, comprehensive view of your system’s performance. In addition to standard request counts, you can now dive into error metrics, token usage, and specific generation statistics. We’ve also added dedicated graphs for Imagen and Veo requests per day, in addition to tools like Grounding with Google Search and Maps.

We hope these updates help you build more confidently with the Gemini API, and we’ll continue to make improvements to provide a more reliable and transparent service.

Leave a Reply

Your email address will not be published. Required fields are marked *