Google

Google Gemini provider for GoAI - Gemini models via the Google AI REST API.

For Vertex AI (GCP-managed Gemini), see the Vertex provider.

Setup

bash

go get github.com/zendev-sh/goai@latest

Set the GOOGLE_GENERATIVE_AI_API_KEY (or GEMINI_API_KEY) environment variable, or pass it explicitly:

import "github.com/zendev-sh/goai/provider/google"

model := google.Chat("gemini-2.5-flash", google.WithAPIKey("..."))

The provider also reads GOOGLE_GENERATIVE_AI_BASE_URL from the environment when no explicit base URL is set.

Models

Model ID	Type	Notes
`gemini-2.5-flash`	Chat	Fast, thinking-capable
`gemini-2.5-flash-lite`	Chat	Lightweight variant
`gemini-2.5-pro`	Chat	Most capable, thinking-capable
`gemini-3-flash-preview`	Chat	Next-gen, thinking-capable
`gemini-3-pro-preview`	Chat	Next-gen, thinking-capable
`gemini-3.1-pro-preview`	Chat	Next-gen, thinking-capable
`gemini-2.0-flash`	Chat	Previous gen, no thinking
`text-embedding-004`	Embedding	768 dimensions
`imagen-4.0-generate-001`	Image	Imagen via :predict endpoint
`imagen-4.0-fast-generate-001`	Image	Imagen fast variant via :predict endpoint
`gemini-2.5-flash-image`	Image	Gemini image via generateContent

Tested Models

E2E tested with real API calls. Last run: 2026-03-15.

Model	Generate	Stream	Status
`gemini-2.5-flash`	PASS	PASS	Stable
`gemini-2.5-flash-lite`	PASS	PASS	Stable
`gemini-2.5-pro`	PASS	PASS	Stable
`gemini-3-flash-preview`	PASS	PASS	Stable
`gemini-3-pro-preview`	PASS	PASS	Stable
`gemini-3.1-pro-preview`	PASS	PASS	Stable
`gemini-2.0-flash`	PASS	PASS	Stable
`gemini-flash-latest`	PASS	PASS	Stable
`gemini-flash-lite-latest`	PASS	PASS	Stable

Unit tested models: gemini-2.5-flash, gemini-2.5-flash-image, imagen-4.0-generate-001, imagen-4.0-fast-generate-001, text-embedding-004.

Usage

Chat

import (
    "context"
    "fmt"

    "github.com/zendev-sh/goai"
    "github.com/zendev-sh/goai/provider/google"
)

func main() {
    model := google.Chat("gemini-2.5-flash")
    result, err := goai.GenerateText(context.Background(), model,
        goai.WithPrompt("Explain Go interfaces in one paragraph."),
    )
    if err != nil {
        panic(err)
    }
    fmt.Println(result.Text)
}

Streaming

import (
    "context"
    "fmt"

    "github.com/zendev-sh/goai"
    "github.com/zendev-sh/goai/provider"
    "github.com/zendev-sh/goai/provider/google"
)

model := google.Chat("gemini-2.5-flash")
stream, err := goai.StreamText(context.Background(), model,
    goai.WithPrompt("Write a haiku about Go."),
)
if err != nil {
    panic(err)
}
for chunk := range stream.Stream() {
    if chunk.Type == provider.ChunkText {
        fmt.Print(chunk.Text)
    }
}

Embedding

import (
    "context"
    "fmt"

    "github.com/zendev-sh/goai"
    "github.com/zendev-sh/goai/provider/google"
)

model := google.Embedding("text-embedding-004")
result, err := goai.Embed(context.Background(), model, "Hello world")
if err != nil {
    panic(err)
}
fmt.Println(len(result.Embedding)) // 768

Google-specific embedding options (under the "google" key in ProviderOptions):

result, err := goai.Embed(ctx, model, "Hello world",
    goai.WithEmbeddingProviderOptions(map[string]any{
        "google": map[string]any{
            "taskType":             "RETRIEVAL_DOCUMENT",
            "outputDimensionality": 256,
        },
    }),
)

Image Generation (Imagen)

import (
    "context"
    "fmt"

    "github.com/zendev-sh/goai"
    "github.com/zendev-sh/goai/provider/google"
)

model := google.Image("imagen-4.0-fast-generate-001")
result, err := goai.GenerateImage(context.Background(), model,
    goai.WithImagePrompt("A serene mountain lake at sunset"),
)
if err != nil {
    panic(err)
}
fmt.Printf("Generated %d bytes\n", len(result.Images[0].Data))

Image Generation (Gemini)

Gemini image models use the generateContent endpoint with responseModalities: ["IMAGE"]:

model := google.Image("gemini-2.5-flash-image")
result, err := goai.GenerateImage(ctx, model,
    goai.WithImagePrompt("A cartoon cat programming in Go"),
)

Options

Option	Type	Description
`WithAPIKey(key)`	`string`	Static API key. Falls back to `GOOGLE_GENERATIVE_AI_API_KEY` or `GEMINI_API_KEY` env var.
`WithTokenSource(ts)`	`provider.TokenSource`	Dynamic token resolution.
`WithBaseURL(url)`	`string`	Override base URL. Falls back to `GOOGLE_GENERATIVE_AI_BASE_URL` env var.
`WithHeaders(h)`	`map[string]string`	Additional HTTP headers on every request.
`WithHTTPClient(c)`	`*http.Client`	Custom HTTP client for proxies, logging, URL rewriting.

Provider Options (via `goai.WithProviderOptions`)

Google-specific options are nested under the "google" key:

Key	Type	Description
`google.thinkingConfig`	`map[string]any` or `bool`	Override thinking config: `{includeThoughts, thinkingLevel}`. Set to `false` to disable.
`google.safetySettings`	`[]map[string]any`	Safety settings per category.
`google.cachedContent`	`string`	Cached content resource name.
`google.responseModalities`	`[]string`	Output modalities (e.g., `["TEXT", "IMAGE"]`).
`google.mediaResolution`	`string`	Media resolution for input images/videos.
`google.audioTimestamp`	`bool`	Enable audio timestamps in response.
`google.labels`	`map[string]any`	Request labels for tracking.
`google.imageConfig`	`map[string]any`	Gemini image config: `{aspectRatio, imageSize}`.
`google.retrievalConfig`	`map[string]any`	Retrieval configuration for tool config.
`google.google_search`	`map[string]any`	Legacy Google Search via ProviderOptions (prefer `google.Tools.GoogleSearch()`).

Provider Tools

Three built-in tools are available via google.Tools. These require Gemini 2.0+.

Tool	Description	Execution
`google.Tools.GoogleSearch()`	Grounding with Google Search	Server-side
`google.Tools.URLContext()`	Fetch and process URL content	Server-side
`google.Tools.CodeExecution()`	Execute Python code in sandbox	Server-side

GoogleSearch

def := google.Tools.GoogleSearch()
result, err := goai.GenerateText(ctx, model,
    goai.WithPrompt("What are the latest Go releases?"),
    goai.WithTools(goai.Tool{
        Name:                   def.Name,
        ProviderDefinedType:    def.ProviderDefinedType,
        ProviderDefinedOptions: def.ProviderDefinedOptions,
    }),
)
// result.Sources contains grounding URLs
for _, src := range result.Sources {
    fmt.Printf("Source: %s - %s\n", src.Title, src.URL)
}

Options: WithWebSearch(), WithImageSearch(), WithTimeRange(startRFC3339, endRFC3339).

URLContext

def := google.Tools.URLContext()
result, err := goai.GenerateText(ctx, model,
    goai.WithPrompt("Summarize the content at https://go.dev/doc/effective_go"),
    goai.WithTools(goai.Tool{
        Name:                   def.Name,
        ProviderDefinedType:    def.ProviderDefinedType,
        ProviderDefinedOptions: def.ProviderDefinedOptions,
    }),
)

No configuration options. The model uses URLs from the prompt to fetch and process content.

CodeExecution

def := google.Tools.CodeExecution()
result, err := goai.GenerateText(ctx, model,
    goai.WithPrompt("Calculate the mean and median of [3, 7, 8, 5, 12, 14, 21]"),
    goai.WithTools(goai.Tool{
        Name:                   def.Name,
        ProviderDefinedType:    def.ProviderDefinedType,
        ProviderDefinedOptions: def.ProviderDefinedOptions,
    }),
)

No configuration options. The model generates Python code, executes it server-side, and uses the output in its response.

Notes

Thinking/Reasoning: Enabled by default for Gemini 2.5+ and 3.x models. Gemini 3.x models default to thinkingLevel: "high". Gemma and Gemini 1.5/2.0 models do not support thinking. Set google.thinkingConfig to false in provider options to disable.
Schema sanitization: Gemini has stricter JSON Schema requirements. GoAI auto-sanitizes schemas: enum values are coerced to strings, additionalProperties is removed, array items get a default type, and required fields are filtered to match properties.
Implicit caching: The request body uses a struct with deterministic field ordering (system instruction and tools before contents) to maximize Gemini's implicit prompt cache hit rate.
Function call IDs: Gemini does not provide tool call IDs. GoAI generates synthetic IDs in the format call_{toolName}_{N} with a counter to ensure uniqueness.
Role mapping: assistant is mapped to model, and tool is mapped to user for function responses.
Embedding batch limit: 100 values per call via batchEmbedContents. goai.EmbedMany auto-chunks larger batches.
Difference from Vertex: This provider uses Google AI (generativelanguage.googleapis.com) with API key auth. The Vertex provider uses Vertex AI endpoints with OAuth/ADC auth and supports additional features like enterprise web search and RAG stores.

Google ​

Setup ​

Models ​

Tested Models ​

Usage ​

Chat ​

Streaming ​

Embedding ​

Image Generation (Imagen) ​

Image Generation (Gemini) ​

Options ​

Provider Options (via goai.WithProviderOptions) ​

Provider Tools ​

GoogleSearch ​

URLContext ​

CodeExecution ​

Notes ​