gollm

A Go library for calling into multiple Large Language Model (LLM) providers with a unified interface.

This library is intended for use by kubectl-ai, but may prove useful for other similar Go tools in the future.

Note that the library is still evolving and is likely to make incompatible changes often. We are focusing on kubectl-ai's use case, but will consider changes to support additional use cases.

Overview

gollm provides a consistent API for interacting with various LLM providers, making it easy to switch between different models and services without changing your application code. The library supports both chat-based conversations and single completions, with features like function calling, streaming responses, and retry logic.

Features

  • Multi-provider support: OpenAI, Azure OpenAI, Google Gemini, Ollama, LlamaCPP, Grok, and more
  • Unified interface: Consistent API across all providers
  • Chat conversations: Multi-turn conversations with conversation history
  • Function calling: Define and use custom functions with LLMs
  • Streaming support: Real-time streaming responses
  • Retry logic: Built-in retry mechanisms with configurable backoff
  • Response schemas: Constrain LLM responses to specific JSON schemas
  • SSL configuration: Optional SSL certificate verification skipping
  • Environment-based configuration: Easy setup via environment variables

Providers

Provider        ID            Description
OpenAI          openai://     OpenAI's GPT models
Azure OpenAI    azopenai://   Microsoft Azure's OpenAI service
Google Gemini   gemini://     Google's Gemini models
Vertex AI       vertexai://   Google Cloud Vertex AI (via Gemini)
Ollama          ollama://     Local Ollama models
LlamaCPP        llamacpp://   Local LlamaCPP models
Grok            grok://       xAI's Grok models

Quick Start

Installation
go get github.com/GoogleCloudPlatform/kubectl-ai/gollm
Basic Usage
package main

import (
    "context"
    "fmt"
    "log"
    
    "github.com/GoogleCloudPlatform/kubectl-ai/gollm"
)

func main() {
    ctx := context.Background()
    
    // Create a client using environment variable
    client, err := gollm.NewClient(ctx, "")
    if err != nil {
        log.Fatal(err)
    }
    defer client.Close()
    
    // Start a chat conversation
    chat := client.StartChat("You are a helpful assistant.", "gpt-3.5-turbo")
    
    // Send a message
    response, err := chat.Send(ctx, "Hello, how are you?")
    if err != nil {
        log.Fatal(err)
    }
    
    // Print the response
    for _, candidate := range response.Candidates() {
        fmt.Println(candidate.String())
    }
}
Environment Configuration

Set the LLM_CLIENT environment variable to specify your preferred provider:

# OpenAI
export LLM_CLIENT="openai://api.openai.com"
export OPENAI_API_KEY="your-api-key"

# Azure OpenAI
export LLM_CLIENT="azopenai://your-resource.openai.azure.com"
export AZURE_OPENAI_API_KEY="your-api-key"

# Google Gemini
export LLM_CLIENT="gemini://generativelanguage.googleapis.com"
export GOOGLE_API_KEY="your-api-key"

# Ollama (local)
export LLM_CLIENT="ollama://localhost:11434"

Examples

Single Completion
ctx := context.Background()
client, err := gollm.NewClient(ctx, "openai://api.openai.com")
if err != nil {
    log.Fatal(err)
}
defer client.Close()

req := &gollm.CompletionRequest{
    Model:  "gpt-3.5-turbo",
    Prompt: "Write a short poem about programming",
}

response, err := client.GenerateCompletion(ctx, req)
if err != nil {
    log.Fatal(err)
}

fmt.Println(response.Response())
Streaming Chat
ctx := context.Background()
client, err := gollm.NewClient(ctx, "openai://api.openai.com")
if err != nil {
    log.Fatal(err)
}
defer client.Close()

chat := client.StartChat("You are a helpful assistant.", "gpt-3.5-turbo")

// Send a streaming message
iterator, err := chat.SendStreaming(ctx, "Tell me a story about a robot")
if err != nil {
    log.Fatal(err)
}

// Process the streaming response. ChatResponseIterator is an
// iter.Seq2[ChatResponse, error], so range yields a response/error pair.
for response, err := range iterator {
    if err != nil {
        log.Printf("Error: %v", err)
        break
    }
    for _, candidate := range response.Candidates() {
        for _, part := range candidate.Parts() {
            if text, ok := part.AsText(); ok {
                fmt.Print(text)
            }
        }
    }
}
Function Calling
// Define a function that the LLM can call
functionDef := &gollm.FunctionDefinition{
    Name:        "get_weather",
    Description: "Get the current weather for a location",
    Parameters: &gollm.Schema{
        Type: gollm.TypeObject,
        Properties: map[string]*gollm.Schema{
            "location": {
                Type:        gollm.TypeString,
                Description: "The city and state, e.g. San Francisco, CA",
            },
            "unit": {
                Type:        gollm.TypeString,
                Description: "The temperature unit to use. Infer this from the user's location.",
                Required:    []string{"location"},
            },
        },
    },
}

chat := client.StartChat("You are a helpful assistant.", "gpt-3.5-turbo")
chat.SetFunctionDefinitions([]*gollm.FunctionDefinition{functionDef})

response, err := chat.Send(ctx, "What's the weather like in San Francisco?")
if err != nil {
    log.Fatal(err)
}

// Check for function calls in the response
for _, candidate := range response.Candidates() {
    for _, part := range candidate.Parts() {
        if functionCalls, ok := part.AsFunctionCalls(); ok {
            for _, call := range functionCalls {
                fmt.Printf("Function call: %s with args %v\n", call.Name, call.Arguments)
                
                // Execute the function and send the result back
                result := executeWeatherFunction(call.Arguments)
                if _, err := chat.Send(ctx, gollm.FunctionCallResult{
                    ID:     call.ID,
                    Name:   call.Name,
                    Result: result,
                }); err != nil {
                    log.Fatal(err)
                }
            }
        }
    }
}
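
The executeWeatherFunction helper above is user-supplied, not part of gollm; a hypothetical stub might look like this:

// executeWeatherFunction is a hypothetical tool implementation: it receives
// the parsed arguments from the FunctionCall and returns a result map that
// is sent back to the LLM as a FunctionCallResult.
func executeWeatherFunction(args map[string]any) map[string]any {
    location, _ := args["location"].(string)
    return map[string]any{
        "location":    location,
        "temperature": 18,
        "unit":        "celsius",
        "conditions":  "partly cloudy",
    }
}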
Response Schema Constraints
// Define a schema for structured responses
schema := &gollm.Schema{
    Type: gollm.TypeObject,
    Properties: map[string]*gollm.Schema{
        "name": {
            Type:        gollm.TypeString,
            Description: "The person's name",
        },
        "age": {
            Type:        gollm.TypeInteger,
            Description: "The person's age",
        },
        "interests": {
            Type: gollm.TypeArray,
            Items: &gollm.Schema{
                Type: gollm.TypeString,
            },
            Description: "List of interests",
        },
    },
    Required: []string{"name", "age"},
}

if err := client.SetResponseSchema(schema); err != nil {
    log.Fatal(err)
}

// Now all responses will be constrained to match this schema
response, err := chat.Send(ctx, "Tell me about a person named Alice who is 30 years old")
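
With a schema set, each candidate's text part should be JSON matching the schema. A minimal sketch of decoding it, assuming the provider honors the schema and returns the JSON as a text part (the anonymous struct below mirrors the schema above and needs encoding/json):

if err != nil {
    log.Fatal(err)
}

var person struct {
    Name      string   `json:"name"`
    Age       int      `json:"age"`
    Interests []string `json:"interests"`
}
for _, candidate := range response.Candidates() {
    for _, part := range candidate.Parts() {
        if text, ok := part.AsText(); ok {
            if err := json.Unmarshal([]byte(text), &person); err != nil {
                log.Fatal(err)
            }
        }
    }
}
fmt.Printf("%s is %d\n", person.Name, person.Age)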
Retry Logic
// Configure retry behavior
retryConfig := gollm.RetryConfig{
    MaxAttempts:    3,
    InitialBackoff: time.Second,
    MaxBackoff:     30 * time.Second,
    BackoffFactor:  2.0,
    Jitter:         true,
}

// Create a chat with retry logic
chat := client.StartChat("You are a helpful assistant.", "gpt-3.5-turbo")
retryChat := gollm.NewRetryChat(chat, retryConfig)

// Use the retry chat - it will automatically retry on retryable errors
response, err := retryChat.Send(ctx, "Hello!")
Building Schemas from Go Types
type Person struct {
    Name      string   `json:"name"`
    Age       int      `json:"age"`
    Interests []string `json:"interests,omitempty"`
}

// Automatically build a schema from a Go struct
schema := gollm.BuildSchemaFor(reflect.TypeOf(Person{}))

// Use the schema to constrain responses
if err := client.SetResponseSchema(schema); err != nil {
    log.Fatal(err)
}

Configuration Options

Client Options
// Create a client with custom options
client, err := gollm.NewClient(ctx, "openai://api.openai.com",
    gollm.WithSkipVerifySSL(), // Skip SSL verification (for development)
)
Environment Variables
  • LLM_CLIENT: The provider URL to use (e.g., "openai://api.openai.com")
  • LLM_SKIP_VERIFY_SSL: Set to "1" or "true" to skip SSL certificate verification
  • Provider-specific API keys (e.g., OPENAI_API_KEY, GOOGLE_API_KEY)

Error Handling

The library provides structured error handling with retryable error detection:

var apiErr *gollm.APIError
if errors.As(err, &apiErr) {
    fmt.Printf("API Error: Status=%d, Message=%s\n", apiErr.StatusCode, apiErr.Message)
}

// Check if an error is retryable
if chat.IsRetryableError(err) {
    // Implement retry logic
}
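
For one-off operations, the generic Retry helper (documented under Functions below) can wrap any call with the default backoff policy. A minimal sketch, reusing the chat from above; the prompt is illustrative:

response, err := gollm.Retry(ctx, gollm.DefaultRetryConfig, chat.IsRetryableError,
    func(ctx context.Context) (gollm.ChatResponse, error) {
        return chat.Send(ctx, "Hello!")
    },
)
if err != nil {
    log.Fatal(err)
}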

Adding a Provider

To add a new provider:

  1. Create a new file (e.g., myprovider.go)
  2. Implement the Client interface
  3. Register the provider in an init() function:
func init() {
    if err := gollm.RegisterProvider("myprovider", myProviderFactory); err != nil {
        panic(err)
    }
}
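
A sketch of the corresponding factory; myProviderClient is a placeholder for your type implementing the Client interface:

// myProviderFactory matches gollm.FactoryFunc. opts.URL carries the endpoint
// parsed from the provider string, and opts.SkipVerifySSL reflects
// WithSkipVerifySSL / LLM_SKIP_VERIFY_SSL.
func myProviderFactory(ctx context.Context, opts gollm.ClientOptions) (gollm.Client, error) {
    return &myProviderClient{url: opts.URL, skipVerifySSL: opts.SkipVerifySSL}, nil
}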

License

This project is licensed under the Apache License, Version 2.0. See the LICENSE file for details.

Documentation

Constants

This section is empty.

Variables

var DefaultRetryConfig = RetryConfig{
	MaxAttempts:    5,
	InitialBackoff: 200 * time.Millisecond,
	MaxBackoff:     10 * time.Second,
	BackoffFactor:  2.0,
	Jitter:         true,
}

DefaultRetryConfig provides sensible defaults.

Functions

func DefaultIsRetryableError

func DefaultIsRetryableError(err error) bool

DefaultIsRetryableError provides a default implementation based on common HTTP codes and network errors.

func RegisterProvider

func RegisterProvider(id string, factoryFunc FactoryFunc) error

func Retry

func Retry[T any](
	ctx context.Context,
	config RetryConfig,
	isRetryable IsRetryableFunc,
	operation func(ctx context.Context) (T, error),
) (T, error)

Retry executes the provided operation with retries, returning the result and error. It is generic over the return type T.

Types

type APIError

type APIError struct {
	StatusCode int
	Message    string
	Err        error
}

APIError represents an error returned by the LLM client.

func (*APIError) Error

func (e *APIError) Error() string

func (*APIError) Unwrap

func (e *APIError) Unwrap() error

type AzureOpenAICandidate

type AzureOpenAICandidate struct {
	// contains filtered or unexported fields
}

func (*AzureOpenAICandidate) Parts

func (r *AzureOpenAICandidate) Parts() []Part

func (*AzureOpenAICandidate) String

func (r *AzureOpenAICandidate) String() string

type AzureOpenAIChat

type AzureOpenAIChat struct {
	// contains filtered or unexported fields
}

func (*AzureOpenAIChat) Initialize

func (c *AzureOpenAIChat) Initialize(messages []*api.Message) error

func (*AzureOpenAIChat) IsRetryableError

func (c *AzureOpenAIChat) IsRetryableError(err error) bool

func (*AzureOpenAIChat) Send

func (c *AzureOpenAIChat) Send(ctx context.Context, contents ...any) (ChatResponse, error)

func (*AzureOpenAIChat) SendStreaming

func (c *AzureOpenAIChat) SendStreaming(ctx context.Context, contents ...any) (ChatResponseIterator, error)

func (*AzureOpenAIChat) SetFunctionDefinitions

func (c *AzureOpenAIChat) SetFunctionDefinitions(functionDefinitions []*FunctionDefinition) error

type AzureOpenAIChatResponse

type AzureOpenAIChatResponse struct {
	// contains filtered or unexported fields
}

func (*AzureOpenAIChatResponse) Candidates

func (r *AzureOpenAIChatResponse) Candidates() []Candidate

func (*AzureOpenAIChatResponse) MarshalJSON

func (r *AzureOpenAIChatResponse) MarshalJSON() ([]byte, error)

func (*AzureOpenAIChatResponse) String

func (r *AzureOpenAIChatResponse) String() string

func (*AzureOpenAIChatResponse) UsageMetadata

func (r *AzureOpenAIChatResponse) UsageMetadata() any

type AzureOpenAIClient

type AzureOpenAIClient struct {
	// contains filtered or unexported fields
}

func NewAzureOpenAIClient

func NewAzureOpenAIClient(ctx context.Context, opts ClientOptions) (*AzureOpenAIClient, error)

NewAzureOpenAIClient creates a new Azure OpenAI client. Supports ClientOptions and SkipVerifySSL for custom HTTP transport.

func (*AzureOpenAIClient) Close

func (c *AzureOpenAIClient) Close() error

func (*AzureOpenAIClient) GenerateCompletion

func (c *AzureOpenAIClient) GenerateCompletion(ctx context.Context, request *CompletionRequest) (CompletionResponse, error)

func (*AzureOpenAIClient) ListModels

func (c *AzureOpenAIClient) ListModels(ctx context.Context) ([]string, error)

func (*AzureOpenAIClient) SetResponseSchema

func (c *AzureOpenAIClient) SetResponseSchema(schema *Schema) error

func (*AzureOpenAIClient) StartChat

func (c *AzureOpenAIClient) StartChat(systemPrompt string, model string) Chat

type AzureOpenAICompletionResponse

type AzureOpenAICompletionResponse struct {
	// contains filtered or unexported fields
}

func (*AzureOpenAICompletionResponse) Response

func (r *AzureOpenAICompletionResponse) Response() string

func (*AzureOpenAICompletionResponse) UsageMetadata

func (r *AzureOpenAICompletionResponse) UsageMetadata() any

type AzureOpenAIPart

type AzureOpenAIPart struct {
	// contains filtered or unexported fields
}

func (*AzureOpenAIPart) AsFunctionCalls

func (p *AzureOpenAIPart) AsFunctionCalls() ([]FunctionCall, bool)

func (*AzureOpenAIPart) AsText

func (p *AzureOpenAIPart) AsText() (string, bool)

type BedrockClient

type BedrockClient struct {
	// contains filtered or unexported fields
}

BedrockClient implements the gollm.Client interface for AWS Bedrock models

func NewBedrockClient

func NewBedrockClient(ctx context.Context, opts ClientOptions) (*BedrockClient, error)

NewBedrockClient creates a new client for interacting with AWS Bedrock models

func (*BedrockClient) Close

func (c *BedrockClient) Close() error

Close cleans up any resources used by the client

func (*BedrockClient) GenerateCompletion

func (c *BedrockClient) GenerateCompletion(ctx context.Context, req *CompletionRequest) (CompletionResponse, error)

GenerateCompletion generates a single completion for the given request

func (*BedrockClient) ListModels

func (c *BedrockClient) ListModels(ctx context.Context) ([]string, error)

ListModels returns the list of supported Bedrock models

func (*BedrockClient) SetResponseSchema

func (c *BedrockClient) SetResponseSchema(schema *Schema) error

SetResponseSchema sets the response schema for the client (not supported by Bedrock)

func (*BedrockClient) StartChat

func (c *BedrockClient) StartChat(systemPrompt, model string) Chat

StartChat starts a new chat session with the specified system prompt and model

type Candidate

type Candidate interface {
	// String returns a string representation of the candidate.
	fmt.Stringer

	// Parts returns the parts of the candidate.
	Parts() []Part
}

Candidate is one of a set of candidate responses from the LLM.

type Chat

type Chat interface {
	// Send adds a user message to the chat, and gets the response from the LLM.
	// Note that this method automatically updates the state of the Chat;
	// you do not need to "replay" any messages from the LLM.
	Send(ctx context.Context, contents ...any) (ChatResponse, error)

	// SendStreaming is the streaming version of Send.
	SendStreaming(ctx context.Context, contents ...any) (ChatResponseIterator, error)

	// SetFunctionDefinitions configures the set of tools (functions) available to the LLM
	// for function calling.
	SetFunctionDefinitions(functionDefinitions []*FunctionDefinition) error

	// IsRetryableError returns true if the error is retryable.
	IsRetryableError(error) bool

	// Initialize initializes the chat with a previous conversation history.
	Initialize(messages []*api.Message) error
}

Chat is an active conversation with a language model. Messages are sent and received, and added to the conversation history.

func NewRetryChat

func NewRetryChat[C Chat](
	underlying C,
	config RetryConfig,
) Chat

NewRetryChat creates a new Chat that wraps the given underlying client with retry logic using the provided configuration. It returns the Chat interface type, hiding the generic implementation detail.

type ChatResponse

type ChatResponse interface {
	UsageMetadata() any

	// Candidates are a set of candidate responses from the LLM.
	// The LLM may return multiple candidates, and we can choose the best one.
	Candidates() []Candidate
}

ChatResponse is a generic chat response from the LLM.

type ChatResponseIterator

type ChatResponseIterator iter.Seq2[ChatResponse, error]

ChatResponseIterator is a streaming chat response from the LLM.
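
A minimal sketch of consuming the iterator with Go 1.23+ range-over-func, where iterator comes from a SendStreaming call:

for response, err := range iterator {
    if err != nil {
        log.Printf("stream error: %v", err)
        break
    }
    for _, candidate := range response.Candidates() {
        fmt.Print(candidate.String())
    }
}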

type Client

type Client interface {
	io.Closer

	// StartChat starts a new multi-turn chat with a language model.
	StartChat(systemPrompt, model string) Chat

	// GenerateCompletion generates a single completion for a given prompt.
	GenerateCompletion(ctx context.Context, req *CompletionRequest) (CompletionResponse, error)

	// SetResponseSchema constrains LLM responses to match the provided schema.
	// Calling with nil will clear the current schema.
	SetResponseSchema(schema *Schema) error

	// ListModels lists the models available in the LLM.
	ListModels(ctx context.Context) ([]string, error)
}

Client is a client for a language model.

func NewClient

func NewClient(ctx context.Context, providerID string, opts ...Option) (Client, error)

NewClient builds a Client based on the LLM_CLIENT environment variable or the provided providerID. If providerID is not empty, it overrides the value from LLM_CLIENT. Supports Option parameters and the LLM_SKIP_VERIFY_SSL environment variable.

type ClientOptions

type ClientOptions struct {
	URL           *url.URL
	SkipVerifySSL bool
}

type CompletionRequest

type CompletionRequest struct {
	Model  string `json:"model,omitempty"`
	Prompt string `json:"prompt,omitempty"`
}

CompletionRequest is a request to generate a completion for a given prompt.

type CompletionResponse

type CompletionResponse interface {
	Response() string
	UsageMetadata() any
}

CompletionResponse is a response from the GenerateCompletion method.

type FactoryFunc

type FactoryFunc func(ctx context.Context, opts ClientOptions) (Client, error)

type FunctionCall

type FunctionCall struct {
	ID        string         `json:"id,omitempty"`
	Name      string         `json:"name,omitempty"`
	Arguments map[string]any `json:"arguments,omitempty"`
}

FunctionCall is a function call to a language model. The LLM will reply with a FunctionCall to a user-defined function, and we will send the results back.

type FunctionCallResult

type FunctionCallResult struct {
	ID     string         `json:"id,omitempty"`
	Name   string         `json:"name,omitempty"`
	Result map[string]any `json:"result,omitempty"`
}

FunctionCallResult is the result of a function call. We use this to send the results back to the LLM.

type FunctionDefinition

type FunctionDefinition struct {
	Name        string  `json:"name,omitempty"`
	Description string  `json:"description,omitempty"`
	Parameters  *Schema `json:"parameters,omitempty"`
}

FunctionDefinition is a user-defined function that can be called by the LLM. If the LLM determines the function should be called, it will reply with a FunctionCall object; we will invoke the function and send the results back.

type GeminiAPIClientOptions

type GeminiAPIClientOptions struct {
	// API Key for GenAI. Required for BackendGeminiAPI.
	APIKey string
}

GeminiAPIClientOptions are the options for the Gemini API client.

type GeminiCandidate

type GeminiCandidate struct {
	// contains filtered or unexported fields
}

GeminiCandidate is a candidate for the response. It implements the Candidate interface.

func (*GeminiCandidate) Parts

func (r *GeminiCandidate) Parts() []Part

Parts returns the parts of the candidate.

func (*GeminiCandidate) String

func (r *GeminiCandidate) String() string

String returns a string representation of the response.

type GeminiChat

type GeminiChat struct {
	// contains filtered or unexported fields
}

GeminiChat is a chat with the model. It implements the Chat interface.

func (*GeminiChat) Initialize

func (c *GeminiChat) Initialize(messages []*api.Message) error

func (*GeminiChat) IsRetryableError

func (c *GeminiChat) IsRetryableError(err error) bool

func (*GeminiChat) Send

func (c *GeminiChat) Send(ctx context.Context, contents ...any) (ChatResponse, error)

Send sends a message to the model. It returns a ChatResponse object containing the response from the model.

func (*GeminiChat) SendStreaming

func (c *GeminiChat) SendStreaming(ctx context.Context, contents ...any) (ChatResponseIterator, error)

func (*GeminiChat) SetFunctionDefinitions

func (c *GeminiChat) SetFunctionDefinitions(functionDefinitions []*FunctionDefinition) error

SetFunctionDefinitions sets the function definitions for the chat. This allows the LLM to call user-defined functions.

type GeminiChatResponse

type GeminiChatResponse struct {
	// contains filtered or unexported fields
}

GeminiChatResponse is a response from the Gemini API. It implements the ChatResponse interface.

func (*GeminiChatResponse) Candidates

func (r *GeminiChatResponse) Candidates() []Candidate

Candidates returns the candidates for the response.

func (*GeminiChatResponse) MarshalJSON

func (r *GeminiChatResponse) MarshalJSON() ([]byte, error)

func (*GeminiChatResponse) String

func (r *GeminiChatResponse) String() string

String returns a string representation of the response.

func (*GeminiChatResponse) UsageMetadata

func (r *GeminiChatResponse) UsageMetadata() any

UsageMetadata returns the usage metadata for the response.

type GeminiCompletionResponse

type GeminiCompletionResponse struct {
	// contains filtered or unexported fields
}

func (*GeminiCompletionResponse) MarshalJSON

func (r *GeminiCompletionResponse) MarshalJSON() ([]byte, error)

func (*GeminiCompletionResponse) Response

func (r *GeminiCompletionResponse) Response() string

func (*GeminiCompletionResponse) String

func (r *GeminiCompletionResponse) String() string

func (*GeminiCompletionResponse) UsageMetadata

func (r *GeminiCompletionResponse) UsageMetadata() any

type GeminiPart

type GeminiPart struct {
	// contains filtered or unexported fields
}

GeminiPart is a part of a candidate. It implements the Part interface.

func (*GeminiPart) AsFunctionCalls

func (p *GeminiPart) AsFunctionCalls() ([]FunctionCall, bool)

AsFunctionCalls returns the function calls of the part.

func (*GeminiPart) AsText

func (p *GeminiPart) AsText() (string, bool)

AsText returns the text of the part.

type GoogleAIClient

type GoogleAIClient struct {
	// contains filtered or unexported fields
}

GoogleAIClient is a client for the Google AI APIs. It implements the Client interface.

func NewGeminiAPIClient

func NewGeminiAPIClient(ctx context.Context, opt GeminiAPIClientOptions) (*GoogleAIClient, error)

NewGeminiAPIClient builds a client for the Gemini API.

func NewVertexAIClient

func NewVertexAIClient(ctx context.Context, opt VertexAIClientOptions) (*GoogleAIClient, error)

NewVertexAIClient builds a client for the Vertex AI API.

func (*GoogleAIClient) Close

func (c *GoogleAIClient) Close() error

Close frees the resources used by the client.

func (*GoogleAIClient) GenerateCompletion

func (c *GoogleAIClient) GenerateCompletion(ctx context.Context, request *CompletionRequest) (CompletionResponse, error)

func (*GoogleAIClient) ListModels

func (c *GoogleAIClient) ListModels(ctx context.Context) (modelNames []string, err error)

ListModels lists the models available in the Gemini API.

func (*GoogleAIClient) SetResponseSchema

func (c *GoogleAIClient) SetResponseSchema(responseSchema *Schema) error

SetResponseSchema constrains LLM responses to match the provided schema. Calling with nil will clear the current schema.

func (*GoogleAIClient) StartChat

func (c *GoogleAIClient) StartChat(systemPrompt string, model string) Chat

StartChat starts a new chat with the model.

type GrokClient

type GrokClient struct {
	// contains filtered or unexported fields
}

GrokClient implements the gollm.Client interface for xAI's Grok models.

func NewGrokClient

func NewGrokClient(ctx context.Context, opts ClientOptions) (*GrokClient, error)

NewGrokClient creates a new client for interacting with xAI's Grok models. Supports custom HTTP client and skipVerifySSL via ClientOptions.

func (*GrokClient) Close

func (c *GrokClient) Close() error

Close cleans up any resources used by the client.

func (*GrokClient) GenerateCompletion

func (c *GrokClient) GenerateCompletion(ctx context.Context, req *CompletionRequest) (CompletionResponse, error)

GenerateCompletion sends a completion request to the Grok API.

func (*GrokClient) ListModels

func (c *GrokClient) ListModels(ctx context.Context) ([]string, error)

ListModels returns a list of available Grok models.

func (*GrokClient) SetResponseSchema

func (c *GrokClient) SetResponseSchema(schema *Schema) error

SetResponseSchema is not implemented yet for Grok.

func (*GrokClient) StartChat

func (c *GrokClient) StartChat(systemPrompt, model string) Chat

StartChat starts a new chat session.

type IsRetryableFunc

type IsRetryableFunc func(error) bool

IsRetryableFunc defines the signature for functions that check if an error is retryable. TODO (droot): Adjust the signature to allow the underlying client to relay the backoff delay, etc.; for example, Gemini's error codes contain retryDelay information.

type LlamaCppCandidate

type LlamaCppCandidate struct {
	// contains filtered or unexported fields
}

func (*LlamaCppCandidate) Parts

func (r *LlamaCppCandidate) Parts() []Part

func (*LlamaCppCandidate) String

func (r *LlamaCppCandidate) String() string

type LlamaCppChat

type LlamaCppChat struct {
	// contains filtered or unexported fields
}

func (*LlamaCppChat) Initialize

func (c *LlamaCppChat) Initialize(messages []*api.Message) error

func (*LlamaCppChat) IsRetryableError

func (c *LlamaCppChat) IsRetryableError(err error) bool

func (*LlamaCppChat) Send

func (c *LlamaCppChat) Send(ctx context.Context, contents ...any) (ChatResponse, error)

func (*LlamaCppChat) SendStreaming

func (c *LlamaCppChat) SendStreaming(ctx context.Context, contents ...any) (ChatResponseIterator, error)

func (*LlamaCppChat) SetFunctionDefinitions

func (c *LlamaCppChat) SetFunctionDefinitions(functionDefinitions []*FunctionDefinition) error

type LlamaCppChatResponse

type LlamaCppChatResponse struct {
	LlamaCppResponse llamacppChatResponse
	// contains filtered or unexported fields
}

func (*LlamaCppChatResponse) Candidates

func (r *LlamaCppChatResponse) Candidates() []Candidate

func (*LlamaCppChatResponse) MarshalJSON

func (r *LlamaCppChatResponse) MarshalJSON() ([]byte, error)

func (*LlamaCppChatResponse) String

func (r *LlamaCppChatResponse) String() string

func (*LlamaCppChatResponse) UsageMetadata

func (r *LlamaCppChatResponse) UsageMetadata() any

type LlamaCppClient

type LlamaCppClient struct {
	// contains filtered or unexported fields
}

func NewLlamaCppClient

func NewLlamaCppClient(ctx context.Context, opts ClientOptions) (*LlamaCppClient, error)

NewLlamaCppClient creates a new client for llama.cpp. Supports custom HTTP client and skipVerifySSL via ClientOptions.

func (*LlamaCppClient) Close

func (c *LlamaCppClient) Close() error

func (*LlamaCppClient) GenerateCompletion

func (c *LlamaCppClient) GenerateCompletion(ctx context.Context, request *CompletionRequest) (CompletionResponse, error)

func (*LlamaCppClient) ListModels

func (c *LlamaCppClient) ListModels(ctx context.Context) ([]string, error)

func (*LlamaCppClient) SetResponseSchema

func (c *LlamaCppClient) SetResponseSchema(responseSchema *Schema) error

func (*LlamaCppClient) StartChat

func (c *LlamaCppClient) StartChat(systemPrompt, model string) Chat

type LlamaCppCompletionResponse

type LlamaCppCompletionResponse struct {
	// contains filtered or unexported fields
}

func (*LlamaCppCompletionResponse) Response

func (r *LlamaCppCompletionResponse) Response() string

func (*LlamaCppCompletionResponse) UsageMetadata

func (r *LlamaCppCompletionResponse) UsageMetadata() any

type LlamaCppPart

type LlamaCppPart struct {
	// contains filtered or unexported fields
}

func (*LlamaCppPart) AsFunctionCalls

func (p *LlamaCppPart) AsFunctionCalls() ([]FunctionCall, bool)

func (*LlamaCppPart) AsText

func (p *LlamaCppPart) AsText() (string, bool)

type OllamaCandidate

type OllamaCandidate struct {
	// contains filtered or unexported fields
}

func (*OllamaCandidate) Parts

func (r *OllamaCandidate) Parts() []Part

func (*OllamaCandidate) String

func (r *OllamaCandidate) String() string

type OllamaChat

type OllamaChat struct {
	// contains filtered or unexported fields
}

func (*OllamaChat) Initialize

func (c *OllamaChat) Initialize(messages []*kctlApi.Message) error

func (*OllamaChat) IsRetryableError

func (c *OllamaChat) IsRetryableError(err error) bool

func (*OllamaChat) Send

func (c *OllamaChat) Send(ctx context.Context, contents ...any) (ChatResponse, error)

func (*OllamaChat) SendStreaming

func (c *OllamaChat) SendStreaming(ctx context.Context, contents ...any) (ChatResponseIterator, error)

func (*OllamaChat) SetFunctionDefinitions

func (c *OllamaChat) SetFunctionDefinitions(functionDefinitions []*FunctionDefinition) error

type OllamaChatResponse

type OllamaChatResponse struct {
	// contains filtered or unexported fields
}

func (*OllamaChatResponse) Candidates

func (r *OllamaChatResponse) Candidates() []Candidate

func (*OllamaChatResponse) MarshalJSON

func (r *OllamaChatResponse) MarshalJSON() ([]byte, error)

func (*OllamaChatResponse) String

func (r *OllamaChatResponse) String() string

func (*OllamaChatResponse) UsageMetadata

func (r *OllamaChatResponse) UsageMetadata() any

type OllamaClient

type OllamaClient struct {
	// contains filtered or unexported fields
}

func NewOllamaClient

func NewOllamaClient(ctx context.Context, opts ClientOptions) (*OllamaClient, error)

NewOllamaClient creates a new client for Ollama. Supports custom HTTP client and skipVerifySSL via ClientOptions if the SDK supports it.

func (*OllamaClient) Close

func (c *OllamaClient) Close() error

func (*OllamaClient) GenerateCompletion

func (c *OllamaClient) GenerateCompletion(ctx context.Context, request *CompletionRequest) (CompletionResponse, error)

func (*OllamaClient) ListModels

func (c *OllamaClient) ListModels(ctx context.Context) ([]string, error)

func (*OllamaClient) SetResponseSchema

func (c *OllamaClient) SetResponseSchema(schema *Schema) error

func (*OllamaClient) StartChat

func (c *OllamaClient) StartChat(systemPrompt, model string) Chat

type OllamaCompletionResponse

type OllamaCompletionResponse struct {
	// contains filtered or unexported fields
}

func (*OllamaCompletionResponse) Response

func (r *OllamaCompletionResponse) Response() string

func (*OllamaCompletionResponse) UsageMetadata

func (r *OllamaCompletionResponse) UsageMetadata() any

type OllamaPart

type OllamaPart struct {
	// contains filtered or unexported fields
}

func (*OllamaPart) AsFunctionCalls

func (p *OllamaPart) AsFunctionCalls() ([]FunctionCall, bool)

func (*OllamaPart) AsText

func (p *OllamaPart) AsText() (string, bool)

type OpenAIClient

type OpenAIClient struct {
	// contains filtered or unexported fields
}

OpenAIClient implements the gollm.Client interface for OpenAI models.

func NewOpenAIClient

func NewOpenAIClient(ctx context.Context, opts ClientOptions) (*OpenAIClient, error)

NewOpenAIClient creates a new client for interacting with OpenAI. Supports custom HTTP client (e.g., for skipping SSL verification).

func (*OpenAIClient) Close

func (c *OpenAIClient) Close() error

Close cleans up any resources used by the client.

func (*OpenAIClient) GenerateCompletion

func (c *OpenAIClient) GenerateCompletion(ctx context.Context, req *CompletionRequest) (CompletionResponse, error)

GenerateCompletion sends a completion request to the OpenAI API.

func (*OpenAIClient) ListModels

func (c *OpenAIClient) ListModels(ctx context.Context) ([]string, error)

ListModels returns a slice of strings with model IDs. Note: This may not work with all OpenAI-compatible providers if they don't fully implement the Models.List endpoint or return data in a different format.

func (*OpenAIClient) SetResponseSchema

func (c *OpenAIClient) SetResponseSchema(schema *Schema) error

SetResponseSchema is not implemented yet.

func (*OpenAIClient) StartChat

func (c *OpenAIClient) StartChat(systemPrompt, model string) Chat

StartChat starts a new chat session.

type Option

type Option func(*ClientOptions)

Option is a functional option for configuring ClientOptions.

func WithSkipVerifySSL

func WithSkipVerifySSL() Option

WithSkipVerifySSL enables skipping SSL certificate verification for HTTP clients.

type Part

type Part interface {
	// AsText returns the text of the part.
	// if the part is not text, it returns ("", false)
	AsText() (string, bool)

	// AsFunctionCalls returns the function calls of the part.
	// if the part is not a function call, it returns (nil, false)
	AsFunctionCalls() ([]FunctionCall, bool)
}

Part is a part of a candidate response from the LLM. It can be a text response or a function call. A response may comprise multiple parts, for example a text response followed by a function call, where the text is "I need to do the necessary" and the function call is "do_necessary".

type RecordChatResponse

type RecordChatResponse struct {
	// TODO: Structured data?
	Raw any `json:"raw"`
}

type RecordCompletionResponse

type RecordCompletionResponse struct {
	Text string `json:"text"`
	Raw  any    `json:"raw"`
}

type RetryConfig

type RetryConfig struct {
	MaxAttempts    int
	InitialBackoff time.Duration
	MaxBackoff     time.Duration
	BackoffFactor  float64
	Jitter         bool
}

RetryConfig holds the configuration for the retry mechanism.

type Schema

type Schema struct {
	Type        SchemaType         `json:"type,omitempty"`
	Properties  map[string]*Schema `json:"properties,omitempty"`
	Items       *Schema            `json:"items,omitempty"`
	Description string             `json:"description,omitempty"`
	Required    []string           `json:"required,omitempty"`
}

Schema is a schema for a function definition.

func BuildSchemaFor

func BuildSchemaFor(t reflect.Type) *Schema

BuildSchemaFor builds a schema for the given Go type. Because the generated schema has no descriptions populated, it is more useful for response schemas than for tools/functions.

func (*Schema) ToRawSchema

func (s *Schema) ToRawSchema() (json.RawMessage, error)

ToRawSchema converts a Schema to a json.RawMessage.

type SchemaType

type SchemaType string

SchemaType is the type of a field in a Schema.

const (
	TypeObject SchemaType = "object"
	TypeArray  SchemaType = "array"

	TypeString  SchemaType = "string"
	TypeBoolean SchemaType = "boolean"
	TypeNumber  SchemaType = "number"
	TypeInteger SchemaType = "integer"
)

type VertexAIClientOptions

type VertexAIClientOptions struct {
	// GCP Project ID for Vertex AI. Required for BackendVertexAI.
	Project string
	// GCP Location/Region for Vertex AI. Required for BackendVertexAI. See https://cloud.google.com/vertex-ai/docs/general/locations
	Location string
}

VertexAIClientOptions are the options for using the Vertex AI API.
