Google AI / Vertex AI yerini alır

Local AI

Run AI models locally. Zero data leaks. Full privacy.

Run large language models on your own infrastructure with Ollama. Chat completions, embeddings, and code generation — all without sending data to third parties.

Başla API Dokümanları

5+ available

Models

100% local

Privacy

OpenAI compatible

API

45-68 tok/s

Speed

Özellikler

Chat Completion

Conversational AI with Llama 3.1, Mistral, and Phi-3 models.

Embeddings

Generate vector embeddings for semantic search and RAG.

Code Generation

Generate and explain code with CodeLlama 13B.

100% Private

All inference runs locally. Zero data leaves your server.

Multiple Models

Switch between models per request. Each optimized for different tasks.

OpenAI Compatible

Drop-in replacement for OpenAI API. Use existing SDKs.

Hızlı Entegrasyon

SDK veya REST API ile dakikalar içinde başlayın.

example.ts

// Chat with local AI
const response = await olo.ai.chat({
  model: 'llama3.1',
  messages: [
    { role: 'user', content: 'Summarize this document' }
  ]
})

// Generate embeddings for semantic search
const embeddings = await olo.ai.embeddings({
  model: 'nomic-embed',
  input: 'OloSearch is a self-hosted platform'
})

// Code generation
const code = await olo.ai.generate({
  model: 'codellama',
  prompt: 'Write a React component for a login form'
})

API Uç Noktaları

Temel URL: https://api.voltimap.com

POST/api/v1/ai/chatChat completion

POST/api/v1/ai/embeddingsGenerate embeddings

POST/api/v1/ai/generateText generation

GET/api/v1/ai/modelsList models

Tam API dokümantasyonunu görüntüle

Local AI kullanmaya bugün başlayın

14 günlük ücretsiz deneme. Kredi kartı gerekmez.

Ücretsiz Başla