Replaces Google AI / Vertex AI

Local AI

Run AI models locally. Zero data leaks. Full privacy.

Run large language models on your own infrastructure with Ollama. Chat completions, embeddings, and code generation — all without sending data to third parties.

Models: 5+ available
Privacy: 100% local
API: OpenAI compatible
Speed: 45-68 tok/s

Features

Chat Completion

Conversational AI with Llama 3.1, Mistral, and Phi-3 models.

Embeddings

Generate vector embeddings for semantic search and RAG.

Code Generation

Generate and explain code with CodeLlama 13B.

100% Private

All inference runs locally. Zero data leaves your server.

Multiple Models

Switch between models per request. Each optimized for different tasks.

OpenAI Compatible

Drop-in replacement for the OpenAI API. Use your existing SDKs.
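
Switching models per request can be as simple as a lookup from task to model name. The sketch below is illustrative only: the model identifiers are the ones used in this page's example, while the `Task` type, the `MODEL_FOR_TASK` mapping, and the `pickModel` helper are hypothetical names and not part of any SDK.

```typescript
// Task categories taken from the feature list above.
type Task = "chat" | "embeddings" | "code";

// Model identifiers as they appear in this page's example.
// ASSUMPTION: this task-to-model mapping is illustrative, not an official default.
const MODEL_FOR_TASK: Record<Task, string> = {
  chat: "llama3.1",
  embeddings: "nomic-embed",
  code: "codellama",
};

// Hypothetical helper: choose a model for a task, with an optional
// per-request override (for example 'mistral' for a single chat call).
function pickModel(task: Task, override?: string): string {
  return override ?? MODEL_FOR_TASK[task];
}
```

The returned string would then be passed as the `model` field of a request, so each call can target a different model without changing any other code.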

Quick Integration

Get started in minutes with the SDK or the REST API.

example.ts
// Assumes `olo` is an already-initialized SDK client (setup is not shown here).
// Chat with local AI
const response = await olo.ai.chat({
  model: 'llama3.1',
  messages: [
    { role: 'user', content: 'Summarize this document' }
  ]
})

// Generate embeddings for semantic search
const embeddings = await olo.ai.embeddings({
  model: 'nomic-embed',
  input: 'OloSearch is a self-hosted platform'
})

// Code generation
const code = await olo.ai.generate({
  model: 'codellama',
  prompt: 'Write a React component for a login form'
})
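
For semantic search, the vectors returned by the embeddings call are typically compared by cosine similarity. The following is a minimal sketch of that ranking step, assuming each embedding has already been extracted as a plain `number[]` (the response shape is not documented on this page). `cosineSimilarity`, `rankBySimilarity`, and the `Doc` type are hypothetical helpers, not SDK functions.

```typescript
// Cosine similarity between two equal-length vectors.
// Returns 0 when either vector has zero length (norm), to avoid dividing by zero.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("Vectors must have the same length");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// ASSUMPTION: documents are stored alongside their precomputed embedding vectors.
interface Doc {
  id: string;
  vector: number[];
}

// Score every document against the query vector and sort best match first.
function rankBySimilarity(
  query: number[],
  docs: Doc[]
): { id: string; score: number }[] {
  return docs
    .map((d) => ({ id: d.id, score: cosineSimilarity(query, d.vector) }))
    .sort((x, y) => y.score - x.score);
}
```

In a RAG setup, the top-ranked documents would be placed into the chat prompt as context. A linear scan like this is fine for small collections; larger ones usually call for a vector index.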

API Endpoints

Base URL: https://api.voltimap.com

POST /api/v1/ai/chat
POST /api/v1/ai/embeddings
POST /api/v1/ai/generate
GET /api/v1/ai/models
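
Without the SDK, the chat endpoint can be called over plain HTTP. The sketch below only prepares the request. The base URL and path come from the list above, but the JSON body (assumed to mirror the SDK example's `model` and `messages` fields) and the Bearer-token `Authorization` header are assumptions, since this page documents neither the request schema nor the authentication scheme. `buildChatRequest` and `PreparedRequest` are hypothetical names.

```typescript
// Base URL and path as listed on this page.
const BASE_URL = "https://api.voltimap.com";

type ChatMessage = {
  role: "system" | "user" | "assistant";
  content: string;
};

interface PreparedRequest {
  url: string;
  init: {
    method: string;
    headers: Record<string, string>;
    body: string;
  };
}

// Build (but do not send) a chat request.
// ASSUMPTION: the body mirrors the SDK example ({ model, messages }).
// ASSUMPTION: Bearer-token auth; the real scheme is not documented here.
function buildChatRequest(
  apiKey: string,
  model: string,
  messages: ChatMessage[]
): PreparedRequest {
  return {
    url: `${BASE_URL}/api/v1/ai/chat`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({ model, messages }),
    },
  };
}
```

The prepared request can be sent with the standard `fetch(req.url, req.init)` call in any runtime that provides `fetch`. Keeping construction separate from sending makes the request easy to inspect and test without network access.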

Start using Local AI today

14-day free trial. No credit card required.

Start for Free