llama-stack
Llama Stack
Getting Started
Core Concepts
API Providers
External Providers
OpenAI API Compatibility
Inference
Overview
Providers
inline::meta-reference
inline::sentence-transformers
remote::anthropic
remote::bedrock
remote::cerebras
remote::databricks
remote::fireworks
remote::gemini
remote::groq
remote::hf::endpoint
remote::hf::serverless
remote::llama-openai-compat
remote::nvidia
remote::ollama
remote::openai
remote::passthrough
remote::runpod
remote::sambanova
remote::tgi
remote::together
remote::vertexai
remote::vllm
remote::watsonx
Agents
Datasetio
Safety
Telemetry
Vector_Io
Tool_Runtime
Files
Distributions Overview
Advanced APIs
AI Application Examples
Deployment Examples
Contributing to Llama Stack
Llama Stack Benchmark Suite on Kubernetes
References
llama-stack
API Providers
Inference
inline::sentence-transformers
View page source
inline::sentence-transformers
Description
Sentence Transformers inference provider for text embeddings and similarity search.
Sample Configuration
{}
Read the Docs
v: v0.2.21
Versions
latest
v0.2.20
v0.2.19
v0.2.18
v0.2.17
v0.2.16
v0.2.15
v0.2.14
v0.2.13
v0.2.12
v0.2.11