ServerOperations
Ollama embeddings
The Qdrant vector search service uses Ollama embeddings for generating vector representations of data. RAG Agent uses Ollama embeddings via nomic-embed-text for document retrieval The Docling extraction system utilizes Ollama embeddings (nomic-embed-text) to generate vector embeddings for semantic search and reasoning. The nomic-embed-text component is part of Ollama embeddings used for GPU batch embedding processing of document chunks. Qdrant vectors store the vector embeddings generated by Ollama embeddings from Docling extracted chunks for semantic search. Ollama embeddings run on the RTX 4000 SFF Ada 20GB GPU for batch processing of text chunks.