RequirementIntent
Async embedding queue
The Ollama GPU qwen3-coder-next 80B model is used by the async embedding queue to generate GPU embeddings for text chunks. The Nomic-embed-text embedding model is used by the async embedding queue for batch GPU embedding processing. The file extraction process triggers the async embedding queue to generate embeddings asynchronously after extraction completes.