Project: datalens
81 entity types
Matrix/Intent/GPU-first extraction pipeline
BusinessProcessIntent

GPU-first extraction pipeline

Operational process involving GPU-based Docling extraction and semantic chunking, with rich metadata, ensuring high-quality, scalable document processing. The Extraction Pipeline (GPU-First) includes the theo orchestration server that manages FastAPI backend, RQ workers, PostgreSQL metadata, DuckDB data storage, and Redis job queue. The Extraction Pipeline (GPU-First) utilizes the elin GPU processing server which hosts the RTX 4000 GPU, runs Docling for extraction, Ollama for embeddings, and CUDA 12.8.

Attributes
labelsEntity,BusinessProcess
process levelL3_operational
Relationships2 connections
Loading graph...