Architecture
232 entities found
backend/Dockerfile
backend_storage volume
Background workers
Background workers include the catalog worker as a component. Background workers include the extract worker as a component. Background workers include the vectorize worker as a component. Background workers include the prioritize worker as a component. Background workers use RQ job chaining for job chaining and orchestration. The DOCX extractor relies on background workers for asynchronous processing. The PPTX extractor operates via background workers to perform extraction tasks. MSG extractor uses background workers to process message files. Background workers utilize RQ job chaining to coordinate sequential processing tasks.
bcrypt
Passlib depends on bcrypt for password hashing.
bge-large-en-v1.5
Document RAG uses the embedding model bge-large-en-v1.5 for semantic embedding of documents. LlamaIndex generates embeddings using bge-large-en-v1.5 model. LlamaIndex generates embeddings using bge-large-en-v1.5 model.
bits-ui
The npm dev dependency @playwright/test uses the bits-ui component as part of the frontend dependency set in the project.
catalog.py:_generate_ai_summary
The catalog.py:_generate_ai_summary function is modified to replace hardcoded budget analysis text with the Project's scope in prompts.
ChromaDB
Vector database (e.g., Qdrant) used for semantic search, with no version or license info specified. Vanna.AI integrates schema and documents embedding into ChromaDB.
Claude Opus
Claude Sonnet
CLI Tool
WrenAI primarily offers a CLI tool with an optional web UI for querying.
clsx
npm dependency: clsx@^2.1.1, used in frontend, license type and version unspecified, no approval or risk assessment noted.
CODE_STRUCTURE.md
Details the backend's code structure, including services like question_router.py and question_router.py modification for lazy-loading Qdrant.
Commit 408be74
Commit 408be74 fixes the SQL extraction issue where multiple code blocks in Qwen3 response cause incorrect matching by capturing only SELECT...; patterns in code blocks.
Container-per-project Isolation
Context Window
Limited to about 8,000 tokens; Arctic-Text2SQL-R1-7B encounters truncation with large schemas (490 tables). Reducing max_tables to 3 is planned to mitigate this performance constraint.
Coolify
DataLens uses Coolify on the theo server for deploying web frontends, APIs, and supporting infrastructure including PostgreSQL, Redis, Ollama, and Qdrant. It facilitates automated container builds, environment management, and environment variables like ANTHROPIC_API_KEY, ensuring smooth, continuous deployment and updates of the DataLens platform. IronClaw Agent Feature deployment integrates with Coolify for container build and deployment management. The Coolify backend is configured to use IronClawClient in remote mode with appropriate environment variables. ANTHROPIC_API_KEY is required to be set in Coolify environment for OpenClaw to operate successfully without timeout. Backend container depends on Coolify environment variables to have the ANTHROPIC_API_KEY set to function correctly with OpenClaw and Claude. Coolify environment variables provide necessary settings including Anthropic API key to the backend container during deployment. The DataLens platform backend integrates the coolify skill for deployment.
Coolify auto-deploy
Automated deployment process for DataLens platform updates via Coolify CI/CD system. Auto-deploy via Coolify integrates with the Commit identification and deployment process to automate deployment after pushing the master branch code. Coolify auto-deploy integrates with the Discovery API router deployment process to automate updates.
Cost-per-inspection trends
User asked about the trends in cost-per-inspection, which was addressed with a summary indicating an increase from 2,340 DKK (2021) to 2,400 DKK (2022).
CRASH_PATTERN_ANALYSIS.md
Analyzes crash patterns related to backend startup, focusing on Qdrant connection handling, and suggests lazy-loading to avoid timeouts.
CUDA 12.8
Docling extraction is constrained by the CUDA 12.8 technology on elin GPU server.
CustomTools
Extensible tools for additional functionalities like email or notifications, integrated into the agent framework.
Data Wizard
data/uploads/
The DS-STAR Intelligence Layer processes files from the 'data/uploads/' directory for autonomous extraction.
Database connectivity for Coolify
Reviewed in recent deployment; performance constraints noted.
Database services
Project services cooperate with Database services in the backend Database services cooperate with Qdrant service which is running with firewall open
DataLens Session 2026-03-03
Post-launch feature discussion on Danish language support for UI and summaries, including backend prompts and frontend i18n framework implementation.
Design Decision: Skip Column/Table Name Translation
Decided to keep column and table names unchanged to maintain data integrity and avoid query errors, using prompts to instruct language responses instead.
design decisions
The architecture comparison document validates design decisions. The architecture comparison with Vanna 2.0 and WrenAI validates design decisions made.
discovery-svgv.spec.ts
Playwright test suite with 15 real-data validation tests for Discovery feature, scheduled to be integrated into deployment pipeline.