MKB Explorer

ConsolidationRecommendation

DiscoveryService produces ConsolidationRecommendation to suggest table consolidations for questions. DiscoveryService generates ConsolidationRecommendation for intelligent table consolidation based on questions.

Content filtering

Contextual summaries with project context

Requirement to produce file summaries that relate content to project goals; improvements made for relevance.

control questions

System includes QA measures like data provenance tracking, validation, and iterative refinement, supporting quality control.

AcceptanceCriteriaIntent

Core functionality test suite

CORS

CORS configuration details are not explicitly mentioned in the provided messages.

CapabilityIntent

Cost per inspection analysis

QuestionRouter realizes the capability to answer cost per inspection analysis queries using structured data.

Cost Reduction Achievement Use Case

Business process for analyzing savings and reductions across related tables.

StakeholderIntent

CRISP-BI

Cross-file queries

Enables queries across multiple files using standard schemas, with ongoing enhancements to schema management and mapping.

CSV Extraction

CSV extractor component implements the CSV Extraction requirement.

CSV extraction to DuckDB

Operational process for ingesting CSV files into DuckDB, now stable and part of the batch extraction workflow.

CSV files

Data source to be validated, cleaned, and loaded into DuckDB for analysis. DS-STAR Orchestrator processes CSV files during autonomous extraction DS-STAR Orchestrator processes Excel files during autonomous extraction DS-STAR Orchestrator processes CSV files during autonomous extraction DS-STAR Orchestrator processes Excel files during autonomous extraction Extractor Agents validate and clean CSV files for loading into the database.

CapabilityIntent

DABStep

Testing, Benchmarks, Polish references DABStep for data analysis benchmarks. The DataLens Master Implementation Plan uses DABStep as a dataset to benchmark query accuracy during testing. Data analysis benchmarks include DABStep as a benchmark component.

StakeholderIntent

Danish Government - Styrelsen for Grøn Arealomlægning og Vandmiljø

The SVGV Budget Analysis Project is owned by the Danish Government - Styrelsen for Grøn Arealomlægning og Vandmiljø. The Danish Government's agency Styrelsen for Grøn Arealomlægning og Vandmiljø is the client for the SVGV Budget Analysis Project.

Danish to English translation dictionary

TableCatalog uses a Danish to English translation dictionary to translate Danish table names into English keywords.

dashboard.spec.ts

The project dashboard UI is validated by the dashboard.spec.ts frontend test case.

Data analysis benchmarks

DataLens Master Implementation Plan incorporates Data analysis benchmarks like DABStep and KramaBench. Data analysis benchmarks include DABStep as a benchmark component. Data analysis benchmarks include KramaBench as one of the benchmarking tools. The DataLens Master Implementation Plan includes research and testing with Data analysis benchmarks like DABStep dataset and KramaBench. The Data analysis benchmarks use the DABStep dataset for measuring query accuracy.

EpicIntent

Data Discovery feature

The Data Discovery feature includes backend services, UI components like DiscoveryFlow.svelte, and supports the SVGV dataset with 132 files, 473 tables, and 351K rows. It performs semantic matching, search, and consolidation to improve data analysis success rates, validated by extensive E2E tests. It offers /discovery, /tables, and /validate APIs, and is a key part of DataLens, validated by the team and stakeholder Arne Hauge. The Data Discovery feature uses the Discovery service. The Data Discovery system integrates with the PostgreSQL database for storing and retrieving data. The Data Discovery system depends on the RQ Worker to process extraction jobs asynchronously. Arne Hauge is expected to use the Data Discovery system after deployment. The Data Discovery feature was delivered and represented by commit 68764d5.

data pipelines

DataLens helps build and run data pipelines. DataLens agent works on building and running data pipelines in its analysis workflows.

AcceptanceCriteriaIntent

Data validation test suite

CapabilityIntent

Data volume

DATA_BACKEND feature flag

The DATA_BACKEND feature flag can toggle between using PostgreSQL (PgDataService) and DuckDBService as the data backend.