DataEntity: Data Model
tables as JSON
The DOCX extractor embeds each extracted table as JSON inside a text chunk instead of storing it as a separate DuckDB table. This follows a business rule of the Docling extraction system: tables extracted from documents are embedded as JSON within semantic chunks.
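To make the rule concrete, here is a minimal sketch of what embedding a table as JSON within a chunk could look like. It uses only the Python standard library and does not reproduce the extractor's actual code or any Docling API. The `Chunk` class, the `embed_table_as_json` and `extract_tables` helpers, and the `{"type": "table", "rows": [...]}` payload shape are all hypothetical names chosen for illustration.

```python
import json
from dataclasses import dataclass, field


@dataclass
class Chunk:
    """Hypothetical semantic chunk: plain text plus free-form metadata."""
    text: str
    metadata: dict = field(default_factory=dict)


def embed_table_as_json(chunk_text, header, rows):
    """Append a table to the chunk text as a single-line JSON object.

    The payload shape is an assumption for this sketch, not the
    extractor's real schema.
    """
    records = [dict(zip(header, row)) for row in rows]
    payload = json.dumps({"type": "table", "rows": records}, ensure_ascii=False)
    return Chunk(text=f"{chunk_text}\n{payload}", metadata={"has_table": True})


def extract_tables(chunk):
    """Recover embedded tables by parsing every line that holds a JSON object."""
    tables = []
    for line in chunk.text.splitlines():
        line = line.strip()
        if not line.startswith("{"):
            continue
        try:
            obj = json.loads(line)
        except json.JSONDecodeError:
            continue  # ordinary prose that happens to start with a brace
        if isinstance(obj, dict) and obj.get("type") == "table":
            tables.append(obj["rows"])
    return tables


chunk = embed_table_as_json(
    "Quarterly results:",
    ["region", "revenue"],
    [["EMEA", 10], ["APAC", 12]],
)
tables = extract_tables(chunk)
```

Under these assumptions the table travels with the prose that surrounds it, so a consumer reading the chunk needs no join against a separate table store. The trade-off is that the table can no longer be queried directly with SQL.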