Project: datalens
81 entity types
Matrix/Operations/DOCXExtractorTET
ServerOperations

DOCXExtractorTET

Refactored for semantic, section-based chunking with optional Docling GPU-accelerated extraction on elin. Uses heading hierarchy to define boundaries, embeds tables as JSON, improves text quality, and integrates with batch processing. The DOCX Extractor is deployed as part of the Backend and produces text chunks and table parsing outputs.

Attributes
labelsEntity
Relationships1 connections
Loading graph...