ai.vectorizer_errors table.
Parser selection
The parser selection examines file extensions and content types:- PDF files, images, Office documents (DOCX, XLSX, etc.): Uses Docling
- EPUB and MOBI (e-book formats): Uses PyMuPDF
- Text formats (TXT, MD, etc.): No parser used (content read directly)
Samples
Use automatic parser selection
Arguments
This function takes no parameters.Returns
A JSON configuration object for use increate_vectorizer().
Related functions
parsing_none(): skip parsing for textual dataparsing_docling(): explicitly use Docling parserparsing_pymupdf(): explicitly use PyMuPDF parserloading_uri(): load data from file URIs