You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Current indexing only source from .md and .rst files for simplicity, discarding valuable information in .pdf and .csv and other file types.
π‘ Proposed Solution
For unified internal interface, convert all to .md using docling-project/docling while retaining metadata before indexing. This allows future-proof file type support, even for code files and others.
π€ Alternatives Considered
Considered low level approaches such as using pdfplumber, determined too complex given existing integrated solutions such as docling.