Search results
8 packages found
Yet another library to extract text from MS Office and PDF files
published version 3.0.3, a year ago · 9 dependents · licensed under ISC
39,741
This package fixes MS Excel sheet names by limiting them to 31 characters, handling empty sheet names, and removing illegal characters such as :\/?*[].
published version 2.0.0, 9 years ago · 0 dependents · licensed under MIT
70
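The sheet-name rules mentioned in the description above (a 31-character limit, no empty names, no `:\/?*[]` characters) can be sketched as follows. This is a minimal illustration only: `sanitizeSheetName`, its `fallback` parameter, and the constant names are hypothetical and are not the API of the package listed here.

```typescript
// Hypothetical helper, written for illustration; not the listed package's API.
// Characters Excel rejects in sheet names: : \ / ? * [ ]
const ILLEGAL_SHEET_CHARS = /[:\\\/?*\[\]]/g;
const MAX_SHEET_NAME_LENGTH = 31;

function sanitizeSheetName(name: string, fallback = "Sheet1"): string {
  // Remove the illegal characters, then trim surrounding whitespace.
  const cleaned = name.replace(ILLEGAL_SHEET_CHARS, "").trim();
  // An empty name is not allowed, so substitute the (assumed) fallback.
  const nonEmpty = cleaned.length > 0 ? cleaned : fallback;
  // Truncate to the 31-character limit.
  return nonEmpty.slice(0, MAX_SHEET_NAME_LENGTH);
}
```

Calling `sanitizeSheetName("Q1/Q2: [draft]?")` would yield `Q1Q2 draft`, and an empty input falls back to `Sheet1`. The choice to delete illegal characters rather than replace them with a placeholder is an assumption of this sketch.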
Fork of office-text-extractor with unreleased changes that include browser support
published version 3.1.4, 9 months ago · 0 dependents · licensed under ISC
24
The `@anyparser/core` TypeScript SDK enables developers to quickly extract structured data from a wide variety of file formats, including PDFs, images, websites, audio, and video.
- anyparser
- ai
- artificial-intelligence
- rag
- retrieval-augmented-generation
- graph-rag
- cag
- cache-augmented-generation
- pdf-processing
- pdf-extraction
- ms-office
- microsoft-office
- microsoft-word
published version 1.0.1, 4 months ago · 0 dependents · licensed under Apache-2.0
22
Convert MS-Office (Word/Excel/PowerPoint) documents to PDF files via Office Online (and OneDrive).
published version 0.2.4, 10 years ago · 0 dependents · licensed under MIT
11
Work with Office filename extensions. Check whether a file is an Office file.
published version 0.0.2, 7 years ago · 0 dependents · licensed under MIT
9
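An extension-based check of the kind described above might look like the sketch below. The function name `isOfficeFile` and the extension list are assumptions made for illustration; the listed package may recognize a different set of extensions and expose a different interface.

```typescript
// Hypothetical helper; the extension list is an assumption, not the package's own.
const OFFICE_EXTENSIONS = new Set(["doc", "docx", "xls", "xlsx", "ppt", "pptx"]);

function isOfficeFile(filename: string): boolean {
  const dot = filename.lastIndexOf(".");
  // No dot at all, or a leading dot with no base name: treat as no extension.
  if (dot <= 0) return false;
  // Compare case-insensitively against the known extensions.
  return OFFICE_EXTENSIONS.has(filename.slice(dot + 1).toLowerCase());
}
```

Only the final extension is inspected, so `archive.xlsx.zip` is not treated as an Office file. A check like this looks at the name alone and says nothing about the file's actual contents.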
Yet another library to extract text from MS Office and PDF files
published version 3.0.4, a year ago · 0 dependents · licensed under ISC
7
Converts a docx file to html
published version 0.7.0, a year ago · 0 dependents · licensed under MIT
3