Word processing formats
| File Format | Detection | Content Extraction | Meta Data Extraction |
|---|---|---|---|
| Ability WP | Y | N | N |
| Ability Write (later versions) | Y | N | N |
| ACT | Y | N | N |
| Adobe InDesign document | Y | N | N |
| Adobe InDesign IDML format | Y | N | N |
| Adobe FrameMaker | Y | N | N |
| Adobe FrameMaker Interchange | Y | Y | N |
| Adobe PDF | Y | Y | Y |
| Adobe XML Data Package format | Y | N | N |
| AES Multiplus Comm | Y | N | N |
| Android Binary XML (compressed byaapt) format | Y | N | N |
| Apple iBooks format | Y | N | N |
| Apple iChat Log | Y | Y | N |
| Apple iWork Pages (‘08, ‘09) | Y | Y | Y |
| Apple iWork Pages (‘13, ‘16, iCloud 2018) | Y | Y | N |
| Applix Alis | Y | N | N |
| Applix Asterix | Y | N | N |
| Applix Words | Y | Y | N |
| AT&T DjVu format | Y | N | N |
| Atom Syndication Format | Y | N | N |
| Broad Band eBook (BBeB) in LRF format | Y | N | N |
| Calamus Desktop Publishing | Y | N | N |
| Chemical Markup Language (CML) XML format | Y | N | N |
| COMET TOP Word | Y | N | N |
| Convergent Technologies DEF Comm. | Y | N | N |
| Corel WordPerfect Macintosh | Y | Y | N |
| Corel WordPerfect Windows | Y | Y | P |
| File Format | Detection | Content Extraction | Meta Data Extraction |
|---|---|---|---|
| Word Perfect (version5) | Y | N | N |
| WordPerfect (version 6 and higher) | Y | N | N |
| WordPerfect Graphics (version 1) | Y | N | N |
| CPT Communication | Y | N | N |
| Data Point Vistaword | Y | N | N |
| DCS | Y | N | N |
| DEC WPS PLUS | Y | N | N |
| DECdx | Y | N | N |
| DG CEOwrite | Y | N | N |
| DG Common Data Stream | Y | N | N |
| Digital Document Interchange Format (DDIF) | Y | N | N |
| Digitally Signed PDF file (Native) | Y | N | N |
| DisplayWrite | Y | Y | N |
| DNAML DNL eBook | Y | N | N |
| DSA101 | Y | N | N |
| e-Szigno signed xml document | Y | N | N |
| EBCDIC-encoded XML file | Y | N | N |
| EBCDIC Text | Y | N | N |
| eFax | Y | N | N |
| Electronic Publication | Y | N | N |
| Enable | Y | N | N |
| Encrypted Microsoft OneNote Files | Y | N | N |
| Envoy | Y | N | N |
| Extensible Data Format (XDF) XML format | Y | N | N |
| Extensible Style sheet Language Transformations (XSLT) format | Y | N | N |
| Folio Flat File | Y | Y | Y |
| Founder Chinese E-paper Basic | Y | Y | N |
| Foxmail email format | Y | N | N |
| Fujitsu Oasys | Y | Y | P |
| Haansoft Hangul | Y | Y | Y |
| File Format | Detection | Content Extraction | Meta Data Extraction |
|---|---|---|---|
| Hangul HWPX document | Y | N | N |
| Health level7 | Y | Y | Y |
| HP Word PC | Y | N | N |
| IBM 1403 Line Printer | Y | N | N |
| IBM DCA/RFT | Y | Y | N |
| IBM DCA-FFT | Y | N | N |
| IBM DCF Script | Y | N | N |
| Ichitaro Compressed | Y | N | N |
| Informix SmartWare II | Y | N | N |
| Interleaf | Y | N | N |
| Java Network Launching Protocol | Y | N | N |
| JustSystems Ichitaro | Y | Y | P |
| Lotus AMI Pro | Y | Y | P |
| Lotus AMI Pro Style Sheet | Y | Y | P |
| Lotus Notes CDF | Y | N | N |
| Lotus Word Pro (96, 97, R9) | Y | Y | P |
| Lotus SmartMaster | Y | Y | N |
| Lotus Organizer documents | Y | N | N |
| Lyrix | Y | N | N |
| Machine-Readable Cataloging (MARC) XML format | Y | N | N |
| Macromedia Flash FLA Project File OLE format | Y | N | N |
| MacWrite | Y | N | N |
| MacWrite II | Y | N | N |
| MASS-11 | Y | N | N |
| Metadata Encoding and Transmission Standard (METS) XML format | Y | N | N |
| Metadata Object Description Schema (MODS) XML format | Y | N | N |
| Metalink XML format | Y | N | N |
| Microsoft Database Markup Language XML document | Y | N | N |
| Microsoft Excel HTML format | Y | N | N |
| File Format | Detection | Content Extraction | Meta Data Extraction |
|---|---|---|---|
| Microsoft Front Page macro file format | Y | N | N |
| Microsoft Help file | Y | N | N |
| Microsoft Office Groove | Y | N | N |
| Microsoft Office Word Files | Y | Y | N |
| Microsoft Office Word Macro- enabled Files (OOXML) | Y | Y | Y |
| Microsoft OneNote | Y | N | N |
| Microsoft Outlook vCard Contact Files | Y | N | N |
| Microsoft Pocket Word | Y | N | N |
| Microsoft Publisher | Y | Y | Y |
| Microsoft Windows Cardfile address book format | Y | N | N |
| Microsoft Windows Sticky Notes format | Y | N | N |
| Microsoft Windows Write | Y | Y | N |
| Microsoft Word (UNIX) | Y | N | N |
| Microsoft Word 2007 Flat XML | Y | Y | Y |
| Microsoft Word for Macintosh | Y | Y | Y |
| Microsoft Word HTML format | Y | N | N |
| Microsoft Word Windows (1.0, 2.0) | Y | Y | N |
| Microsoft Word Windows (all other versions) | Y | Y | Y |
| Microsoft Word PC (incl. Glossary, Stylesheet) | Y | Y | N |
| Microsoft Works | Y | Y | N |
| Microsoft Works (Macintosh) | Y | N | N |
| Microsoft Works Communication (Mac) | Y | N | N |
| Milestone Document | Y | N | N |
| MORE Database Outliner (Mac) | Y | N | N |
| Mozilla XML User Interface Language (XUL) XML format | Y | N | N |
| MultiMate | Y | N | N |
| Multimate Advantage | Y | N | N |
| MultiMate Advantage Footnote | Y | N | N |
| File Format | Detection | Content Extraction | Meta Data Extraction |
|---|---|---|---|
| MultiMate Advantage II | Y | N | N |
| MultiMate Advantage II Footnote | Y | N | N |
| Multimate Footnote | Y | N | N |
| MXML UI markup language XML format | Y | N | N |
| Navy DIF | Y | N | N |
| NBI Async Archive | Y | N | N |
| NBI Net Archive | Y | N | N |
| NIEM-Conformant XML | Y | N | N |
| NIOS TOP | Y | N | N |
| Oasis Open Document Format | Y | Y | Y |
| OASIS XML Common Biometric Format (XCBF) | Y | N | N |
| ODA/ODIF (FOD 26) | Y | N | N |
| ODA/ODIF (FOD 36) | Y | N | N |
| ODA/ODIF Ql11 | Y | N | N |
| ODA/ODIF Ql12 | Y | N | N |
| ODF Drawing/Graphics flat XML format | Y | N | N |
| ODF Text | Y | N | N |
| ODF Text Flat XML format | Y | N | N |
| ODF Text Master | Y | N | N |
| ODF Text Template | Y | N | N |
| ODF Text Web | Y | N | N |
| OLIDIF (Olivetti) | Y | N | N |
| Omni Outliner | Y | Y | N |
| OneNote Alternative Packaging Format | Y | N | N |
| Open Document format (OpenOffice1/StarOffice6.7) Writer Master document XML | Y | N | N |
| Open eBook (OEBPS) XML format | Y | N | N |
| OpenOffice Writer/LibreOffice Writer | Y | Y | Y |
| Open Publication Structure eBook | Y | Y | Y |
| File Format | Detection | Content Extraction | Meta Data Extraction |
|---|---|---|---|
| Pages | Y | N | N |
| Pages (Legacy) | Y | N | N |
| PDF Forms Data Format | Y | N | N |
| PDF XML Forms Data Format | Y | N | N |
| Philips Script | Y | N | N |
| PKCS #7 cryptographic format | Y | N | N |
| Portfolio PDF File | |||
| PRIMEWORD | Y | N | N |
| Pronunciation Lexicon Specification (PLS) XML format | Y | N | N |
| Q and A for DOS | Y | N | N |
| Q and A for Windows | Y | N | N |
| Quadratron Q-One V1.93J | Y | N | N |
| Quadratron Q-One V2.0 | Y | N | N |
| RDF/XML format | Y | N | N |
| Really Simple Discovery (RSD) XML format | Y | N | N |
| Rights Management Services (RMS)-protected format | Y | N | N |
| RMS-Protected Microsoft Word Documents (Legacy) | Y | N | N |
| RSS syndication XML format | Y | N | N |
| SAMNA Word IV | Y | N | N |
| Scribe markup language and word processing system | Y | N | N |
| Search/Retrieve via URL (SRU) XML format | Y | N | N |
| SGML | Y | N | N |
| Skype Log | Y | Y | N |
| SPARQL Query Results XML format | Y | N | N |
| Speech Recognition Grammar Specification (SRGS) XML format | Y | N | N |
| Speech Synthesis Markup Language (SSML) XML format | Y | N | N |
| StarOffice Writer (3, 4, 5) | Y | Y | N |
| StarOffice Writer (6, 7, 8, 9) | Y | Y | Y |
| File Format | Detection | Content Extraction | Meta Data Extraction |
|---|---|---|---|
| Synchronization Markup Language (SyncML) XML format | Y | N | N |
| Synchronized Multimedia Integration Language (SMIL) XML format | Y | N | N |
| Systems Biology Markup Language (SBML) XML format | Y | N | N |
| Tab-separated values (TSV) file | Y | N | N |
| Targon Word | Y | N | N |
| TCR (Text Compression for Reader) eBook format | Y | N | N |
| Texas Instruments CCXML target configuration XML format | Y | N | N |
| Text Encoding Initiative (TEI) XML format | Y | N | N |
| Uniplex | Y | N | N |
| USENET | Y | N | N |
| Verity XML | Y | N | N |
|
VoiceXML (VXML) XML format |
Y | N | N |
| Volkswriter | Y | N | N |
| WANG PC | Y | N | N |
| WANG WITA | Y | N | N |
| WANG WPS | Y | N | N |
| Word Connection | Y | N | N |
| WordERA | Y | N | N |
| WordMARC | Y | N | N |
| WordPad (thru 2003) | Y | Y | P |
| WordPerfect | Y | N | N |
| WordPerfect Configuration File | Y | N | N |
| WordPerfect Driver | Y | N | N |
| WordPerfect Graphics (version 1) | Y | N | N |
| WordPerfect Hyphenation Dictionary | Y | N | N |
| WordPerfect Macro | Y | N | N |
| WordPerfect Miscellaneous File | Y | N | N |
| File Format | Detection | Content Extraction | Meta Data Extraction |
|---|---|---|---|
| WordPerfect Resource File | Y | N | N |
| WordPerfect Spelling Dictionary | Y | N | N |
| WordPerfect Thesaurus | Y | N | N |
| WordPerfect VAX | Y | N | N |
| WordStar | Y | N | N |
| WordStar 2000 | Y | N | N |
| WordStar for Windows file | Y | N | N |
| WriteNow | Y | N | N |
| Writing Assistant | Y | N | N |
| XAML Browser Application (XBAP) format | Y | N | N |
| Xerox 860 | Y | N | N |
| Xerox DocuWorks | Y | N | N |
| Xerox Writer | Y | N | N |
| XML | Y | N | N |
| XML Paper Specification | Y | Y | N |
| XML Shareable Playlist Format (XSPF) | Y | N | N |
| XyWrite/Nota Bene | Y | Y | N |
| Yahoo Instant Messenger | Y | Y | N |
| YIN XML format | Y | N | N |