Word processing formats
File Format | Detection | Content Extraction | Meta Data Extraction |
---|---|---|---|
Ability WP | Y | N | N |
Ability Write (later versions) | Y | N | N |
ACT | Y | N | N |
Adobe InDesign document | Y | N | N |
Adobe InDesign IDML format | Y | N | N |
Adobe FrameMaker | Y | N | N |
Adobe FrameMaker Interchange | Y | Y | N |
Adobe PDF | Y | Y | Y |
Adobe XML Data Package format | Y | N | N |
AES Multiplus Comm | Y | N | N |
Android Binary XML (compressed by aapt) format | Y | N | N |
Apple iBooks format | Y | N | N |
Apple iChat Log | Y | Y | N |
Apple iWork Pages (’08, ’09) | Y | Y | Y |
Apple iWork Pages (’13, ’16, iCloud 2018) | Y | Y | N |
Applix Alis | Y | N | N |
Applix Asterix | Y | N | N |
Applix Words | Y | Y | N |
AT&T DjVu format | Y | N | N |
Atom Syndication Format | Y | N | N |
Broad Band eBook (BBeB) in LRF format | Y | N | N |
Calamus Desktop Publishing | Y | N | N |
Chemical Markup Language (CML) XML format | Y | N | N |
COMET TOP Word | Y | N | N |
Convergent Technologies DEF Comm. | Y | N | N |
Corel WordPerfect Macintosh | Y | Y | N |
Corel WordPerfect Windows | Y | Y | P |
WordPerfect (version 5) | Y | N | N |
WordPerfect (version 6 and higher) | Y | N | N |
WordPerfect Graphics (version 1) | Y | N | N |
CPT Communication | Y | N | N |
Data Point Vistaword | Y | N | N |
DCS | Y | N | N |
DEC WPS PLUS | Y | N | N |
DECdx | Y | N | N |
DG CEOwrite | Y | N | N |
DG Common Data Stream | Y | N | N |
Digital Document Interchange Format (DDIF) | Y | N | N |
Digitally Signed PDF file (Native) | Y | N | N |
DisplayWrite | Y | Y | N |
DNAML DNL eBook | Y | N | N |
DSA101 | Y | N | N |
e-Szigno signed XML document | Y | N | N |
EBCDIC-encoded XML file | Y | N | N |
EBCDIC Text | Y | N | N |
eFax | Y | N | N |
Electronic Publication | Y | N | N |
Enable | Y | N | N |
Encrypted Microsoft OneNote Files | Y | N | N |
Envoy | Y | N | N |
Extensible Data Format (XDF) XML format | Y | N | N |
Extensible Stylesheet Language Transformations (XSLT) format | Y | N | N |
Folio Flat File | Y | Y | Y |
Founder Chinese E-paper Basic | Y | Y | N |
Foxmail email format | Y | N | N |
Fujitsu Oasys | Y | Y | P |
Haansoft Hangul | Y | Y | Y |
Hangul HWPX document | Y | N | N |
Health Level 7 | Y | Y | Y |
HP Word PC | Y | N | N |
IBM 1403 Line Printer | Y | N | N |
IBM DCA/RFT | Y | Y | N |
IBM DCA-FFT | Y | N | N |
IBM DCF Script | Y | N | N |
Ichitaro Compressed | Y | N | N |
Informix SmartWare II | Y | N | N |
Interleaf | Y | N | N |
Java Network Launching Protocol | Y | N | N |
JustSystems Ichitaro | Y | Y | P |
Lotus AMI Pro | Y | Y | P |
Lotus AMI Pro Style Sheet | Y | Y | P |
Lotus Notes CDF | Y | N | N |
Lotus Word Pro (96, 97, R9) | Y | Y | P |
Lotus SmartMaster | Y | Y | N |
Lotus Organizer documents | Y | N | N |
Lyrix | Y | N | N |
Machine-Readable Cataloging (MARC) XML format | Y | N | N |
Macromedia Flash FLA Project File OLE format | Y | N | N |
MacWrite | Y | N | N |
MacWrite II | Y | N | N |
MASS-11 | Y | N | N |
Metadata Encoding and Transmission Standard (METS) XML format | Y | N | N |
Metadata Object Description Schema (MODS) XML format | Y | N | N |
Metalink XML format | Y | N | N |
Microsoft Database Markup Language XML document | Y | N | N |
Microsoft Excel HTML format | Y | N | N |
Microsoft FrontPage macro file format | Y | N | N |
Microsoft Help file | Y | N | N |
Microsoft Office Groove | Y | N | N |
Microsoft Office Word Files | Y | Y | N |
Microsoft Office Word Macro-enabled Files (OOXML) | Y | Y | Y |
Microsoft OneNote | Y | N | N |
Microsoft Outlook vCard Contact Files | Y | N | N |
Microsoft Pocket Word | Y | N | N |
Microsoft Publisher | Y | Y | Y |
Microsoft Windows Cardfile address book format | Y | N | N |
Microsoft Windows Sticky Notes format | Y | N | N |
Microsoft Windows Write | Y | Y | N |
Microsoft Word (UNIX) | Y | N | N |
Microsoft Word 2007 Flat XML | Y | Y | Y |
Microsoft Word for Macintosh | Y | Y | Y |
Microsoft Word HTML format | Y | N | N |
Microsoft Word Windows (1.0, 2.0) | Y | Y | N |
Microsoft Word Windows (all other versions) | Y | Y | Y |
Microsoft Word PC (incl. Glossary, Stylesheet) | Y | Y | N |
Microsoft Works | Y | Y | N |
Microsoft Works (Macintosh) | Y | N | N |
Microsoft Works Communication (Mac) | Y | N | N |
Milestone Document | Y | N | N |
MORE Database Outliner (Mac) | Y | N | N |
Mozilla XML User Interface Language (XUL) XML format | Y | N | N |
MultiMate | Y | N | N |
MultiMate Advantage | Y | N | N |
MultiMate Advantage Footnote | Y | N | N |
MultiMate Advantage II | Y | N | N |
MultiMate Advantage II Footnote | Y | N | N |
MultiMate Footnote | Y | N | N |
MXML UI markup language XML format | Y | N | N |
Navy DIF | Y | N | N |
NBI Async Archive | Y | N | N |
NBI Net Archive | Y | N | N |
NIEM-Conformant XML | Y | N | N |
NIOS TOP | Y | N | N |
OASIS Open Document Format | Y | Y | Y |
OASIS XML Common Biometric Format (XCBF) | Y | N | N |
ODA/ODIF (FOD 26) | Y | N | N |
ODA/ODIF (FOD 36) | Y | N | N |
ODA/ODIF Ql11 | Y | N | N |
ODA/ODIF Ql12 | Y | N | N |
ODF Drawing/Graphics flat XML format | Y | N | N |
ODF Text | Y | N | N |
ODF Text Flat XML format | Y | N | N |
ODF Text Master | Y | N | N |
ODF Text Template | Y | N | N |
ODF Text Web | Y | N | N |
OLIDIF (Olivetti) | Y | N | N |
Omni Outliner | Y | Y | N |
OneNote Alternative Packaging Format | Y | N | N |
Open Document format (OpenOffice1/StarOffice6.7) Writer Master document XML | Y | N | N |
Open eBook (OEBPS) XML format | Y | N | N |
OpenOffice Writer/LibreOffice Writer | Y | Y | Y |
Open Publication Structure eBook | Y | Y | Y |
Pages | Y | N | N |
Pages (Legacy) | Y | N | N |
PDF Forms Data Format | Y | N | N |
PDF XML Forms Data Format | Y | N | N |
Philips Script | Y | N | N |
PKCS #7 cryptographic format | Y | N | N |
Portfolio PDF File | | | |
PRIMEWORD | Y | N | N |
Pronunciation Lexicon Specification (PLS) XML format | Y | N | N |
Q and A for DOS | Y | N | N |
Q and A for Windows | Y | N | N |
Quadratron Q-One V1.93J | Y | N | N |
Quadratron Q-One V2.0 | Y | N | N |
RDF/XML format | Y | N | N |
Really Simple Discovery (RSD) XML format | Y | N | N |
Rights Management Services (RMS)-protected format | Y | N | N |
RMS-Protected Microsoft Word Documents (Legacy) | Y | N | N |
RSS syndication XML format | Y | N | N |
SAMNA Word IV | Y | N | N |
Scribe markup language and word processing system | Y | N | N |
Search/Retrieve via URL (SRU) XML format | Y | N | N |
SGML | Y | N | N |
Skype Log | Y | Y | N |
SPARQL Query Results XML format | Y | N | N |
Speech Recognition Grammar Specification (SRGS) XML format | Y | N | N |
Speech Synthesis Markup Language (SSML) XML format | Y | N | N |
StarOffice Writer (3, 4, 5) | Y | Y | N |
StarOffice Writer (6, 7, 8, 9) | Y | Y | Y |
Synchronization Markup Language (SyncML) XML format | Y | N | N |
Synchronized Multimedia Integration Language (SMIL) XML format | Y | N | N |
Systems Biology Markup Language (SBML) XML format | Y | N | N |
Tab-separated values (TSV) file | Y | N | N |
Targon Word | Y | N | N |
TCR (Text Compression for Reader) eBook format | Y | N | N |
Texas Instruments CCXML target configuration XML format | Y | N | N |
Text Encoding Initiative (TEI) XML format | Y | N | N |
Uniplex | Y | N | N |
USENET | Y | N | N |
Verity XML | Y | N | N |
VoiceXML (VXML) XML format | Y | N | N |
Volkswriter | Y | N | N |
WANG PC | Y | N | N |
WANG WITA | Y | N | N |
WANG WPS | Y | N | N |
Word Connection | Y | N | N |
WordERA | Y | N | N |
WordMARC | Y | N | N |
WordPad (through 2003) | Y | Y | P |
WordPerfect | Y | N | N |
WordPerfect Configuration File | Y | N | N |
WordPerfect Driver | Y | N | N |
WordPerfect Graphics (version 1) | Y | N | N |
WordPerfect Hyphenation Dictionary | Y | N | N |
WordPerfect Macro | Y | N | N |
WordPerfect Miscellaneous File | Y | N | N |
WordPerfect Resource File | Y | N | N |
WordPerfect Spelling Dictionary | Y | N | N |
WordPerfect Thesaurus | Y | N | N |
WordPerfect VAX | Y | N | N |
WordStar | Y | N | N |
WordStar 2000 | Y | N | N |
WordStar for Windows file | Y | N | N |
WriteNow | Y | N | N |
Writing Assistant | Y | N | N |
XAML Browser Application (XBAP) format | Y | N | N |
Xerox 860 | Y | N | N |
Xerox DocuWorks | Y | N | N |
Xerox Writer | Y | N | N |
XML | Y | N | N |
XML Paper Specification | Y | Y | N |
XML Shareable Playlist Format (XSPF) | Y | N | N |
XyWrite/Nota Bene | Y | Y | N |
Yahoo Instant Messenger | Y | Y | N |
YIN XML format | Y | N | N |