On this page I present some document conversion scripts and programs I wrote.
Note that the scripts that use Microsoft Office or Corel WordPerfect Office usually accept other known input formats too; those that do not can easily be modified to do so.

Input Format | Output Encoding | Script or Program |
---|---|---|
Plain text, any | Plain text, ASCII | 2ASCII.ps1 |
Plain text, any | Plain text, Unicode | 2Unicode.ps1 |
Plain text, any | Plain text, UTF-8 | 2UTF8.ps1 |
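
To illustrate what such an encoding conversion involves, here is a minimal Python sketch. It is not the code of the PowerShell scripts listed above; the function names `detect_encoding` and `reencode` are hypothetical, and the fallback to Windows-1252 for text without a byte-order mark is an assumption that may not suit every input.

```python
import codecs

# Byte-order marks, longest first, so UTF-32 LE is not mistaken for UTF-16 LE.
_BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32"),
    (codecs.BOM_UTF32_BE, "utf-32"),
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
]


def detect_encoding(raw: bytes, fallback: str = "cp1252") -> str:
    """Guess the encoding from a BOM, then try UTF-8, then use the fallback."""
    for bom, name in _BOMS:
        if raw.startswith(bom):
            return name
    try:
        raw.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        return fallback  # assumption: legacy Windows text


def reencode(raw: bytes, target: str = "utf-8") -> bytes:
    """Decode with the detected encoding and encode again in the target one.

    Characters the target encoding cannot represent become '?'.
    """
    text = raw.decode(detect_encoding(raw), errors="replace")
    return text.encode(target, errors="replace")


def convert_file(src: str, dst: str, target: str = "utf-8") -> None:
    """Convert one file; bytes are used throughout so line endings are kept."""
    with open(src, "rb") as infile:
        data = infile.read()
    with open(dst, "wb") as outfile:
        outfile.write(reencode(data, target))
```

Calling `convert_file("input.txt", "output.txt", "ascii")` would correspond roughly to the "any to ASCII" row, with the caveat that the detection step is only a heuristic.
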

Input Format | Extension | Output Encoding | Script or Program | Requirements |
---|---|---|---|---|
EPUB | .epub | Any | Epub2Txt.exe | |
OpenOffice | .odt | Any | ODT2Txt.exe | |
OpenOffice | .odt | Any | Docx2Txt.exe | Microsoft Word |
Pages | .pages | Any | Pages2Txt.bat | 7-Zip, Tesseract OCR (uses OCR, hence single-page, simple layouts only) |
PDF | .pdf | ASCII | PDF2Txt.bat | GhostScript |
Rich Text | .rtf | Any | Rtf2Txt.exe | |
Rich Text | .rtf | Any | Word2Txt.exe | |
Word | .doc | Any | Word2Txt.exe | |
Word | .doc | UTF-8 | Doc2Txt.exe | |
Word | .docx | Any | Word2Txt.exe | |
Word | .docx | Any | Docx2Txt.exe | Microsoft Word |
Word | (all) | Any | Word2Txt.exe | Microsoft Word |
WordPerfect | .wpd | Any | Word2Txt.exe | |
WordPerfect | .wpd | Any | WPD2Txt.exe | |
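
As a rough illustration of what extracting plain text from a .docx file involves, here is a minimal Python sketch. It does not reproduce the programs listed above (Docx2Txt.exe, for instance, is listed as requiring Microsoft Word); instead it reads the WordprocessingML inside the .docx ZIP container directly. The function name `docx_to_text` is hypothetical, and the sketch ignores headers, footers, footnotes and text boxes.

```python
import zipfile
import xml.etree.ElementTree as ET

# Namespace of the main WordprocessingML elements (w:p, w:r, w:t, ...).
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_to_text(source) -> str:
    """Return the body text of a .docx file, one line per paragraph.

    `source` may be a file name or a binary file-like object.
    """
    with zipfile.ZipFile(source) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    paragraphs = []
    for para in root.iter(W_NS + "p"):
        parts = []
        # Only look at direct children of runs (w:r): there w:tab is a tab
        # character, whereas elsewhere it can be a tab stop definition.
        for run in para.iter(W_NS + "r"):
            for child in run:
                if child.tag == W_NS + "t":
                    parts.append(child.text or "")
                elif child.tag == W_NS + "tab":
                    parts.append("\t")
                elif child.tag == W_NS + "br":
                    parts.append("\n")
        paragraphs.append("".join(parts))
    return "\n".join(paragraphs)
```

The result is a Python string, so it can be written out in any encoding, for example with `open("output.txt", "w", encoding="utf-8")`.
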

Input Format | Extension | Output Format | Script or Program | Requirements |
---|---|---|---|---|
JPEG | .jpg, .jpeg | PDF | JPGs2PDF.bat | GhostScript |
OpenOffice | .odt | PDF | Word2PDF.vbs | Microsoft Word |
OpenOffice | .odt | PDF | ODT2PDF.exe | Either LibreOffice and its SDK or OpenOffice and its SDK, and Java runtime |
PDF | .pdf | PDF | PDF2JPG2PDF.bat | GhostScript |
Rich Text | .rtf | PDF | Word2PDF.vbs | Microsoft Word |
Word | .doc, .docx | PDF | Word2PDF.vbs | Microsoft Word |
WordPerfect | .wpd | PDF | WP2PDF.vbs | Corel WordPerfect Office |

Input Format | Extension | Output Format | Script or Program | Requirements |
---|---|---|---|---|
Excel | .xls, .xlsx | XML | Excel2XML.vbs | Microsoft.XMLDOM |
OpenOffice | .odt | Any known Office format | Word2Any.vbs | Microsoft Word |
OpenOffice | .odt | RTF | Word2RTF.vbs | Microsoft Word |
OpenOffice | .odt | Word .docx | ODT2Docx.exe | Either LibreOffice and its SDK or OpenOffice and its SDK, and Java runtime |
OpenOffice | .odt | XPS | Word2XPS.vbs | Microsoft Word |
PDF | .pdf | JPG | PDF2JPG.bat | GhostScript |
Word | .doc, .docx | Any known Office format | Word2Any.vbs | Microsoft Word |
Word | .doc, .docx | HTML | WSaveAs.vbs | Microsoft Word |
Word | .doc, .docx | OpenOffice .odt | Word2OpenOffice.exe | Microsoft Word |
Word | .doc, .docx | RTF | Word2RTF.vbs | Microsoft Word |
Word | .doc, .docx | XPS | Word2XPS.vbs | Microsoft Word |
WordPerfect | .wpd | HTML | WPSaveAs.vbs | Corel WordPerfect Office |
WordPerfect | .wpd | Word .doc | WPD2Doc.vbs | Corel WordPerfect Office |

page last modified: 2024-09-12