XML::Simple is a Perl module that makes it really easy to read and write XML
files.
XML::Simple was originally developed for the purpose of reading and writing
config files in XML format (which offers various advantages over say .INI
format). Having said that, many people find it useful for other purposes.
This is a port of pdftohtml, which converts pdf-files into nicely
formatted html, combined with png images.
The HTML generated uses frames to emulate the content listing from
the pdf.
txt2tags is a utility to turn simply formatted text into markup (e.g., HTML).
Zorba is a general purpose XQuery processor implementing in C++ the W3C family
of specifications. The query processor has been designed to be embeddable in a
variety of environments such as other programming languages extended with XML
processing capabilities, browsers, database servers, XML message dispatchers,
or smartphones. Zorba can be accessed through APIs from C, C++, Ruby, Python,
Java, and PHP. Zorba runs on most platforms and is available under the Apache
license v2.
The Siemens PocketReader is a handheld OCR scanner, this is the Linux
software which can download text from it.
Permute is a simple program to read lines from standard input and
generate all possible permutations of those lines.
agate-dbf adds read support for dbf files to agate
agate-excel adds read support for Excel files (xls and xlsx) to agate.
Paraphrasing the website:
Python-DSV is a Python module for importing and exporting DSV (delimiter
separated values) files. DSV is a generalization of CSV (comma separated
values). CSV is a common file format used by many programs to import and
export data.
Features:
- Pure Python
- Optional wxPython GUI
- Optional heuristics for determining file format
- Handles embedded quotes, delimiters and newlines
- Customizable error handling
- Simple to use
- Portable