This is a port of HTMLDOC, which can:
* Convert HTML files to PDF or PostScript
* Generate a table-of-contents for books
* Generate indexed HTML files
* Generate files on-the-fly for web applications, from the
  command-line for batch jobs, or from a GUI for interactive work
HTMLDOC provides:
* A command-line interface for batch and WWW applications
* A graphical interface for interactive work
In my opinion, HTMLDOC is *fast*, compared to the other solutions I've seen.
HTMLDOC is available under the GPL.
Commercial support is available from the author.
Hyper Estraier is a full-text search system. You can search a large collection
of documents for those that contain specified words. If you run a web site, it
is useful as a search engine for the pages of your site. It is also useful as
a search utility for mailboxes and file servers.
Hyper Estraier has the following characteristics:
* High search performance
* High scalability in the number of target documents
* Perfect recall ratio through the N-gram method
* Phrase search, attribute search, and similarity search
* Multilingual support with Unicode
* Independence from file format and repository
* Simple and powerful API
* Support for a P2P architecture
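
The N-gram method mentioned in the list above can be sketched as follows. This is a minimal illustration in Python, not Hyper Estraier's implementation or API: the function names (`ngrams`, `build_index`, `search`), the choice of n = 2, and the sample documents are all assumptions made for the example. The idea is that indexing every overlapping character sequence, rather than whole words, lets a query match anywhere in the text, so no matching document is missed because of word segmentation.

```python
from collections import defaultdict

def ngrams(text, n=2):
    """Return every overlapping n-character substring of text."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]

def build_index(docs, n=2):
    """Map each n-gram to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for gram in ngrams(text.lower(), n):
            index[gram].add(doc_id)
    return index

def search(index, docs, query, n=2):
    """Return the sorted ids of documents containing query as a substring."""
    query = query.lower()
    grams = ngrams(query, n)
    if not grams:
        # Query shorter than n: fall back to checking every document.
        candidates = set(docs)
    else:
        # A matching document must contain every n-gram of the query.
        candidates = set.intersection(*(index.get(g, set()) for g in grams))
    # Confirm the n-grams occur contiguously, which removes false positives.
    return sorted(d for d in candidates if query in docs[d].lower())

# Hypothetical sample documents, for illustration only.
docs = {
    1: "full-text search system",
    2: "hyphenation library",
    3: "researching parsers",
}
index = build_index(docs)
print(search(index, docs, "search"))  # → [1, 3]
```

Document 3 is found even though "search" appears only inside "researching", which a word-based index would typically miss. The final substring check trades a little speed for precision.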
Hyphen is a hyphenation library that uses converted TeX hyphenation patterns.
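
To show how TeX-style hyphenation patterns are typically applied, here is a small Python sketch of Liang's pattern-matching technique. It is not the Hyphen library's code or API. The names `TOY_PATTERNS`, `parse_pattern`, `build_table`, and `hyphenate` are hypothetical, and the pattern list is a tiny hand-picked set used only for illustration. In a pattern, a digit between letters scores that position: odd scores permit a break, even scores forbid one, and the highest score at each position wins.

```python
import re

# Toy pattern set, chosen for illustration only (not a real pattern file).
TOY_PATTERNS = ["hy3ph", "he2n", "hena4", "hen5at", "1na", "n2at",
                "1tio", "2io", "o2n"]

def parse_pattern(pattern):
    """Split a pattern such as 'hy3ph' into its letters and per-gap scores."""
    letters = re.sub(r"\d", "", pattern)
    # One score before each letter and one after the last; a missing digit is 0.
    levels = [int(d or 0) for d in re.split(r"[.a-z]", pattern)]
    return letters, levels

def build_table(patterns):
    """Map each pattern's letters to its list of scores."""
    return dict(parse_pattern(p) for p in patterns)

def hyphenate(word, table, left_min=2, right_min=2):
    """Split word into pieces at the positions the patterns allow."""
    work = "." + word.lower() + "."   # dots mark the word boundaries
    levels = [0] * (len(work) + 1)
    # Try every substring; keep the maximum score seen at each position.
    for i in range(len(work)):
        for j in range(i + 1, len(work) + 1):
            for k, level in enumerate(table.get(work[i:j], ())):
                levels[i + k] = max(levels[i + k], level)
    pieces, start = [], 0
    # Position c in word corresponds to position c + 1 in work.
    for c in range(left_min, len(word) - right_min + 1):
        if levels[c + 1] % 2 == 1:    # odd score: a break is allowed here
            pieces.append(word[start:c])
            start = c
    pieces.append(word[start:])
    return pieces

table = build_table(TOY_PATTERNS)
print("-".join(hyphenate("hyphenation", table)))  # → hy-phen-ation
```

The `left_min` and `right_min` parameters keep very short fragments from being split off at either end of the word. Their values here are assumptions, not the library's defaults.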
El-Kabong is a high-speed, forgiving, SAX-style HTML parser.
Its aim is to provide consumers with a very fast, clean,
lightweight library which parses HTML quickly, while forgiving
syntactically incorrect tags.
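
The SAX-style, forgiving behaviour described above can be illustrated with Python's standard `html.parser` module, used here purely as a stand-in. This is not El-Kabong's API, and the `EventCollector` class and `parse_events` helper are hypothetical names for the example. A SAX-style parser reports events (start tag, end tag, text) through callbacks as it reads the input instead of building a document tree, and a forgiving one keeps going when tags are unclosed or mismatched.

```python
from html.parser import HTMLParser

class EventCollector(HTMLParser):
    """Record each parse event instead of building a document tree."""

    def __init__(self):
        super().__init__()
        self.events = []

    def handle_starttag(self, tag, attrs):
        self.events.append(("start", tag))

    def handle_endtag(self, tag):
        self.events.append(("end", tag))

    def handle_data(self, data):
        text = data.strip()
        if text:  # skip whitespace-only text
            self.events.append(("text", text))

def parse_events(markup):
    """Feed markup to the parser and return the list of events seen."""
    parser = EventCollector()
    parser.feed(markup)
    parser.close()
    return parser.events

# The <b> element is never closed; parsing continues without an error.
for event in parse_events("<p>one<b>two</p>"):
    print(event)
```

Because no tree is built, memory use stays small and the consumer decides how to treat malformed nesting.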
ITS Tool allows you to translate your XML documents with PO files, using rules
from the W3C Internationalization Tag Set (ITS) to determine what to translate
and how to separate it into PO file messages.
PO files are the standard translation format for GNU and other Unix-like
systems. They present translatable information as discrete messages, allowing
each message to be translated independently. In contrast to whole-page
translation, translating with a message-based format like PO means you can
easily track changes to the source document down to the paragraph. When new
strings are added or existing strings are modified, you only need to update
the corresponding messages.
ITS Tool is designed to make XML documents translatable through PO files by
applying standard ITS rules, as well as extension rules specific to ITS Tool.
ITS also provides an industry standard way for authors to override translation
information in their documents, such as whether a particular element should be
translated.
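
The message-based workflow described above can be sketched in a few lines of Python. This is an illustration of the idea only, not how ITS Tool extracts messages (it applies ITS rules to XML). The helpers `split_messages`, `po_escape`, `po_entry`, and `merge` are hypothetical, and paragraphs separated by blank lines stand in for translatable XML elements.

```python
def split_messages(text):
    """Split text into paragraph-sized messages on blank lines."""
    return [" ".join(p.split()) for p in text.split("\n\n") if p.strip()]

def po_escape(s):
    """Escape backslashes and double quotes for a PO string."""
    return s.replace("\\", "\\\\").replace('"', '\\"')

def po_entry(msgid, msgstr=""):
    """Format one message as a PO entry."""
    return 'msgid "{}"\nmsgstr "{}"\n'.format(po_escape(msgid), po_escape(msgstr))

def merge(translations, source_text):
    """Pair each source message with its existing translation.

    New or modified messages get an empty translation, so only they
    need attention from the translator.
    """
    return [(m, translations.get(m, "")) for m in split_messages(source_text)]

# Hypothetical existing translations and an updated source document.
existing = {"Hello world.": "Bonjour le monde."}
updated_source = "Hello world.\n\nA new paragraph."

for msgid, msgstr in merge(existing, updated_source):
    print(po_entry(msgid, msgstr))
```

In this run the unchanged paragraph keeps its translation, while the added paragraph appears with an empty `msgstr`, which is the per-message change tracking that PO files make possible.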
JaLingo is a free, OS-independent dictionary application.
It understands four dictionary formats:
* DSL
* Mova
* PtkDict/phpMyLingvo MySQL dumps
* Sdictionary (249)
Aspell English dictionaries.
Jaxup defines an interface to update XML documents, through which clients can
work without knowledge of the exact object model that the document uses. The
interface is called Updater, and the idea behind it is the same as with Jaxen's
Navigator interface. In addition, an implementation of xmldb.org's proposed
XUpdate specification is provided. The implementation is in the XUpdate class.
Implementations of the Updater interface are provided for the following object
models:
- DOM
- Dom4j
- JDom
Aspell Esperanto dictionary.
JDictionary is a powerful, multi-platform dictionary application.
JDictionary is plugin-based and has an easy-to-use plugin management system
that can download and install the desired plugins automagically from
the web. Installing or upgrading one or more plugins (even simultaneously)
takes just one click!