Graphical Diff for X
LICENSE: GPL2 or later
SRILM is a toolkit for building and applying statistical language models (LMs),
primarily for use in speech recognition, statistical tagging and segmentation.
It has been under development in the SRI Speech Technology and
Research Laboratory since 1995. The toolkit has also greatly benefitted from
its use and enhancements during the Johns Hopkins University/CLSP summer
workshops in 1995, 1996, and 1997
SRILM consists of the following components:
* A set of C++ class libraries implementing language models,
supporting data stuctures and miscellaneous utility functions.
* A set of executable programs built on top of these libraries to
perform standard tasks such as training LMs and testing them on
data, tagging or segmenting text, etc.
* A collection of miscellaneous scripts facilitating minor related tasks.
TagSoup - Just Keep On Truckin'
TagSoup is a SAX-compliant parser written in Java that, instead of parsing
well-formed or valid XML, parses HTML as it is found in the wild: poor,
nasty and brutish, though quite often far from short. TagSoup is designed
for people who have to process this stuff using some semblance of a rational
application design. By providing a SAX interface, it allows standard XML
tools to be applied to even the worst HTML. TagSoup also includes
a command-line processor that reads HTML files and can generate either
clean HTML or well-formed XML that is a close approximation to XHTML.
Tex2im is a simple tool that converts LaTeX formulas into high resolution
pixmap graphics for inclusion in text processors or presentations. I
encountered the problem that the formulas generated by the editors of common
office packages usually were the ugliest part of my scientific presentations;
on the other hand I didn't want to use latex for my transparencies. On the
latex side I'm aware of the slitex and foiltex packages, nevertheless I
consider them to be masochistic. EPS import can be nice, but commonly you get
either display or printing problems. Also, often its nice just to copy
formulas out of you latex documents.
Converts a LaTeX file containing equations(s) into a GIF file for
embedding into an HTML document. The black and white image of the
equation is created at high resolution and then resampled to the
target resolution to antialias what would otherwise be jagged
edges.
TinyXML-2 is a simple, small, efficient, C++ XML parser that can be easily
integrated into other programs. It attempts to be flexible, but correct. It
does not rely on exceptions or RTTI. It has UTF-8 support, but does not parse or
use DTDs or XSL. It doesn't have the STL support of TinyXML-1, but uses less
memory, has a proper namespace, and is faster.
TOPIC markup provides a minimal context for plain text databases that
can be read/modified by most text editors, and easily parsed as well.
Use as a classic Unix filter or interactively with included REPL.
GTK+ modules of uim input method.
GTK+ 3.x modules of uim input method.
PHP bindings for the mRss library for parsing, writing and creating RSS files or
streams.