ArCHMage is the extensible reader/decompiler of files in CHM format(Microsoft
HTML help, also known as Compiled HTML). ArCHMage is based on chmlib by Jed
Wing and is written on python.
A Perl 5 module for reading in, manipulating, and outputting
bibliographic records in the USMARC format. It handles conversions
from MARC into ASCII (text), Library of Congress MARCMaker, HTML,
and ISBD. Input from MARCMaker format is also supported. Individual
records, fields, indicators, and subfields can be created, modified, and
deleted. It can extract URLs from the 856 field into HTML.
KDiff3 is a program that:
* compares or merges two or three text input files or directories,
* shows the differences line by line and character by character (!),
* provides an automatic merge-facility and
* an integrated editor for comfortable solving of merge-conflicts,
* supports KIO on KDE (allows accessing ftp, sftp, fish, smb etc.),
* Printing of differences,
* Manual alignment of lines,
* Automatic merging of version control history (cvs Log keyword),
* and has an intuitive graphical user interface.
These are namespace-aware XSLT stylesheets for the DocBook DTD and its
derivatives (Simplified DocBook, etc.).
About the project
We plan to build a program that will accept ASCII text as input and generate
International Morse Code as output. The output formats can be:
- . -..- - (text) on the console
Raw audio on /dev/audio (8bit PCM data)
.wav files
.ogg or (proprietary format) compressed audio
International Morse Code
Supported character set includes [A-Za-z] (all downcased as Morse is not case
sensitive), [0-9], ",-.?/" plus a few procedural characters (SK, AR, BT etc).
dbacl is a digramic Bayesian text classifier. Given some text,
it calculates the posterior probabilities that the input resembles
one of any number of previously learned document collections.
It can be used to sort incoming email into arbitrary categories
such as spam, work, and play, or simply to distinguish an English text
from a French text. It fully supports international character sets,
and uses sophisticated statistical models based on the
Maximum Entropy Principle.
Dblatex started as a DB2LaTeX clone. So, why this project? The purpose
is a bit different on these points:
(1) The project is end-user oriented, that is, it tries to hide as much
as possible the latex compiling stuff by providing a single clean
script to produce directly DVI, PostScript and PDF output.
(2) The actual output rendering is done not only by the XSL stylesheets
transformation, but also by a dedicated LaTeX package. The purpose is
to allow a deep LaTeX customisation without changing the XSL
stylesheets.
(3) Post-processing is done by Python, to make publication faster,
convert the images if needed, and do the whole compilation.
denature is a perl program that attempts to convert an HTML page into XSL-FO
which it then passes off to the FOP (Formatted Objects Formatter) to produce a
PDF document.
denature trys to use any included CSS stylesheets to figure out the properties
used in the document. The CSS processing in denature is not very mature and
only handles a limited amount of the available CSS markup. The CSS support
does not handle the contextual entries in a CSS document, and the CSS::Tiny
module requires that all the :'s in a document have a space after them.
XML::STX is a pure Perl implementation of STX processor. Streaming
Transformations for XML (STX) is a one-pass transformation language for
XML documents that builds on the Simple API for XML (SAX).