This class knows how to read two treebank formats, the Penn format
and the Chomsky Normal Form (CNF) format. These formats differ in
how they handle terminal nodes. The Penn format places pre-terminal
part of speech tags in the left-hand position of a
parenthesis-delimited pair, just like it does non-terminal nodes.
The CNF format attaches pre-terminal tags to the word with an
underscore.
MARC::Charset allows you to turn MARC-8 encoded strings into UTF-8
strings. MARC-8 is a single byte character encoding that predates
unicode, and allows you to put non-Roman scripts in MARC bibliographic
records.
MARC::Lint provides a mechanism for validating MARC records.
The MARC::* series of modules create a simple object-oriented abstraction
of MARC record handling.
MARC::Record is the core class for representing a single MARC record.
MKDoc is a web content management system written in Perl which focuses on
standards compliance, accessiblity and usability issues, and multi-lingual
websites.
At MKDoc Ltd we have decided to gradually break up our existing commercial
software into a collection of completely independent, well-documented,
well-tested open-source CPAN modules.
Ultimately we want MKDoc code to be a coherent collection of module
distributions, yet each distribution should be usable and useful in
itself.
MKDoc::XML is part of this effort.
You could help us and turn some of MKDoc's code into a CPAN module. You
can take a look at the existing code at http://download.mkdoc.org/.
If you are interested in some functionality which you would like to see as
a standalone CPAN module, send an email to
<mkdoc-modules@lists.webarch.co.uk>
This libary can serve as an advanced lexer for (GNU) makefiles. It parses
makefiles as "documents" and the parsing is lossless. The results are data
structures similar to DOM trees. The DOM trees hold every single bit of the
information in the original input files, including white spaces, blank lines and
makefile comments. That means it's possible to reproduce the original makefiles
from the DOM trees. In addition, each node of the DOM trees is modifiable and
so is the whole tree, just like the PPI module used for Perl source parsing and
the HTML::TreeBuilder module used for parsing HTML source.
This module provides a parser which parses and interprets (though
not fully) LaTeX documents and returns a tree-based representation
of what it finds. This tree is a LaTeX::TOM::Tree. The tree contains
LaTeX::TOM::Node nodes.
This module should be especially useful to anyone who wants to do
processing of LaTeX documents that requires extraction of plain-text
information, or altering of the plain-text components (or
alternatively, the math-text components).
This is a parser for Makefiles. At this very early stage, the parser
only supports a limited set of features, so it may not recognize some
advanced features provided by certain make tools like GNU make.
This is a new try to use Devel::Declare to change the Perl5
language. It learns pretty much everything from Template::Declare,
and has similar interface. With only one difference: how element
attributes are defined.
Lingua::Conjunction exports a single subroutine, conjunction, that
converts a list into a properly punctuated text string.
Seamus Venasse <svenasse@polaris.ca>