RXP is a very fast validating XML parser written by Richard Tobin
of the University of Edinburgh. It complies fully with the W3C test
suites (although we have compiled it without Unicode support for
the time being). pyRXP is a wrapper around this which constructs a
lightweight in-memory "tuple tree" in a single call. This structure
is the lightest one we could define in Python, and it is constructed
entirely in C code, resulting in unprecedented speed. It is a core
part of ReportLab's forthcoming XML toolkit, which aims to offer
simple, fast and pythonic tools for common XML processing tasks.
Python binding for Syck, which, according is, according to the web site:
Syck is an extension for reading and writing YAML swiftly in popular
scripting languages. As Syck loads the YAML, it stores the data
directly in your language's symbol table. This means speed. This
means power. This means Do not disturb Syck because it is so focused
on the task at hand that it will slay you mortally if you get in its
way.
This module adds custom roles to Sphinx for linking to resources
on BitBucket projects.
RTF is the Microsoft Richtext Format, a more portable, mostly-ASCII
formatting language that is exported by word processors like MS Word.
These files generally have the extension .rtf, but occassionally have
.doc extensions as well. This parser is from the Microsoft spec,
"ported" to Unix systems.
This is htmlsplit.rb, a Ruby library to split an HTML document into an
array of tags and contents.
PyCHM is a package that provides bindings for Jed Wing's CHMLIB library.
The MIME (Multipurpose Internet Mail Extensions) specification
(RFC 1521 and successors) defines a mechanism for encoding text consisting
primarily of printable ASCII characters, but which may contain characters
(for example, accented letters in the ISO 8859 Latin-1 character set) which
cannot be encoded as 7-bit ASCII or are non-printable characters which may
confuse mail transfer agents.
This is the m17n IMEngine for IBus (Intelligent Input Bus) framework. It
allows you to use official and contributed keyboard layouts of the m17n
project (available via devel/m17n-db and textproc/m17n-contrib) through
standard IBus interface. m17n-lib currenty supports input of more than 60
languages with more than 70 language-specific input methods.
Queequeg is a tiny English grammar checker for non-native speakers who
are not used to verb conjugation and number agreement. We especially
focus on people who're writing academic papers or business documents
where thorough checking is required. We aim to reduce this laborious
work with automated checking.
Redet allows the user to construct regular expressions and test them against
input data by executing any of a variety of search programs, editors, and
programming languages that make use of regular expressions. When a suitable
regular expression has been constructed it may be saved to a file. redet stands
for Regular Expression Development and Execution Tool. For each program, a
palette showing the available regular expression syntax is provided. Selections
from the palette may be copied to the regular expression window with a mouse
click. Users may add their own definitions to the palette via their
initialization file. Redet also keeps a list of the regular expressions
executed, from which entries may be copied back into the regular expression
under construction. The history list is saved to a file and restored on
startup, so it persists across sessions. So long as the underlying program
supports Unicode, redet allows UTF-8 Unicode in both test data and regular
expressions