Msort sorts files in sophisticated ways. Records may be fixed size,
newline-separated blocks, or terminated by any specified character.
Key fields may be selected by position, tag, or character range. For
each key, distinct exclusions, multigraphs, substitutions, and a sort
order may be defined or locale collation rules used. Comparisons may
be lexicographic, numeric, numeric string, hybrid, random, by string
length, angle, date, time, month name, or ISO8601 timestamp. Keys may
be reversed so as to generate reverse dictionaries. Optional keys are
supported. Unicode is supported, including full case-folding. Msort
itself has a somewhat complex command line interface, but may be
driven by an optional GUI.
MultiMarkdown, or MMD, is a tool to help turn minimally marked-up plain
text into well formatted documents, including HTML, PDF (by way of
LaTeX), OPML, or OpenDocument (specifically, Flat OpenDocument or
'.fodt', which can in turn be converted into RTF, Microsoft Word, or
virtually any other word-processing format).
MMD is a superset of the Markdown syntax, originally created by John
Gruber. It adds multiple syntax features (tables, footnotes, and
citations, to name a few), in addition to the various output formats
listed above (Markdown only creates HTML). Additionally, it builds in
'smart' typography for various languages (proper left- and right-sided
quotes, for example).
NOTE: To use the mmd2pdf script, you must install print/latexmk.
Nux is a small, straightforward, and surprisingly effective open-source
extension of the XOM XML library. Nux is geared towards versatile embedded
integration and interchange, in particular for high-throughput server container
environments (e.g. large-scale Peer-to-Peer messaging network infrastructures
over high-bandwidth networks, scalable MOMs, etc). But its simplicity also
makes it useful for client side XML query/transformation workflow pipelines.
Features include:
- Seamless W3C XQuery support for XOM.
- Efficient and flexible pools and factories for XQueries, XSL Transforms, as
well as Builders that validate against various schema languages, including
W3C XML Schemas, DTDs, RELAX NG, Schematron, etc.
- For simple and complex continuous queries and/or transformations over very
large or infinitely long XML input, a convenient streaming path filter API
combines full XQuery support with straightforward filtering.
- Glue for integration with JAXB and for queries over ill-formed HTML.
- All this is rock-solid, dependable, well documented, and ships in a jar file
that weighs just 60 KB.
An ocaml wrapper for the Expat XML parsing library.
TyXML is an OCaml library that allows you to build XML trees whose validity is
insured by the typechecker. It supports XHTML 1.0 and 1.1, HTML5 and SVG
(partial).
The Open Text Summarizer is an open source tool for summarizing texts.
The program reads a text and decides which sentences are important and
which are not.
RNV is an implementation of Relax NG Compact Syntax validator in ANSI C.
LT XML is an integrated set of XML tools and a developers' tool-kit,
including a C-based API.
The LT XML tool-kit includes stand-alone tools for a wide range of
processing of well-formed XML documents, including searching and
extracting, down-translation (e.g. report generation, formatting),
tokenising and sorting.
Sequences of tool applications can be pipelined together to achieve
complex results.
For special purposes beyond what the pre-constructed tools can
achieve, extending their functionality and/or creating new tools
is easy using the LT XML API. Minimal applications require less
than one-half page of C code to express.
LT XML provides two views of an XML file; one as a flat stream of
markup elements and text; a second as a sequence of tree-structured
XML elements. The two views can be mixed, allowing great flexibility
in the manipulation of XML documents. It also includes a powerful,
yet simple, querying language, which allows the user to quickly and
easily select those parts of an XML document which are of interest.
RXP is a validating XML parser written in C. It is licensed under the
GNU Public Licence.
This is the home of the MathML 2.0 XML Schema from the MathML Working Group.
All the details needed to use it should be covered in appendix A of
MathML 2.0, 2nd edition
This schema is not normative. It is only provided as a tool to validate MathML
instances, without any guarantee on the accuracy of the results. This version
will be updated gradually to incorporate corrections or changes.