uni2ascii and ascii2uni convert between UTF-8 Unicode and any of a variety of
7-bit ASCII equivalents including: hexadecimal and decimal HTML numeric
character references, \u-escapes, standard hexadecimal, and raw hexadecimal.
sansi was written to strip the ANSI control sequences in files, or
output, often, but not limited to those generated by compilers. I
found it difficult to visually grope/grep the output of script(1)
sessions. Especially with the advent of clang. While it's nice to
replay the script(1) sessions to view the highlighted messages.
It's near impossible to read it inline in your favorite pager, or
text editor;
Enter sansi.
<< wdiff >>
From man page of wdiff:
wdiff compares two files, finding which words have been
deleted or added to old_file to get new_file. A word is
anything between whitespace. The output is collected and
used to produce an annotated copy of new_file on standard
output. Suitable annotations produce a nice display of
word differences between the original files.
Example:
text-a
I like FreeBSD.
text-b
I love FreeBSD.
% wdiff -n text-a text-b
I [-like-] {+love+} FreeBSD.
The Xerces Java Parser is a complete implementation of the parser related
portions of JAXP 1.4 and also brings Xerces into compliance with SAX 2.0.2, the
DOM Level 3 Core and Load/Save W3C Recommendations, the XML Inclusions
(XInclude) Version 1.0 W3C Recommendation and the XML Schema 1.0 Structures and
Datatypes Second Edition W3C Recommendations.
Xerces2 is the next generation of high performance, fully compliant XML parsers
in the Apache Xerces family. This version of Xerces introduces the Xerces
Native Interface (XNI), a complete framework for building parser components and
configurations that is extremely modular and easy to program.
The Apache Xerces2 parser is the reference implementation of XNI but other
parser components, configurations, and parsers can be written using the Xerces
Native Interface.
This package provides a simple indentation engine for XML. It is
intended for use in situations where the full power of the popular
PSGML package (DTD parsing, syntax checking) is not required.
These tools are used to convert XML and HTML to and from a line-oriented
format more amenable to processing by classic Unix pipeline processing
tools, like grep, sed, awk, cut, shell scripts, and so forth.
The line-oriented format used by these tools looks very much like, but
is not quite precisely the same as XPath.
xmlformat is a configurable formatter (or "pretty-printer") for
XML documents. It provides control over indentation, line-breaking,
and text wrapping. These properties can be defined on a per-element basis.
An XML pretty printer created to format XML that doesn't make use of
mixed content. In the default mode each element is put on a separate
line with consistent indentation. It can also separate attributes onto
individual lines, sort attributes in a specified or alphabetic order,
expand self closing tags, and more.
Note that the distribution calls this tool "xmlpp", but it has been
renamed so as not to conflict with an xmlpp already in the ports tree.
The Slides doctype and stylesheets are for making presentations.