The POSIX regex backend for regex-base.
A new all-Haskell "tagged" DFA regex engine, inspired by libtre.
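As a rough illustration of how a regex-base backend is typically used, the sketch below matches with the (=~) operator from Text.Regex.TDFA (package regex-tdfa). The helper names firstNumber and splitOnNumber are made up for this example. The POSIX backend exports an operator of the same name from Text.Regex.Posix, so changing the import should be enough to try that backend instead.

```haskell
import Text.Regex.TDFA ((=~))

-- | First run of digits in the input, or "" when there is none.
-- (Hypothetical helper, not part of any library.)
firstNumber :: String -> String
firstNumber s = s =~ "[0-9]+"

-- | (text before, matched text, text after) for the first match.
-- (Hypothetical helper, not part of any library.)
splitOnNumber :: String -> (String, String, String)
splitOnNumber s = s =~ "[0-9]+"

main :: IO ()
main = do
  print (firstNumber "order 66 shipped")
  print (splitOnNumber "order 66 shipped")
  -- The requested result type selects what (=~) returns; Bool only tests for a match.
  print ("no digits here" =~ "[0-9]+" :: Bool)
```

The result type drives the behaviour here: the same pattern yields the matched text, a three-way split, or a plain yes/no answer depending on the annotation.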
This library implements i;unicode-casemap, the simple, non-locale-sensitive
Unicode collation algorithm described in RFC 5051.
Proper Unicode collation can be done using text-icu, but that is a big
dependency that depends on a large C library, and rfc5051 might be
better for some purposes.
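A minimal sketch of sorting with this collation follows. It assumes a release of rfc5051 in which Data.RFC5051.compareUnicode takes Text arguments; older releases are believed to take String, in which case the T.pack calls would be dropped. The helper name sortCaseless is made up for this example.

```haskell
import Data.List (sortBy)
import qualified Data.Text as T
import Data.RFC5051 (compareUnicode)

-- | Sort using the i;unicode-casemap ordering.
-- (Hypothetical helper; assumes compareUnicode :: Text -> Text -> Ordering.)
sortCaseless :: [T.Text] -> [T.Text]
sortCaseless = sortBy compareUnicode

main :: IO ()
main = do
  let names = map T.pack ["banana", "Apple", "cherry", "apple"]
  -- Words that differ only in letter case are expected to collate together.
  mapM_ (putStrLn . T.unpack) (sortCaseless names)
```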
This package provides several functions to quickly search for substrings
in strict or lazy ByteStrings. It also provides functions for breaking or
splitting on substrings and replacing all occurrences of a substring (the
first in case of overlaps) with another. GHC versions before 6.10 are no
longer supported; other compilers are supported only if they provide
BangPatterns. If you need it to work with other compilers, send a feature
request.
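The operations described above can be sketched on strict ByteStrings as follows, using Data.ByteString.Search from the stringsearch package. The wrappers occurrences, fields and commaToSemicolon are hypothetical names introduced only for this example; they convert to and from String to keep the demonstration short.

```haskell
import qualified Data.ByteString.Char8 as BC
import qualified Data.ByteString.Lazy.Char8 as LC
import Data.ByteString.Search (breakOn, indices, replace, split)

-- | Start offsets of every (possibly overlapping) occurrence of a pattern.
occurrences :: String -> String -> [Int]
occurrences pat target = indices (BC.pack pat) (BC.pack target)

-- | Split a comma-separated line into its fields.
fields :: String -> [String]
fields = map BC.unpack . split (BC.pack ",") . BC.pack

-- | Replace every comma with "; ". Note that replace returns a lazy ByteString.
commaToSemicolon :: String -> String
commaToSemicolon = LC.unpack . replace (BC.pack ",") (BC.pack "; ") . BC.pack

main :: IO ()
main = do
  print (occurrences "aba" "abababa")
  print (fields "a,b,c")
  putStrLn (commaToSemicolon "a,b,c")
  -- breakOn returns the text before the first match and the rest, match included.
  print (breakOn (BC.pack "=") (BC.pack "key=value"))
```

For real workloads the String round trips would be dropped and the ByteString values passed through directly.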
TagSoup is a library for parsing HTML/XML. It supports the HTML 5
specification, and can be used to parse either well-formed XML, or
unstructured and malformed HTML from the web. The library also provides
useful functions to extract information from an HTML document, making it
ideal for screen-scraping.
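A small scraping sketch, assuming the Text.HTML.TagSoup module from the tagsoup package: it collects the href attribute of every anchor tag, even when the markup is not well formed. The function name links and the sample input are made up for this example.

```haskell
import Text.HTML.TagSoup (fromAttrib, isTagOpenName, parseTags)

-- | The href value of every <a> tag, in document order.
-- (Hypothetical helper; an anchor without href contributes "".)
links :: String -> [String]
links html =
  [ fromAttrib "href" tag
  | tag <- parseTags html
  , isTagOpenName "a" tag
  ]

main :: IO ()
main =
  -- The input deliberately leaves several tags unclosed.
  mapM_ putStrLn (links "<p><a href=\"/a\">first</a><b><a href=\"/b\">second")
```

Because parseTags produces a flat list of tags rather than a tree, unclosed elements do not need any special handling in this kind of extraction.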
Tag-stream is a library for parsing HTML/XML to a token stream. It can
parse unstructured and malformed HTML from the web. It also provides an
Enumeratee which can parse HTML in a streaming fashion, which means it
consumes constant memory.
The texmath library provides functions to read and write TeX math,
presentation MathML, and OMML (Office Math Markup Language, used in
Microsoft Office). Support is also included for converting math formats to
pandoc's native format (allowing conversion, via pandoc, to a variety of
different markup formats). The TeX reader supports basic LaTeX and AMS
extensions, and it can parse and apply LaTeX macros.
Library for working with URIs, including parsing, rendering, merging,
and escaping.
This package provides parsing and rendering functions for XML. It is
based on the datatypes found in the xml-types package. This package is
broken up into the following modules:
* Text.XML: DOM-based parsing and rendering. This is the most commonly
used module.
* Text.XML.Cursor: A wrapper around Text.XML which allows bidirectional
traversing of the DOM, similar to XPath.
* Text.XML.Unresolved: A slight modification to Text.XML which does not
require all entities to be resolved at parsing. The datatypes are
slightly more complicated here, and therefore this module is only
recommended when you need to deal directly with raw entities.
* Text.XML.Stream.Parse: Streaming parser, including some streaming
parser combinators.
* Text.XML.Stream.Render: Streaming renderer.
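To illustrate how Text.XML and Text.XML.Cursor fit together, here is a minimal sketch that parses a document and pulls out the text of every title element, in the XPath-like style mentioned above. The function name titles and the sample document are made up for this example. parseText_ throws an exception on malformed input, so code that handles untrusted data would more likely use parseText and inspect the Either result.

```haskell
{-# LANGUAGE OverloadedStrings #-}
import qualified Data.Text as T
import qualified Data.Text.IO as TIO
import qualified Data.Text.Lazy as TL
import Text.XML (def, parseText_)
import Text.XML.Cursor (content, element, fromDocument, ($//), (&/))

-- | Text content of every <title> element below the root.
-- (Hypothetical helper, not part of the package.)
titles :: TL.Text -> [T.Text]
titles src =
  let cursor = fromDocument (parseText_ def src)
  in cursor $// element "title" &/ content

main :: IO ()
main =
  mapM_ TIO.putStrLn
    (titles "<lib><book><title>A</title></book><book><title>B</title></book></lib>")
```

The cursor expression reads roughly like the XPath query //title/text(): ($//) walks all descendants, element filters by name, and (&/) steps to the children to collect their text.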
Hamlet-style quasiquoter for XML content.