The Spreadsheet Library is designed to read and write Spreadsheet Documents.
As of version 0.6.0, only Microsoft Excel compatible spreadsheets are
supported. Spreadsheet is a combination/complete rewrite of the
Spreadsheet::Excel Library by Daniel J. Berger and the ParseExcel Library by
Hannes Wyss. Spreadsheet can read, write and modify Spreadsheet Documents.
wv is a library that allows access to Microsoft Word files.
It can load and parse the Word 6-9 formats (Word 6, 95, 97, 2000).
Several converter executables called wvWare are also provided:
wvHtml, wvLatex, wvCleanLatex, wvDVI, wvPS, wvPDF,
wvText, wvAbw, wvWml, wvMime
Note: wvHtml was previously known as MSWordView.
This is a SCIM IMEngine module which uses m17n library as the backend. It
allows you to use keyboard layouts available via devel/m17n-db and
textproc/m17n-contrib through standard SCIM interface. m17n-lib currently
supports input of more than 60 languages with more than 70 language
specific input methods.
`sgrep' (structured grep) is a tool for searching text files and
filtering text streams using structural criteria. Complex criteria
can be specified as macros using M4.
Sgrep was created by:
Jani Jaakkola, email:Jani.Jaakkola@helsinki.fi
Pekka Kilpelainen, email: Pekka.Kilpelainen@helsinki.fi
From the XP homepage:
XP is an XML 1.0 parser written in Java. It is fully conforming: it
detects all non well-formed documents.
XP has the following design goals: Conformance and correctness, high
performance and a layered structure. It is currently non-validating but can
parse all external entities.
For more details, please see the XP homepage:
The Yacc to LaTeX utility takes (hopefully) any yacc source file,
and derives an Extended Backus-Naur Form (EBNF) description from
it. This EBNF is written out as LaTeX source. The output is a LaTeX
"longtable" environment, that can be included in any LaTeX document,
typically using an \input{} statement.
Sphinx is an open source full text search server, designed from the
ground up with performance, relevance (aka search quality), and
integration simplicity in mind. It's written in C++ and works on Linux
(RedHat, Ubuntu, etc), Windows, MacOS, Solaris, FreeBSD, and a few
other systems.
Sphinx lets you either batch index and search data stored in an SQL
database, NoSQL storage, or just files quickly easily and or index and
search data on the fly, working with Sphinx pretty much as with a
database server.
A variety of text processing features enable fine-tuning Sphinx for
your particular application requirements, and a number of relevance
functions ensures you can tweak search quality as well.
Searching via SphinxAPI is as simple as 3 lines of code, and querying
via SphinxQL is even simpler, with search queries expressed in good
old SQL.
Sphinx clusters scale up to billions of documents and tens of millions
search queries per day, powering top websites such as Craigslist,
DailyMotion, NetLog, etc.
And last but not least, it's open-sourced under GPLv2, and the
community edition is free to use.
Sphinx is an open source full text search server, designed from the
ground up with performance, relevance (aka search quality), and
integration simplicity in mind. It's written in C++ and works on Linux
(RedHat, Ubuntu, etc), Windows, MacOS, Solaris, FreeBSD, and a few
other systems.
Sphinx lets you either batch index and search data stored in an SQL
database, NoSQL storage, or just files quickly easily and or index and
search data on the fly, working with Sphinx pretty much as with a
database server.
A variety of text processing features enable fine-tuning Sphinx for
your particular application requirements, and a number of relevance
functions ensures you can tweak search quality as well.
Searching via SphinxAPI is as simple as 3 lines of code, and querying
via SphinxQL is even simpler, with search queries expressed in good
old SQL.
Sphinx clusters scale up to billions of documents and tens of millions
search queries per day, powering top websites such as Craigslist,
DailyMotion, NetLog, etc.
And last but not least, it's open-sourced under GPLv2, and the
community edition is free to use.
This is the template engine for the Ada Web Server. It is modular and
therefore can be split out of AWS and used on its own.
As it was designed for generating web pages, it's function is to parse
a page template and replace tokens with specified values. This template
engine is amazingly fast due to its concurrent cached compiled templates
support.
tidyp is a fork of tidy on SourceForge at http://tidy.sf.net. The library name
is "tidyp", and the command-line tool is also "tidyp" but all internal API stays
the same.
tidyp will validate your HTML, and output cleaned-up HTML.