This module is for writing RSS files, simply. It transparently handles all
the unpleasant details of RSS, like proper XML escaping, and also has a good
number of Do-What-I-Mean features, like not changing the modtime on a
written-out RSS file if the file content hasn't changed, and like
automatically removing any HTML tags from content you might pass in.
This module isn't meant to have the full expressive power of RSS; instead, it
provides functions that are most commonly needed by RSS-writing programs.
Soothsayer is an intelligent predictive text entry platform. Soothsayer
exploits redundant information embedded in natural languages to generate
predictions. Soothsayer's modular and pluggable architecture allows its
language model to be extended and customized to utilize statistical,
syntactic, and semantic information sources.
A predictive text entry system attempts to improve ease and speed of
textual input. Word prediction consists in computing which word tokens
or word completions are most likely to be entered next. The system
analyses the text already entered and combines the information thus
extracted with other information sources to calculate a set of most
probable tokens.
RapidXml is an attempt to create the fastest XML parser possible,
while retaining useability, portability and reasonable W3C
compatibility. It is an in-situ parser written in modern C++, with
parsing speed approaching that of strlen function executed on the
same data.
RapidXml has been around since 2006, and is being used by lots of
people. HTC uses it in some of its mobile phones.
If you are looking for a stable and fast parser, look no further.
Integration with your project will be trivial, because entire library
is contained in a single header file, and requires no building or
configuration.
Rarian is designed to be a replacement for scrollkeeper. It is
currently undergoing heavy development. As of writing, rarian can be
installed in place of scrollkeeper and everything will work okay.
Rarian manages documentation metadata (as specified by the Open Source
Metadata Framework (OMF) and provides a simple API to allow help browsers
to find, sort, and search the document catalog. It will also be able to
communicate with catalog servers on the Net to search for documents which
are not on the local system.
WordNet is a powerful lexical reference system that combines aspects of
dictionaries and thesauri with current psycholinguistic theories of
human lexical memory. It is produced by the Cognitive Science Laboratory
at Princeton University, under the direction of Professor George Miller.
In WordNet, words are defined and grouped into various related sets of
synonyms. Not only is the system valuable to the casual user as a
powerful thesaurus and dictionary, but also to the researcher as one of
the few freely available, lexical databases. WordNet is available via an
on-line interface and also as easy-to-compile C source code for Unix.
Tokyo Dystopia is a full-text search system. You can search lots of records
for some records including specified patterns. The characteristic of
Tokyo Dystopia is the following.
* High performance of search
* High scalability of target documents
* Perfect recall ratio by character N-gram method
* Phrase matching, prefix matching, suffix matching, and token matching
* Multilingualism with Unicode
* Layered Architecture of APIs
Tokyo Dystopia is available on platforms which have API conforming to C99 and
POSIX. Tokyo Dystopia is a free software licensed under the GNU Lesser General
Public License.
XSV is a command-line tool for performing schema-validity
assessment of XML documents in accord with the
W3C XML Schema specification, second edition.
XSV (XML Schema Validator) is an open source (GPLed) work-in-progress
attempt at a conformant schema-aware processor, as defined by
XML Schema Part 1: Structures, Second Edition of 28 October 2004.
It has been developed at the Language Technology Group of the Human
Communication Research Centre in the School of Informatics at the
University of Edinburgh, with support for one of us (Thompson)
from the World Wide Web Consortium.
[ excerpt (with adaptations) from developer's website ]
The aim of BINS is to generate HTML photo albums.
Some of the functionalities of BINS are:
- album can contains other albums (sub albums): the album can have
a tree structure ;
- generation of a thumbnail and of scaled images of each picture ;
- number and size of scaled pictures can be personalized, in pixels
or percentage of the original image ;
- several description fields (date, location, etc...) can be
associated with the pictures ;
- use the EXIF data structure found on some JPEG (usually, those
produced by digital cameras) to fill automatically some fields
(date and time for example).
fcgiwrap is a simple server for running CGI applications over FastCGI. It hopes
to provide clean CGI support to Nginx (and other web servers that may need it).
Features:
* very lightweight (84KB of private memory per instance)
* fixes broken CR/LF in headers
* handles environment in a sane way (CGI scripts get HTTP-related env. vars
* from FastCGI parameters and inherit all the others from fcgiwrap's
* environment)
* no configuration, so you can run several sites off the same fcgiwrap pool
* passes CGI stderr output to fcgiwrap's stderr (this is by design but
* stderr could be also passed to FastCGI stderr stream)
[ excerpt from developer's web site ]
Download videos from various Flash-based video hosting sites, without
having to use the Flash player. Handy for saving videos for watching
offline, and means you don't have to keep upgrading Flash for sites
that insist on a newer version of the player.
YouTube, eHow, Brightcove (used by many sites like Channel 4, Daily
Telegraph ...), BBC (news, etc), Metacafe, 5min, Google, fliqz,
nicovideo, vimeo, Blip, Break, Collegehumor, Muzu, Sevenload,
Megavideo, Wat.tv. Also includes a 'generic' method which works on
many other sites.