This module takes a list of documents (in English) and
builds a simple in-memory search engine using a vector
space model. Documents are stored as PDL objects, and
after the initial indexing phase, the search should be
very fast. This implementation applies a rudimentary
stop list to filter out very common words, and uses a
cosine measure to calculate document similarity.
All documents above a user-configurable similarity
threshold are returned.
The Regex::Presuf module can be used to build regular expressions out
of 'word lists', lists of strings. The regular expression matches the
same words as the word list. These regular expressions normally run
few dozen percentages faster than a simple-minded '|'-concatenation of
the words.
Perl module for Embeddable Fulltext Search Engine.
Sphinx::Config is a Perl module to read, modify and write configuration file of
Sphinx search engine.
Sphinx::Manager provides utilities to start, stop, restart, and reload the
Sphinx search engine binary (searchd), and to run the Sphinx indexer program.
The utilities are designed to handle abnormal conditions, such as PID files not
being present when expected, and so should be robust in most situations.
Sphinx search engine API Perl client.
Spork - a Perl module for creating standalone HTML slideshows from Kwiki markup
Spreadsheet::ParseExcel makes you to get information from Excel95,
Excel97, Excel2000, Excel 4 formats.
When this module is use'd, it causes regexes in the current namespace to act as
if the /xms flags had been applied to them.
Read the data from a spreadsheet