rmmseg-cpp is a high performance Chinese word segmentation utility for
Ruby. It features full "Ferret":http://ferret.davebalmain.com/ integration
as well as support for normal Ruby program usage.
rmmseg-cpp is a re-written of the original
RMMSeg(http://rmmseg.rubyforge.org/) gem in C++. RMMSeg is written
in pure Ruby. Though I tried hard to tweak RMMSeg, it just consumes
lots of memory and the segmenting process is rather slow.
The interface is almost identical to RMMSeg but the performance is
much better. This gem is always preferable in production
use. However, if you want to understand how the MMSEG segmenting
algorithm works, the source code of RMMSeg is a better choice than
this.
Fldiff is a graphical diff program that shows the differences between two
files/directories, or a file/directory and a CVS or Subversion repository.
It is inspired by xdiff (Motif-based) and xxdiff (Qt-based), whose choice of
GUI toolkit has hampered their portability to many of the systems I work with.
RT is a simple and human-readable table format.
RTtool is a converter from RT into various formats.
RT can be incorporated into RD.
At this time, RTtool can convert RT into HTML and plain text.
To convert into plain text, you need w3m.
sgrep (structured grep) is a tool for searching and indexing text, SGML,XML
and HTML files and filtering text streams using structural criteria. The data
model of sgrep is based on regions, which are nonempty substrings of text.
Regions are typically occurrences of constant strings, SGML-tags, or meaningful
text elements, which are recognizable through some delimiting strings or the
builtin SGML, XML and HTML parser. Regions can be arbitrarily long, arbitrarily
overlapping, and arbitrarily nested.
Sgrep is a convenient tool for making queries to almost any kind of text files
with some well kown structure. These include programs, mail folders, news
folders, HTML, SGML, etc... With relatively simple queries you can display mail
messages by their subject or sender, extract titles or links or any regions
from HTML files, function prototypes from C or make complex queries to SGML
files based on the DTD of the file.
This is a Ruby class for processing XSLT.
RubyPants is a Ruby port of the smart-quotes library SmartyPants. The original
SmartyPants is a free web publishing plug-in for Movable Type,
Blosxom, and BBEdit that easily translates plain ASCII punctuation characters
into "smart" typographic punctuation HTML entities.
Aspell Slovak dictionary.
Slovak hunspell dictionaries