Snowball string-processing language compiler and runtime for Elixir.
Snowball is a small string-processing language designed for creating stemming algorithms in information retrieval. This package provides:
a compiler pipeline that parses
.sblsource files and emits Elixir modules implementing the described stemmer (Snowball.Lexer,Snowball.Preprocessor,Snowball.Analyser,Snowball.Generator),the
mix snowball.genMix task that drives the pipeline over a directory of.sblfiles, andthe
Snowball.Runtimeruntime support module that generated stemmer modules call into at run time (string buffer manipulation, character classes,find_amongdispatch tables, and so on).
This package does not ship any pre-compiled stemmers itself. For
the canonical Snowball algorithms compiled to Elixir modules, see the
companion text_stemmer package.
Generating a stemmer
mix snowball.gen --module-prefix MyApp.Stemmers \
--output-dir lib/my_app/stemmers \
--algorithms-dir priv/snowballSee Mix.Tasks.Snowball.Gen for the full set of options.