LexLuthor (lex_luthor v0.1.2) View Source

LexLuthor is a Lexer in Elixir (say that 10 times fast) which uses macros to generate a reusable lexers. Good times.

LexLuthor is a state based lexer, meaning that it keeps a state stack which you can push states on and pop states off the stack, which are used to filter the applicable rules for a given state. For example:

iex> defmodule StringLexer do
...>   use LexLuthor
...>   defrule ~r/^'/,              fn(_) -> :STRING end
...>   defrule ~r/^[^']+/, :STRING, fn(e) -> { :string, e } end
...>   defrule ~r/^'/,     :STRING, fn(_) -> nil end
...> end
...> StringLexer.lex("'foo'")
{:ok, [%LexLuthor.Token{column: 1, line: 1, name: :string, pos: 1, value: "foo"}]}

Rules are defined by a regular expression, an optional state (as an atom) and an action in the form of an anonymous function.

When passed the string 'foo', the lexer starts in the :default state, so it filters for rules in the default state (the first rule, as it doesn't specify a state), then it filters the available rules by the longest matching regular expression. In this case, since we have only one rule (which happens to match) it's automatically the longest match.

Once the longest match is found, then it's action is executed and the return value matched:

  • If the return value is a single atom then that atom is assumed to be a state and push onto the top of the state stack.
  • If the return value is a two element tuple then the first element is expected to be an atom (the token name) and the second element a value for this token.
  • If the return value is nil then the top state is popped off the state stack.

If lexing succeeds then you will receive an :ok tuple with the second value being a list of LexLuthor.Token structs.

If lexing fails then you will receive an :error tuple which a reason and position.

Link to this section Summary

Functions

Define a lexing rule applicable to the default state.

Define a lexing rule for a specific state.

Link to this section Functions

Link to this macro

defrule(regex, action)

View Source (macro)

Define a lexing rule applicable to the default state.

  • regex a regular expression for matching against the input string.
  • action the function to execute when this rule is applied.
Link to this macro

defrule(regex, state, action)

View Source (macro)

Specs

defrule(Regex.t(), atom(), (String.t() -> atom() | nil | {atom(), any()})) ::
  {:ok, non_neg_integer()}

Define a lexing rule for a specific state.

  • regex a regular expression for matching against the input string.
  • state the lexer state in which this rule applies.
  • action the function to execute when this rule is applied.