Kanaya

Simple, unoptimized regular expression engine written for learning purposes. Kanaya generates it's LR(1) parser from a grammar, then takes a regular expression's parse tree to an equivalent non-deterministic finite automata, then computes the automata's acceptance of stdin. It's a transparent example of two important classes of automata, non-deterministic finite automaton (NFA, for recognizing w/stdin as a member in re/argument) and a non-deterministic push down automaton (PDA, for taking the regular expression to a parse tree), as well as a demonstration of deriving an NFA from a regular expression.

Grammar

The grammar of the regular expressions recognized by kanaya is as follows:

P -> 'a' | ... | 'z' | 'A' | ... | 'Z' | '0' | ... | '1' | (Q)
S -> P | S*
T -> S | T.S
Q -> T | Q+T

This grammar while bnf-ish is virtually identical to the one that's augmented and used to generate kanaya's parser. you can see it in code with the name kanaya_grammar.

Building

Look at my other repository circes. This project needs to be linked with that one.

Examples

$ echo xyyyyyyyy | ./bin/kanaya 'x.(y*+z)'
accepted: 1, used: 35432, 34kib

$ echo 101abcbcba | ./bin/kanaya '(a+b+c)*.(0+1)*.(a+b+c)*'
accepted: 1, used: 92432, 90kib

$ echo 10101110101abac10 | ./bin/kanaya '(a+b+c)*.(0+1)*.(a+b+c)*'
accepted: 0, used: 94424, 92kib

$ echo abccba11 | ./bin/kanaya '(a+b+c)*.(0+1)*.(a+b+c)*'
accepted: 1, used: 98456, 96kib

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
src		src
.clang-format		.clang-format
.clangd		.clangd
README.md		README.md
makefile		makefile

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Kanaya

Grammar

Building

Examples

About

Uh oh!

Languages

gittyhubacc/kanaya

Folders and files

Latest commit

History

Repository files navigation

Kanaya

Grammar

Building

Examples

About

Resources

Uh oh!

Stars

Watchers

Forks

Languages