Hyperscan is a high-performance regular expression matching library.
To use, add the following line to Cargo.toml under [dependencies]:
hyperscan = "0.2"

use hyperscan::prelude::*;

fn main() {
    // Match "test" case-insensitively and report the start offset of each match.
    let pattern = pattern! {"test"; CASELESS | SOM_LEFTMOST};
    let db: BlockDatabase = pattern.build().unwrap();
    let scratch = db.alloc_scratch().unwrap();
    let mut matches = vec![];

    db.scan("some test data", &scratch, |id, from, to, _flags| {
        // `from` and `to` are byte offsets forming the half-open range [from, to).
        println!("found pattern #{} @ [{}, {})", id, from, to);
        matches.push(from..to);
        Matching::Continue
    }).unwrap();

    assert_eq!(matches, vec![5..9]);
}

Starting with Hyperscan v5.0, several new APIs and flags have been introduced.
rust-hyperscan uses the latest version of the API by default, providing new features such as Literal.
To work with Hyperscan v4.x, disable the v5 feature at compile time by turning off the default features:
[dependencies.hyperscan]
version = "0.2"
default-features = false
features = ["full"]

To improve regular expression compatibility, Hyperscan v5.0 and later provide Chimera, a PCRE-compatible matching library.
To enable Chimera support, manually download PCRE 8.41 or later, unpack it into the Hyperscan 5.x source directory, then build and install Hyperscan.
$ cd hyperscan-5.3.0
$ wget https://ftp.pcre.org/pub/pcre/pcre-8.44.tar.gz
$ mkdir pcre && tar xvf pcre-8.44.tar.gz -C pcre --strip-components=1
$ mkdir build && cd build
$ cmake .. -DCMAKE_INSTALL_PREFIX=`pwd`

Then point the HYPERSCAN_ROOT environment variable at the Hyperscan installation directory to enable the chimera feature.
$ HYPERSCAN_ROOT=<CMAKE_INSTALL_PREFIX> cargo build

The chimera feature should also be enabled in Cargo.toml:
[dependencies]
hyperscan = { version = "0.2", features = ["chimera"] }

Note: The Chimera library does not support dynamic linking, so the static feature is enabled automatically when chimera is enabled.
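
If the build cannot find Hyperscan, it may help to confirm that HYPERSCAN_ROOT points at a real install prefix before running cargo build. The following is a minimal, std-only sketch of such a check. `check_hyperscan_root` is a hypothetical helper and not part of rust-hyperscan, and the `include/hs` and `lib`/`lib64` subdirectories are assumptions about a typical CMake install layout.

```rust
use std::env;
use std::path::{Path, PathBuf};

/// Hypothetical helper: checks that `root` looks like a Hyperscan install prefix.
/// Assumes headers live under `include/hs` and libraries under `lib` or `lib64`.
fn check_hyperscan_root(root: &Path) -> Result<PathBuf, String> {
    let include = root.join("include").join("hs");
    if !include.is_dir() {
        return Err(format!("missing header directory: {}", include.display()));
    }
    let has_lib = ["lib", "lib64"].iter().any(|d| root.join(d).is_dir());
    if !has_lib {
        return Err(format!("no lib or lib64 directory under {}", root.display()));
    }
    Ok(include)
}

fn main() {
    match env::var("HYPERSCAN_ROOT") {
        Ok(root) => match check_hyperscan_root(Path::new(&root)) {
            Ok(include) => println!("headers found in {}", include.display()),
            Err(e) => eprintln!("HYPERSCAN_ROOT looks wrong: {}", e),
        },
        Err(_) => eprintln!("HYPERSCAN_ROOT is not set"),
    }
}
```

The check only inspects the directory layout. It does not verify the Hyperscan version or whether Chimera was actually built.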
As of version 0.2, rust-hyperscan links dynamically by default. If you need to link a static library, use the static feature.
[dependencies]
hyperscan = { version = "0.2", features = ["static"] }

Hyperscan provides a standalone runtime library that can be used separately. If you don't need to compile regular expressions at runtime, you can use runtime mode to reduce the size of the executable and drop the C++ dependencies.
[dependencies.hyperscan]
version = "0.2"
default-features = false
features = ["runtime"]