v0.2.4
What's Changed
Optimizer and rewriter
- Turn inliner into a pass and use it in rewriter & optimizer by @justinchuby in #2149
- Add fusion rules for com.microsoft.Attention by @shubhambhokare1 in #2148
- Allow fuse_xformers to return a count of different fusions applied by @shubhambhokare1 in #2159
- GQA Fusion by @gramalingam in #2161
- Optimization to avoid trying multiple attention-based fusions by @shubhambhokare1 in #2168
- Handle empty rewrite rules in rewrite function by @justinchuby in #2164
ONNX IR
- [IR] Expose the Tape module by @justinchuby in #2127
Other Changes
- Create auto release notes by @justinchuby in #2170
Full Changelog: v0.2.3.1...v0.2.4