Last year, I wrote an AVX512/GFNI implementation of the "short" core for Groestl. (Ignore the "long" core code in g512.rs; it didn't work correctly last I checked.) Now that these intrinsics are stabilized, I would love to bring this up to the project's standards and contribute it as an optimized backend.