imatrix: calculate activation-based statistics for new format (GGUF) imatrices #14891

EAddario · 2025-07-26T16:47:29Z

Following up from #9400 and #12718, I've started tinkering with activation-based statistics, in addition to what's currently available via --show-statistics.

At the moment, I'm exploring three options going from from easy to implement and OK approximation, to some assembly required but fairly accurate:

L2 norm of activation difference: where larger values would suggest the tensor has significantly transformed the input with respect to the previous layer.
KL Divergence reduction using a pre-computed logit file: using a similar approach as described by nostalgebraist in logit lens, and based on a pre-computed logit file (e.g. from a previous llama-perplexity --save-all-logits run)
Given that llama-imatrix already generates the actual logits to compute PPL, use Thông T. Nguyễn's logit prism approach to calculate the exact contribution of each layer to the final logit scores

Sharing with the readers, and in particular @compilade and @jukofyork, in case anyone's willing to double check assumptions and/or suggest alternative approaches I haven't considered.

compilade · 2025-07-26T17:02:21Z

tools/imatrix/imatrix.cpp

+            if (!stat.activations.empty()) {
+                const int32_t nact = (int32_t) stat.activations.size();
+                struct ggml_tensor * in_sum  = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, nact / nmat, nmat);
+                ggml_format_name(in_sum, "%s.in_sum", name.c_str()); // ToDo: consider a better name. 'in_act' maybe?


I think in_sum is fine, this fits with the intention of in_sum2.

compilade · 2025-07-26T17:06:57Z

tools/imatrix/imatrix.cpp

+    std::vector<float>   activations;
    std::vector<float>   values;


It might make sense to rename Stats.values to Stats.in_sum2, and Stats.activations to Stats.in_sum.

It should make it more obvious what maps to what in the resulting GGUF.

Use activations to calculate the stats

09bc7c2

EAddario marked this pull request as draft July 26, 2025 16:47

github-actions bot added the examples label Jul 26, 2025

compilade reviewed Jul 26, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

imatrix: calculate activation-based statistics for new format (GGUF) imatrices #14891

imatrix: calculate activation-based statistics for new format (GGUF) imatrices #14891

Uh oh!

EAddario commented Jul 26, 2025

Uh oh!

compilade Jul 26, 2025

Uh oh!

compilade Jul 26, 2025

Uh oh!

Uh oh!

imatrix: calculate activation-based statistics for new format (GGUF) imatrices #14891

Are you sure you want to change the base?

imatrix: calculate activation-based statistics for new format (GGUF) imatrices #14891

Uh oh!

Conversation

EAddario commented Jul 26, 2025

Uh oh!

compilade Jul 26, 2025

Choose a reason for hiding this comment

Uh oh!

compilade Jul 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!