
Adding CETT-based thresholding #53

Closed
kaselby wants to merge 5 commits

Conversation

@kaselby (Collaborator) commented Jul 10, 2025

Description

Adds basic support for CETT-based thresholding and refactors activation capture.
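
For context, CETT (cumulative error of tail truncation) picks, per layer, the largest activation threshold whose truncation error stays under a target ratio; activations are first captured during forward passes, then thresholds are fit offline. Below is a minimal sketch of that idea, assuming activations are captured with PyTorch forward hooks and the error is measured directly on the activations (the PR may measure it on layer outputs instead); `epsilon`, `captured`, `make_hook`, and `cett_threshold` are illustrative names, not this repo's API:

```python
import torch

captured = {}

def make_hook(name):
    # Forward hook that stashes a layer's output for offline threshold fitting.
    def hook(module, inputs, output):
        captured[name] = output.detach()
    return hook

def cett_threshold(acts: torch.Tensor, epsilon: float = 0.02) -> float:
    """Largest threshold t with CETT(t) <= epsilon, where
    CETT(t) = ||a * 1[|a| <= t]|| / ||a|| over one layer's activations."""
    mags, _ = torch.sort(acts.flatten().abs())   # ascending magnitudes
    tail_sq = torch.cumsum(mags ** 2, dim=0)     # ||zeroed tail||^2 per candidate t
    cett = torch.sqrt(tail_sq / tail_sq[-1].clamp_min(1e-12))
    ok = torch.nonzero(cett <= epsilon)          # candidates within the error budget
    return mags[ok[-1]].item() if ok.numel() > 0 else 0.0
```

At inference time, any activation whose magnitude falls below its layer's threshold would be zeroed, trading a bounded relative error for sparsity.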

kaselby and others added 5 commits July 10, 2025 13:19
* Update to log each separate lora size independently and correctly identify which checkpoints to load based on lora size.

Signed-off-by: Kira Selby <[email protected]>

* Store the best F1 score for each layer in a central kv store and save the best-performing model per layer. Add a flag to resume training only from the best-performing lora sizes for each layer (a sketch of this bookkeeping follows the commit list).

Signed-off-by: Kira Selby <[email protected]>

* Bugfixes, and move saving of the final predictor into the layerwise trainer so that it is not saved after an early exit caused by errors.

Signed-off-by: Kira Selby <[email protected]>

* Remove restart_if_missing and add documentation for load_best_only

Signed-off-by: Kira Selby <[email protected]>

---------

Signed-off-by: Kira Selby <[email protected]>
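
The per-layer bookkeeping described in the second commit above might look roughly like this; the central kv store is modeled as a plain dict, and every name here (`kv`, `record_result`, `sizes_to_resume`, the commented `save_checkpoint`) is a hypothetical stand-in for whatever the trainer actually uses:

```python
from typing import Dict, List, Tuple

# layer index -> (best F1 seen so far, lora size that achieved it)
kv: Dict[int, Tuple[float, int]] = {}

def record_result(layer: int, lora_size: int, f1: float) -> None:
    # Keep only the best-performing lora size per layer.
    best_f1, _ = kv.get(layer, (-1.0, -1))
    if f1 > best_f1:
        kv[layer] = (f1, lora_size)
        # save_checkpoint(layer, lora_size)  # persist the best model per layer

def sizes_to_resume(layer: int, all_sizes: List[int], load_best_only: bool) -> List[int]:
    # With load_best_only set, resume training only from each layer's best size.
    if load_best_only and layer in kv:
        return [kv[layer][1]]
    return list(all_sizes)
```
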
@kaselby closed this Jul 26, 2025