Open
Description
Descriptions:
Based on current priorities, here are a set of items will be released next with experimental results:
- layerwise intervention weight sharing to further shrink down #params.
- ReFT+LoRA for arithmetic reasoning related steering.
- Inference-time intervention manipulations (e.g., clamping, transfer)
Using this ticket to track progress.