This framework employs reinforcement learning for the structured pruning of Transformer models, specifically targeting models like LLaMA.
Our approach prunes at multiple granularities, including head pruning and intermediate dimension pruning, which directly reduces memory footprint and computational load and enables acceleration on consumer GPUs. A reinforcement learning agent searches for the pruning strategy, allowing RL-TRIM to balance model size reduction against performance retention and offering a scalable, efficient way to optimize various Transformer architectures.
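To make the search loop concrete, here is a minimal, self-contained sketch of RL-driven head pruning on a toy attention layer. Everything in it is a hypothetical illustration, not the RL-TRIM API: `ToyAttention`, the reconstruction-based reward, and the epsilon-greedy bandit (a stand-in for a full RL agent; AMC, for instance, trains a DDPG agent over per-layer sparsity ratios).

```python
import random

import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)
random.seed(0)

class ToyAttention(nn.Module):
    """Multi-head self-attention whose heads can be masked out structurally."""
    def __init__(self, dim: int = 64, num_heads: int = 8):
        super().__init__()
        self.num_heads, self.head_dim = num_heads, dim // num_heads
        self.qkv = nn.Linear(dim, 3 * dim)
        self.proj = nn.Linear(dim, dim)

    def forward(self, x: torch.Tensor, keep_mask: torch.Tensor) -> torch.Tensor:
        B, T, D = x.shape
        q, k, v = (t.transpose(1, 2) for t in
                   self.qkv(x).view(B, T, 3, self.num_heads, self.head_dim).unbind(2))
        attn = (q @ k.transpose(-2, -1)) * self.head_dim ** -0.5
        out = attn.softmax(-1) @ v                      # (B, H, T, head_dim)
        out = out * keep_mask.view(1, -1, 1, 1)         # zero out pruned heads
        return self.proj(out.transpose(1, 2).reshape(B, T, D))

model = ToyAttention().eval()
x = torch.randn(4, 16, 64)
with torch.no_grad():
    dense_out = model(x, torch.ones(model.num_heads))   # unpruned reference output

def reward(keep_mask: torch.Tensor, lam: float = 0.5) -> float:
    """Trade performance retention against size: -distortion - lam * kept fraction."""
    with torch.no_grad():
        distortion = F.mse_loss(model(x, keep_mask), dense_out).item()
    return -distortion - lam * keep_mask.mean().item()

# Epsilon-greedy bandit over "how many heads to keep" -- a deliberately
# simple agent used here only to illustrate the reward-driven search.
H = model.num_heads
q_val = {k: 0.0 for k in range(1, H + 1)}
count = {k: 0 for k in range(1, H + 1)}
for step in range(300):
    k = random.randrange(1, H + 1) if random.random() < 0.2 else max(q_val, key=q_val.get)
    mask = torch.zeros(H)
    mask[torch.randperm(H)[:k]] = 1.0                   # keep k randomly chosen heads
    count[k] += 1
    q_val[k] += (reward(mask) - q_val[k]) / count[k]    # incremental mean update
best = max(q_val, key=q_val.get)
print(f"agent keeps {best}/{H} heads (estimated value {q_val[best]:.4f})")
```

A realistic setup would replace the toy MSE reward with perplexity on a calibration set and extend the action space to per-layer head and intermediate-dimension ratios, but the structure of the loop — propose a structured mask, score it, update the agent — stays the same.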
- AMC: AutoML for Model Compression and Acceleration on Mobile Devices. Thanks for providing the pruning framework!
- LLM-Pruner, which utilizes LM Evaluation Harness, PEFT, and Alpaca-LoRA. Thanks for the pioneering work on structured pruning of LLMs!