pip install -r requirements.txt
Step 1: Run bash ./scripts/cotrain.sh to try NutePrune pruning. (Modify pruning_type, target_sparsity, model_name_or_path, and lagrangian_warmup_epochs to run different tasks; see the sketch below.)
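For orientation, a minimal sketch of the kind of variable block such a script exposes. The values here are illustrative assumptions, not the repository's defaults; check scripts/cotrain.sh for the real names and values:

# Illustrative settings only -- consult scripts/cotrain.sh for actual defaults.
pruning_type=structured                        # assumed value; selects the pruning scheme
target_sparsity=0.5                            # example: prune to 50% sparsity
model_name_or_path=meta-llama/Llama-2-7b-hf    # hypothetical base model
lagrangian_warmup_epochs=1                     # warm-up epochs for the sparsity constraint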
Step 1: After pruning, prepare the pruned output folder as $baseline_pruned_model; it should contain the LoRA weights file lora_weights.pt and the pruning mask file zs.pt.
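A quick sanity check that the folder is laid out as expected (the path below is a hypothetical example):

baseline_pruned_model=./output/pruned_model    # hypothetical path to the pruning output
ls -l "$baseline_pruned_model"/lora_weights.pt "$baseline_pruned_model"/zs.pt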
Step 2: Prepare the dataset for training: download the official Alpaca dataset and put it into ./data.
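One way to fetch it, assuming the fine-tuning script reads ./data/alpaca_data.json (the target filename is an assumption; adjust it if finetune_alpaca.sh expects a different name):

mkdir -p ./data
wget -O ./data/alpaca_data.json \
  https://raw.githubusercontent.com/tatsu-lab/stanford_alpaca/main/alpaca_data.json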
Step 3: Run bash ./scripts/finetune_alpaca.sh to try post fine-tuning on Alpaca. (Modify baseline_pruned_model and model_name_or_path to run different tasks; see the example below.)
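As with pruning, these are plain shell variables near the top of the script; a hedged example with placeholder paths:

# Point the script at the folder prepared in Step 1 (paths are examples).
baseline_pruned_model=./output/pruned_model
model_name_or_path=meta-llama/Llama-2-7b-hf    # must match the model that was pruned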
1. PPL
Run bash ./scripts/eval_ppl.sh
2. Zero-shot Commonsense Reasoning
First, install lm-evaluation-harness:
cd lm-evaluation-harness
conda create -n lm-eval python==3.9
conda activate lm-eval
pip install -e .
Install other packages:
pip install deepspeed
pip install sentencepiece
Then run bash ./scripts/eval_commonsense.sh
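The provided script wraps the harness and handles the NutePrune masks; for orientation only, a direct invocation of harness versions from that era on a standard HuggingFace checkpoint looked roughly like this (the model path and task list are illustrative, and flags vary by harness version):

python main.py \
  --model hf-causal \
  --model_args pretrained=./output/merged_model \
  --tasks boolq,piqa,hellaswag,winogrande,arc_easy,arc_challenge,openbookqa \
  --device cuda:0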
3. Benchmarks
To evaluate MMLU, BBH, GSM8K, and other LLM benchmarks, we recommend using the latest lm-evaluation-harness:
cd ~
git clone https://github.com/EleutherAI/lm-evaluation-harness.git
cd lm-evaluation-harness
conda create -n leh python==3.9
conda activate leh
pip install -e .
pip install sentencepiece
pip install protobuf
Then merge the LoRA weights and pruning masks by running bash ./scripts/merge_weights.sh
Finally, run bash ./scripts/eval_benchmark.sh
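For reference, a direct run with the current harness on the merged checkpoint would look roughly like this (the merged-model path is an assumption about what merge_weights.sh produces):

lm_eval --model hf \
  --model_args pretrained=./output/merged_model \
  --tasks mmlu,bbh,gsm8k \
  --batch_size 8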