[Project Page] [Data] [Model Zoo]
This repository contains the code to replicate the sequential instruction tuning experiments in the paper SIT: Fine-tuning Large Language Models with Sequential Instructions.
Our implementation is based on the Open-Instruct and LAVIS repositories.
For text-only experiments
# Prepare environment
conda create -n seq_ins python=3.8
conda activate seq_ins
bash setup.sh
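Optionally, you can sanity-check the environment after setup. This assumes setup.sh installs PyTorch; adjust the check if your setup differs:
# Optional: verify that PyTorch is installed and CUDA is visible
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"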
For vision-language experiments
cd LAVIS
conda create -n seq_ins_vl python=3.8
conda activate seq_ins_vl
pip install -e .
Next, prepare the training and evaluation data:
You can download the sequential instruction data for training here, then move it to self-seq/data/.
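For example, placing the downloaded file could look like the following (the filename sequential_instructions.json is a placeholder; use whatever name the released file actually has):
# Hypothetical example: adjust the filename to match the downloaded release
mkdir -p self-seq/data
mv sequential_instructions.json self-seq/data/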
You can download the evaluation data by running:
bash scripts/prepare_eval_data.sh
For vision-language data:
cd LAVIS
bash download_vqa.sh
To convert the original instruction-tuning dataset to its sequential version, run:
bash self-seq/scripts/generation_flancot_llama_70b.sh
To train on both sequential and original instruction data, specify your preferred LLM and the path to the training dataset in scripts/alpaca/finetune_accelerate_sit_llama_70b.sh, then run:
bash scripts/alpaca/finetune_accelerate_sit_llama_70b.sh
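As a rough sketch, the settings to edit inside scripts/alpaca/finetune_accelerate_sit_llama_70b.sh typically look like the lines below; the variable names and values here are illustrative only, and the actual names in the script may differ:
# Hypothetical excerpt of scripts/alpaca/finetune_accelerate_sit_llama_70b.sh
MODEL_NAME_OR_PATH=meta-llama/Llama-2-70b-hf            # your preferred LLM (placeholder)
TRAIN_FILE=self-seq/data/sequential_instructions.json   # path to the training dataset (placeholder)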
To train on vision-language data, first specify the pre-trained checkpoint in ./LAVIS/lavis/configs/models/blip2 and the output model path in ./LAVIS/lavis/projects/instructblip/caption_coco_vicuna7b_train.yaml, then run:
bash run_scripts/blip2/train/eval_instruct_caption_coco.sh
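As a rough, hypothetical sketch (assuming the usual LAVIS config layout; the exact keys in your copies of these files may differ), the fields to edit are along these lines:
# In ./LAVIS/lavis/configs/models/blip2/<model config>.yaml  (hypothetical excerpt)
model:
  pretrained: "/path/to/your/pretrained_checkpoint.pth"

# In ./LAVIS/lavis/projects/instructblip/caption_coco_vicuna7b_train.yaml  (hypothetical excerpt)
run:
  output_dir: "output/instructblip/caption_coco_vicuna7b"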
To evaluate, first prepare the evaluation datasets (if you have not done so already):
bash scripts/prepare_eval_data.sh
Then run evaluation on all general and sequential tasks, replacing YOUR_MODEL_NAME with the path to your trained model:
bash scripts/evaluation.sh YOUR_MODEL_NAME
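For example, if your fine-tuned checkpoint lives under output/llama_sit (a placeholder path), the call would be:
# Replace the path with your own checkpoint directory
bash scripts/evaluation.sh output/llama_sit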
Please consider citing us if you use our materials.
@article{hu2024fine,
  title={Fine-tuning Large Language Models with Sequential Instructions},
  author={Hu, Hanxu and Yu, Simon and Chen, Pinzhen and Ponti, Edoardo M},
  journal={arXiv preprint arXiv:2403.07794},
  year={2024}
}