MrigankRaman/LLM_Comp

NeurIPS LLM Efficiency Challenge Student Submission

Team name: ReaLLM Conquerors

Description

  • Our submissions for the 4090 track are in the 4090_submissions folder.

  • Our submissions for the A100 track are in the A100_submissions folder.

  • Our training code for the 4090 track is in the 4090_training_code folder. Use the Dockerfile in that folder to run the training. The final artifact is an adapter written to /submission/qwen_ours_3e-5_4bit/checkpoint-2340, and it can be retrieved with the docker cp command. We would like to use our first submission.

  • Our training code for the A100 track is in the A100_training_code folder. Use the Dockerfile in that folder to run the training. The final artifact is an adapter for Qwen written to /submission/qwen_ours_3e-5_new/checkpoint-2340, and it can be retrieved with the docker cp command. We would like to use our first submission.
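
The build, train, and copy steps described above might look roughly like the sketch below for the 4090 track. The image and container names are hypothetical, and the sketch assumes that the folder contains a file named Dockerfile and that training runs when the container starts (if training runs during the image build instead, the docker run step only needs to create the container). The script prints the commands by default; set DRY_RUN=0 to actually execute them.

```shell
#!/usr/bin/env bash
# Sketch only: IMAGE and CONTAINER are hypothetical names, not taken from the repo.
set -euo pipefail

IMAGE="llm-comp-4090-train"      # hypothetical image tag
CONTAINER="llm-comp-4090-run"    # hypothetical container name
CONTEXT="4090_training_code"     # folder with the training code (from the README)
ARTIFACT="/submission/qwen_ours_3e-5_4bit/checkpoint-2340"  # adapter path (from the README)
DRY_RUN="${DRY_RUN:-1}"          # 1 = only print the commands, 0 = run them

run() {
  # Print the command; execute it only when DRY_RUN=0.
  echo "+ $*"
  if [ "$DRY_RUN" = "0" ]; then "$@"; fi
}

# Build the image from the training folder (assumes the file is named Dockerfile).
run docker build -t "$IMAGE" "$CONTEXT"

# Assumption: training starts with the container.
# --gpus all requires the NVIDIA Container Toolkit on the host.
run docker run --gpus all --name "$CONTAINER" "$IMAGE"

# Copy the trained adapter out of the (stopped) container.
run docker cp "$CONTAINER:$ARTIFACT" ./checkpoint-2340
```

For the A100 track, the same steps should apply with CONTEXT set to A100_training_code and ARTIFACT set to /submission/qwen_ours_3e-5_new/checkpoint-2340.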

Public Dataset Download Instruction

To download the dataset:

gdown 1E43WbnyL8iXzOw21ye95VdJAYyra6ewd

To recreate our dataset (the same environment used for training should work):

cd data/

bash create_dataset.sh

Disclaimer: There have been some slight changes to the ReClor and ARB datasets since we first created our dataset, so the recreated dataset will contain 11 fewer examples than the one downloaded from Google Drive. This should not affect reproduction much.
