Welcome to the repository containing code to fine-tune a Large Language Model (LLM) for answering multiple-choice questions, specifically using the dataset from the Kaggle LLM Science Exam competition. The primary objective of this project is to fine-tune a language model and adapt it to accurately predict the correct answers to a set of questions.
The repository contains Jupyter notebooks covering the main parts of the project:
- Crafting a Delectable Mix of Data and Prompts!: Setting up an ideal learning environment for Large Language Models (LLMs) is akin to creating a recipe for success. Imagine LLMs as eager learners, ready to absorb a varied mix of information. It's not just about throwing words together at random: we carefully design prompts, like clear instructions, to help LLMs tackle different language tasks, from simple summaries to more intricate challenges. After curating this diverse mix of data, we split it into training and validation sets. The training set acts as the main course, allowing the LLM to grasp language patterns, while the validation set checks that it genuinely comprehends the material. This notebook contains the data-processing code; a minimal sketch of the prompt formatting and split appears after this list.
- Training Large Language Models Made as Delightful as Cooking a Gourmet Meal!: Have you ever thought about the detailed steps involved in training a large language model? It's a bit like preparing a fancy meal: each part needs careful consideration and accurate execution, from gathering the best ingredients to mastering the cooking techniques. Just as a chef creates a masterpiece by paying attention to every detail, a well-trained language model demands precision and dedication at every stage. This notebook contains the code for training the LLM; a minimal training sketch also follows this list, and the flowchart below gives a clearer view of the training process.
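To make the first notebook concrete, here is a minimal sketch of the data-processing step. It assumes the Kaggle-style column layout (a `prompt` question, options `A` through `E`, and an `answer` letter) and a local `train.csv`; the exact prompt template used in the notebook may differ.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Assumed Kaggle-style layout: a question ("prompt"), five options ("A".."E"),
# and the correct letter ("answer").
df = pd.read_csv("train.csv")

def build_prompt(row: pd.Series) -> str:
    """Format one question, its options, and its answer as one training text."""
    options = "\n".join(f"{letter}. {row[letter]}" for letter in "ABCDE")
    return (
        "Answer the following multiple-choice question with the letter "
        "of the correct option.\n\n"
        f"Question: {row['prompt']}\n{options}\nAnswer: {row['answer']}"
    )

df["text"] = df.apply(build_prompt, axis=1)

# Hold out a validation set so we can check the model genuinely generalises
# rather than memorising the training questions.
train_df, val_df = train_test_split(df, test_size=0.1, random_state=42)
print(f"{len(train_df)} training rows, {len(val_df)} validation rows")
```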
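And for the second notebook, a minimal training sketch using Hugging Face `transformers` with a LoRA adapter from `peft`. It reuses `train_df` and `val_df` from the snippet above; the base model (`gpt2` as a lightweight stand-in) and all hyperparameters here are illustrative placeholders, not the exact recipe from the notebook.

```python
from datasets import Dataset
from peft import LoraConfig, get_peft_model
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

base_model = "gpt2"  # lightweight stand-in; swap in the actual base model
tokenizer = AutoTokenizer.from_pretrained(base_model)
tokenizer.pad_token = tokenizer.eos_token  # gpt2 ships without a pad token
model = AutoModelForCausalLM.from_pretrained(base_model)

# Wrap the base model with a small LoRA adapter so only a fraction of the
# weights are updated; r/alpha/dropout here are illustrative, not tuned.
model = get_peft_model(
    model, LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM")
)

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

train_ds = Dataset.from_pandas(train_df.reset_index(drop=True))
train_ds = train_ds.map(tokenize, batched=True, remove_columns=train_ds.column_names)
val_ds = Dataset.from_pandas(val_df.reset_index(drop=True))
val_ds = val_ds.map(tokenize, batched=True, remove_columns=val_ds.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="outputs",
        per_device_train_batch_size=4,
        num_train_epochs=3,
        learning_rate=2e-4,
        logging_steps=50,
    ),
    train_dataset=train_ds,
    eval_dataset=val_ds,
    # mlm=False gives plain next-token (causal) language-modelling labels.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
print(trainer.evaluate())  # validation loss as a sanity check
trainer.save_model("fine-tuned-model")
```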
The fine-tuned model has been uploaded to the 🤗 Hub. Click 🤗 to land on the model space. You can also view the training board, which I have made public.
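Once the model is on the Hub, loading it back for inference takes only a few lines. The repo id below is a placeholder; substitute the actual model id from the link above.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "your-username/llm-science-exam-model"  # placeholder repo id
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)

question = (
    "Answer the following multiple-choice question with the letter "
    "of the correct option.\n\n"
    "Question: Which gas makes up most of Earth's atmosphere?\n"
    "A. Oxygen\nB. Nitrogen\nC. Carbon dioxide\nD. Argon\nE. Helium\nAnswer:"
)
inputs = tokenizer(question, return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=1)
# Decode only the newly generated token, i.e. the predicted letter.
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:]))
```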