This repository contains projects completed as part of the Large Language Models (LLMs) course at the University of Tehran (Spring 2024). The projects cover a range of topics related to LLMs, including word embeddings, fine-tuning, retrieval-augmented generation, reinforcement learning from human feedback (RLHF) for alignment, and more.
In this project, we explore foundational concepts in natural language processing (NLP). We start with word embeddings using the GloVe model and the Gensim library, visualizing and interpreting the semantic relationships captured by these embeddings.
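The workflow looks roughly like the following sketch, which loads pre-trained GloVe vectors through Gensim's downloader; the `glove-wiki-gigaword-100` checkpoint is an assumed example, not necessarily the one used in the notebooks.

```python
# Minimal sketch: exploring GloVe embeddings with Gensim.
# NOTE: the "glove-wiki-gigaword-100" model name is an assumed example.
import gensim.downloader as api

glove = api.load("glove-wiki-gigaword-100")  # 100-dimensional GloVe vectors

# Nearest neighbours in embedding space capture semantic similarity.
print(glove.most_similar("king", topn=5))

# The classic analogy: king - man + woman ≈ queen.
print(glove.most_similar(positive=["king", "woman"], negative=["man"], topn=1))
```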
Next, we dive into Masked Language Modeling (MLM) with BERT, a crucial technique in contextual language understanding. We experiment with masking tokens in sentences and training BERT to predict them, which is fundamental to its pre-training process.
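A quick way to reproduce this behaviour is the Hugging Face fill-mask pipeline; the sketch below assumes the `bert-base-uncased` checkpoint.

```python
# Minimal sketch: masked-token prediction with BERT (assumed checkpoint).
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="bert-base-uncased")

# BERT scores candidate tokens for the [MASK] position.
for pred in fill_mask("The capital of France is [MASK]."):
    print(f"{pred['token_str']:>10}  score={pred['score']:.3f}")
```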
This part focuses on fine-tuning the pre-trained BERT model for downstream NLP tasks such as text classification and question answering. Using the Transformers library by Hugging Face, we fine-tune BERT on specific datasets and evaluate its performance. The goal is to understand how transfer learning can be applied to enhance BERT's performance on task-specific data.
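As a rough illustration of the fine-tuning loop, the sketch below uses the Trainer API on a sentiment classification dataset; the SST-2 dataset and all hyperparameters are assumptions for illustration only.

```python
# Minimal sketch: fine-tuning BERT for text classification with Trainer.
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          DataCollatorWithPadding, Trainer, TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2)

dataset = load_dataset("glue", "sst2")  # assumed example dataset
tokenized = dataset.map(
    lambda batch: tokenizer(batch["sentence"], truncation=True), batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="bert-sst2", num_train_epochs=1,
                           per_device_train_batch_size=16),
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["validation"],
    data_collator=DataCollatorWithPadding(tokenizer),  # dynamic padding
)
trainer.train()
```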
In this assignment, we work with the GPT-2 model to explore different prompting techniques:

- Single Sentence Prompting: We experiment with generating text from a single sentence prompt, analyzing generation speed, memory usage, and model performance.
- Batch Generation Prompting: We compare the performance of single prompts with batch generation, using multiple prompts of varying lengths. This lets us assess how prompt length and batch processing affect GPT-2's output and efficiency (see the sketch after this list).
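A minimal sketch of batched generation with GPT-2, with padding handled so prompts of different lengths can share a batch; the prompts and sampling settings are illustrative assumptions.

```python
# Minimal sketch: batched prompting with GPT-2.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

# GPT-2 has no pad token; reuse EOS and left-pad so every prompt ends
# right at the generation boundary.
tokenizer.pad_token = tokenizer.eos_token
tokenizer.padding_side = "left"

prompts = ["The weather today is",
           "In a distant galaxy, explorers discovered"]
inputs = tokenizer(prompts, return_tensors="pt", padding=True)

outputs = model.generate(**inputs, max_new_tokens=30, do_sample=True,
                         top_p=0.9, pad_token_id=tokenizer.eos_token_id)
for text in tokenizer.batch_decode(outputs, skip_special_tokens=True):
    print(text, "\n---")
```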
Soft prompt tuning involves creating learnable prompts that are optimized for specific tasks. In this project, we use BERT as a base model, applying soft prompts to a dataset of Persian sentences. We implement custom layers for prompt embeddings and evaluate the effectiveness of these soft prompts in improving the model's performance on polarity classification tasks.
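The core idea can be sketched as a small wrapper module that prepends trainable prompt vectors to BERT's input embeddings; the model name, prompt length, and head design below are illustrative assumptions, not the project's exact implementation.

```python
# Minimal sketch: a learnable soft prompt prepended to a frozen BERT.
import torch
import torch.nn as nn
from transformers import AutoModel

class SoftPromptBert(nn.Module):
    def __init__(self, model_name="bert-base-multilingual-cased",  # assumed
                 n_prompt_tokens=20, num_labels=2):
        super().__init__()
        self.bert = AutoModel.from_pretrained(model_name)
        for p in self.bert.parameters():      # freeze the base model
            p.requires_grad = False
        hidden = self.bert.config.hidden_size
        # Only the prompt embeddings and the classifier head are trained.
        self.prompt = nn.Parameter(torch.randn(n_prompt_tokens, hidden) * 0.02)
        self.classifier = nn.Linear(hidden, num_labels)

    def forward(self, input_ids, attention_mask):
        tok_emb = self.bert.embeddings.word_embeddings(input_ids)
        batch = input_ids.size(0)
        prompt = self.prompt.unsqueeze(0).expand(batch, -1, -1)
        emb = torch.cat([prompt, tok_emb], dim=1)
        pad = torch.ones(batch, self.prompt.size(0),
                         dtype=attention_mask.dtype,
                         device=attention_mask.device)
        mask = torch.cat([pad, attention_mask], dim=1)
        out = self.bert(inputs_embeds=emb, attention_mask=mask)
        # Classify from the representation at the first prompt position.
        return self.classifier(out.last_hidden_state[:, 0])
```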
This project explores the Chain-of-Thought (CoT) reasoning technique, which enhances the problem-solving capabilities of LLMs. We apply CoT and Self-Consistency methods to the Phi-2 model for a question-answering task, comparing their performance to traditional approaches.
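The Self-Consistency part can be sketched as sampling several reasoning chains and majority-voting over the extracted final answers; the model checkpoint, the toy question, and the regex-based answer parsing are all assumptions.

```python
# Minimal sketch: CoT prompting with Self-Consistency (majority vote).
import re
from collections import Counter
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/phi-2")
model = AutoModelForCausalLM.from_pretrained("microsoft/phi-2")

question = ("Ali has 3 boxes with 4 pens each. He gives away 5 pens. "
            "How many pens remain?")
prompt = f"Question: {question}\nLet's think step by step."
inputs = tokenizer(prompt, return_tensors="pt")

answers = []
for _ in range(5):                      # five sampled reasoning paths
    out = model.generate(**inputs, max_new_tokens=128,
                         do_sample=True, temperature=0.8)
    text = tokenizer.decode(out[0], skip_special_tokens=True)
    numbers = re.findall(r"\d+", text[len(prompt):])
    if numbers:
        answers.append(numbers[-1])     # last number ≈ final answer

print(Counter(answers).most_common(1))  # majority-voted answer
```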
In this section, we focus on efficient fine-tuning methods for large models. We use LoRA (Low-Rank Adaptation) to fine-tune the Phi-2 model for a question generation task, demonstrating how parameter-efficient fine-tuning (PEFT) can reduce computational overhead while maintaining or improving model performance.
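Attaching a LoRA adapter with the PEFT library might look like the sketch below; the rank, scaling, and target modules are common defaults and are assumptions rather than the project's exact settings.

```python
# Minimal sketch: wrapping Phi-2 with a LoRA adapter via PEFT.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("microsoft/phi-2")

lora_config = LoraConfig(
    r=16,               # rank of the low-rank update matrices
    lora_alpha=32,      # scaling factor applied to the update
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj"],  # assumed attention projections
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only a small fraction is trainable
```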
We implement a Retrieval-Augmented Generation (RAG) system using the Llama-2 7B Chat model. The project includes creating a retrieval pipeline with both TF-IDF and semantic retrievers, and integrating these with the language model to generate responses based on retrieved documents. We evaluate the effectiveness of this approach in information retrieval tasks.
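The TF-IDF half of the pipeline can be sketched with scikit-learn; the toy corpus and prompt template below are illustrative assumptions.

```python
# Minimal sketch: TF-IDF retrieval feeding a RAG prompt.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

corpus = [  # assumed toy corpus
    "Llama-2 is a family of open-weight language models.",
    "TF-IDF weighs terms by frequency and inverse document frequency.",
    "RAG grounds generation in retrieved documents.",
]

vectorizer = TfidfVectorizer()
doc_matrix = vectorizer.fit_transform(corpus)

def retrieve(query, k=2):
    """Return the k documents most similar to the query."""
    scores = cosine_similarity(vectorizer.transform([query]), doc_matrix)[0]
    return [corpus[i] for i in scores.argsort()[::-1][:k]]

query = "How does RAG work?"
context = "\n".join(retrieve(query))
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {query}"
# `prompt` is then passed to Llama-2 7B Chat for grounded generation.
```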
This project introduces RLHF as a method to align language model outputs with human preferences. We fine-tune a GPT-2 model on a summarization task, train a reward model, and apply Proximal Policy Optimization (PPO) to optimize the model based on human feedback.
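A single PPO update in the spirit of the TRL library (older 0.x-style API, an assumption about the version) might look like the sketch below; the prompt and the constant reward standing in for the trained reward model are also assumptions.

```python
# Minimal sketch: one PPO step with TRL (0.x-style API, assumed).
import torch
from transformers import AutoTokenizer
from trl import AutoModelForCausalLMWithValueHead, PPOConfig, PPOTrainer

model = AutoModelForCausalLMWithValueHead.from_pretrained("gpt2")
ref_model = AutoModelForCausalLMWithValueHead.from_pretrained("gpt2")
tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token

ppo_trainer = PPOTrainer(PPOConfig(batch_size=1, mini_batch_size=1),
                         model, ref_model, tokenizer)

query = tokenizer("Summarize: The meeting covered budgets and hiring.",
                  return_tensors="pt").input_ids[0]
response = ppo_trainer.generate(query, return_prompt=False,
                                max_new_tokens=20,
                                pad_token_id=tokenizer.eos_token_id)[0]

# In the project the score comes from a trained reward model; a constant
# stands in for it here.
reward = torch.tensor(1.0)
stats = ppo_trainer.step([query], [response], [reward])
```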
We explore model quantization using the QLoRA method to fine-tune the Mistral 7B model. This process reduces the model's memory footprint and speeds up inference. Additionally, we perform instruction tuning on the Mistral 7B Instruct model, enabling it to follow complex instructions more effectively.
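Loading the base model in 4-bit for QLoRA-style training typically uses a bitsandbytes config like the one below; the checkpoint name and settings follow the common QLoRA recipe and are assumptions rather than the project's exact configuration.

```python
# Minimal sketch: loading Mistral 7B in 4-bit NF4 for QLoRA fine-tuning.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",              # NormalFloat4 quantization
    bnb_4bit_use_double_quant=True,         # also quantize the quant constants
    bnb_4bit_compute_dtype=torch.bfloat16,  # compute in bfloat16
)

model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.1",            # assumed checkpoint
    quantization_config=bnb_config,
    device_map="auto",
)
# LoRA adapters (as in the PEFT sketch above) are then attached on top
# of the frozen 4-bit base weights.
```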
In this final section, we evaluate text generation using BERTScore, a metric that measures the similarity of generated text to reference text. We experiment with both the official BERTScore implementation and a custom implementation, using the DeBERTa model for evaluation tasks.
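Scoring with the official `bert-score` package can be sketched as follows; the DeBERTa checkpoint name and the toy sentence pair are assumptions.

```python
# Minimal sketch: BERTScore with a DeBERTa backbone (assumed checkpoint).
from bert_score import score

candidates = ["The cat sat on the mat."]
references = ["A cat was sitting on the mat."]

# P, R, F1 are per-sentence tensors of precision, recall, and F1.
P, R, F1 = score(candidates, references,
                 model_type="microsoft/deberta-xlarge-mnli", lang="en")
print(f"BERTScore F1 = {F1.mean().item():.3f}")
```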
Please note that this README was generated with the assistance of ChatGPT (GPT-4 engine), an AI language model developed by OpenAI.