This repo contains homework assignment solutions for the Stanford CS 330 (Deep Multi-Task and Meta Learning) class offered in Fall 2022. A brief summary of the key concepts covered in each assignment is given below.
Homework 0: Multitask Training for Recommender Systems
This assignment implements a multi-task movie recommender system based on the classic matrix factorization and neural collaborative filtering algorithms. In particular, it builds a model inspired by the BellKor solution to the Netflix Grand Prize challenge and extends it to predict both which movies a user is likely to interact with and what score they would give. A multi-task neural network architecture is implemented, and the effect of parameter sharing and loss weighting on model performance is explored. The main goal of these exercises is to become familiar with multi-task architectures, the training pipeline, and coding in PyTorch.
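As a rough illustration of the idea (not the assignment's exact architecture), the sketch below shows shared user/movie embeddings feeding two task-specific heads, trained with a weighted sum of the two losses. All names and shapes here are illustrative.

```python
import torch
import torch.nn as nn

class MultiTaskMF(nn.Module):
    """Shared user/movie embeddings with two task-specific heads:
    (1) binary interaction prediction, (2) rating regression."""
    def __init__(self, num_users, num_movies, dim=32):
        super().__init__()
        self.user_emb = nn.Embedding(num_users, dim)    # shared across both tasks
        self.movie_emb = nn.Embedding(num_movies, dim)  # shared across both tasks
        self.interact_head = nn.Linear(dim, 1)  # will the user watch this movie?
        self.rating_head = nn.Linear(dim, 1)    # what score would they give it?

    def forward(self, user_ids, movie_ids):
        # Elementwise product of embeddings, matrix-factorization style.
        x = self.user_emb(user_ids) * self.movie_emb(movie_ids)
        return self.interact_head(x).squeeze(-1), self.rating_head(x).squeeze(-1)

def multitask_loss(interact_logits, rating_pred, watched, rating, lam=0.5):
    """Weighted multi-task loss; lam trades off the two objectives.
    `watched` is a float tensor of 0/1 labels, `rating` a float tensor of scores."""
    bce = nn.functional.binary_cross_entropy_with_logits(interact_logits, watched)
    mse = nn.functional.mse_loss(rating_pred, rating)
    return lam * bce + (1.0 - lam) * mse
```

Sweeping `lam` (and how much of the network is shared between the two heads) is the kind of parameter-sharing/loss-weighting experiment the assignment explores.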
Homework 1: Data Processing and Black-Box Meta-Learning
The goal of this assignment is to understand meta-learning for few-shot classification. The key tasks are:
- Learn how to process and partition data for meta-learning problems, where training is done over a distribution of tasks (a minimal episode-sampling sketch follows this list).
- Implement and train memory-augmented neural networks (MANNs), a black-box meta-learner that uses a recurrent neural network [1].
- Analyze learning performance on problems of different sizes.
- Experiment with model parameters and explore how they affect performance. The analysis uses the Omniglot dataset [2], which contains 1623 characters from 50 alphabets, with 20 28x28 images per character.
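The data-processing part centers on sampling N-way, K-shot tasks from a pool of classes. Below is a minimal sketch of how one episode might be assembled from a class-indexed image collection; the function and variable names are illustrative and do not correspond to the assignment's starter code.

```python
import numpy as np

def sample_episode(images_by_class, n_way=5, k_shot=1, k_query=1, rng=None):
    """Sample one N-way, K-shot episode.

    images_by_class: dict mapping class id -> array of shape (num_images, 28, 28).
    Returns support/query images and episode-local labels in 0..n_way-1.
    """
    rng = rng or np.random.default_rng()
    classes = rng.choice(list(images_by_class.keys()), size=n_way, replace=False)
    support_x, support_y, query_x, query_y = [], [], [], []
    for label, cls in enumerate(classes):
        # Draw k_shot + k_query distinct images of this class.
        idx = rng.choice(len(images_by_class[cls]), size=k_shot + k_query, replace=False)
        imgs = images_by_class[cls][idx]
        support_x.append(imgs[:k_shot]); support_y += [label] * k_shot
        query_x.append(imgs[k_shot:]);   query_y += [label] * k_query
    return (np.concatenate(support_x), np.array(support_y),
            np.concatenate(query_x), np.array(query_y))
```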
Homework 2: Prototypical Networks and Model-Agnostic Meta-Learning
This assignment experiments with two meta-learning algorithms, prototypical networks (protonets) [3] and model-agnostic meta-learning (MAML) [4], for few-shot image classification on the Omniglot dataset [2]. The key tasks are:
- Implement the protonet and MAML algorithms.
- Interpret the key metrics of both algorithms.
- Investigate the effect of task composition during protonet training on evaluation performance.
- Investigate the effect of different inner-loop adaptation settings in MAML.
- Investigate the performance of both algorithms on meta-test tasks that have more support data than the meta-training tasks (a sketch of the core computations of both algorithms follows this list).
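For orientation, here is a minimal sketch of the central computation of each algorithm, assuming a generic embedding network. Everything is illustrative rather than the assignment's implementation: `protonet_loss` assumes a plain module `f(x)` that returns embeddings, while `maml_inner_adapt` assumes a functional-style forward pass `f(x, params)` that takes its parameters explicitly.

```python
import torch
import torch.nn.functional as F

def protonet_loss(f, support_x, support_y, query_x, query_y, n_way):
    """Prototypical networks: class prototypes are mean support embeddings;
    query points are classified by negative squared distance to each prototype."""
    z_support = f(support_x)                 # (N*K, D) support embeddings
    z_query = f(query_x)                     # (Q, D) query embeddings
    prototypes = torch.stack(
        [z_support[support_y == c].mean(0) for c in range(n_way)])  # (N, D)
    logits = -torch.cdist(z_query, prototypes) ** 2                 # (Q, N)
    return F.cross_entropy(logits, query_y)

def maml_inner_adapt(f, params, support_x, support_y, inner_lr=0.4, inner_steps=1):
    """MAML inner loop: a few gradient steps on the support set, keeping the
    graph (create_graph=True) so the outer loop can differentiate through
    the adaptation. `params` is a dict of name -> tensor."""
    for _ in range(inner_steps):
        loss = F.cross_entropy(f(support_x, params), support_y)
        grads = torch.autograd.grad(loss, list(params.values()), create_graph=True)
        params = {name: p - inner_lr * g
                  for (name, p), g in zip(params.items(), grads)}
    return params  # the outer loop evaluates query loss with these adapted parameters
```

The inner-loop settings the assignment varies (number of steps, inner learning rate, whether the inner learning rate is itself learned) correspond to `inner_steps` and `inner_lr` here.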
Homework 3: Few-Shot Learning with Pre-trained Language Models
This assignment explores several methods for performing few-shot (and zero-shot) learning with pre-trained language models (LMs), including variants of fine-tuning and in-context learning. The goals are to gain familiarity with few-shot learning using pre-trained LMs, to understand the relative strengths and weaknesses of fine-tuning and in-context learning, and to explore recent methods proposed for improving on the basic form of these algorithms, such as different prompting techniques for in-context learning and Low-Rank Adaptation (LoRA) for fine-tuning.
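The LoRA idea can be sketched as wrapping a frozen pre-trained linear layer with a trainable low-rank update. The snippet below is a minimal illustrative version, not the assignment's code or the `peft` library's implementation.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Freeze a pre-trained linear layer and learn a low-rank update:
    y = W x + (alpha / r) * B A x, with A, B of rank r << min(d_in, d_out)."""
    def __init__(self, base: nn.Linear, r=8, alpha=16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # pre-trained weights stay frozen
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))  # zero init: no change at start
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.t() @ self.B.t())
```

Only `A` and `B` receive gradients, so the number of trainable parameters is a small fraction of the full weight matrix, which is what makes LoRA attractive for few-shot fine-tuning of large LMs.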
References
[1] Santoro, A., Bartunov, S., Botvinick, M., Wierstra, D., & Lillicrap, T. (2016). Meta-learning with memory-augmented neural networks. In International Conference on Machine Learning (pp. 1842-1850). PMLR.
[2] Lake, B. M., Salakhutdinov, R., & Tenenbaum, J. B. (2015). Human-level concept learning through probabilistic program induction. Science, 350(6266), 1332-1338.
[3] Snell, J., Swersky, K., & Zemel, R. (2017). Prototypical networks for few-shot learning. Advances in neural information processing systems, 30.
[4] Finn, C., Abbeel, P., & Levine, S. (2017, July). Model-agnostic meta-learning for fast adaptation of deep networks. In International conference on machine learning (pp. 1126-1135). PMLR.