This repository contains the files for the first assignment of the course CS6910 - Deep Learning at IIT Madras: a feedforward neural network with backpropagation, implemented from scratch, along with several optimizers and loss functions.
Jump to section: Usage
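As a rough illustration of what "from scratch" means here, the following is a minimal sketch (not the repository's actual code) of the forward and backward passes for a single sigmoid hidden layer:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Forward pass for one input vector x through one hidden layer
def forward(x, W1, b1, W2, b2):
    a1 = W1 @ x + b1      # hidden pre-activation
    h1 = sigmoid(a1)      # hidden activation
    a2 = W2 @ h1 + b2     # output pre-activation (logits)
    return a1, h1, a2

# Backward pass: given da2, the gradient of the loss w.r.t. the logits,
# compute the gradients w.r.t. all parameters
def backward(x, h1, da2, W2):
    dW2 = np.outer(da2, h1)
    db2 = da2
    dh1 = W2.T @ da2
    da1 = dh1 * h1 * (1.0 - h1)   # sigmoid derivative
    dW1 = np.outer(da1, x)
    db1 = da1
    return dW1, db1, dW2, db2
```

With softmax outputs and cross-entropy loss, `da2` is simply `y_hat - y`.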
Optimizers implemented (see the update-rule sketch after this list):
- SGD - Stochastic Gradient Descent
- Momentum - Momentum SGD
- NAG - Nesterov Accelerated Gradient (optimized version)
- RMSProp - Root Mean Square Propagation
- Adam - Adaptive Moment Estimation
- Nadam - Nesterov Adaptive Moment Estimation
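As a rough illustration of how the update rules differ (a minimal sketch, not the repository's exact implementation; `w` is a parameter and `grad` its gradient):

```python
import numpy as np

# Plain SGD: step against the gradient
def sgd_update(w, grad, lr):
    return w - lr * grad

# Momentum SGD: accumulate a velocity term that smooths updates
def momentum_update(w, grad, v, lr, momentum=0.9):
    v = momentum * v + lr * grad
    return w - v, v

# Adam: bias-corrected first and second moment estimates
# (t is the 1-based step count)
def adam_update(w, grad, m, v, t, lr, beta1=0.9, beta2=0.999, eps=1e-8):
    m = beta1 * m + (1 - beta1) * grad
    v = beta2 * v + (1 - beta2) * grad ** 2
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    return w - lr * m_hat / (np.sqrt(v_hat) + eps), m, v
```

NAG adds a look-ahead step to momentum, RMSProp keeps only the second-moment average, and Nadam combines Adam with Nesterov-style momentum.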
Loss functions implemented (see the sketch after this list):
- Cross Entropy
- Mean Squared Error
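A minimal sketch of the two losses, assuming one-hot targets `y` and predicted probabilities `y_hat`, both of shape `(batch, classes)` (function names and conventions here are illustrative):

```python
import numpy as np

# Cross-entropy, averaged over the batch; eps avoids log(0)
def cross_entropy(y, y_hat, eps=1e-12):
    return -np.sum(y * np.log(y_hat + eps)) / y.shape[0]

# Mean squared error, averaged over all entries
def mean_squared_error(y, y_hat):
    return np.mean((y - y_hat) ** 2)
```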
The default values set in train.py come from hyperparameter tuning done with wandb sweeps.
Wandb report for the assignment: https://wandb.ai/cs20b013-bersilin/cs6910-assignment-1/reports/CS6910-Assignment-1-Report--VmlldzozNzIzMTM2
The dataset used is Fashion-MNIST, available through the keras.datasets module.
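Loading and preparing it looks roughly like this (the flattening, scaling, and validation-split size are illustrative choices, not necessarily what train.py does):

```python
from keras.datasets import fashion_mnist
from sklearn.model_selection import train_test_split

(x_train, y_train), (x_test, y_test) = fashion_mnist.load_data()

# Flatten the 28x28 images and scale pixel values to [0, 1]
x_train = x_train.reshape(len(x_train), -1) / 255.0
x_test = x_test.reshape(len(x_test), -1) / 255.0

# Hold out a validation split
x_train, x_val, y_train, y_val = train_test_split(
    x_train, y_train, test_size=0.1)
```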
Dependencies:
- Python 3.7.10
- Numpy 1.19.5
- Keras 2.4.3 (for the dataset)
- Wandb 0.10.30
- Sklearn 0.24.1 (for the confusion matrix and the train-test split)
To run train.py manually, use the following command:
# This runs with the default values set in train.py
$ python3 train.py -wp <wandb_project_name> -we <wandb_entity_name>
To run the script with custom values, see the output of the following command, which lists all the available options and a short description of each:
$ python3 train.py -h
# The output of the above command is as follows:
usage: train.py [-h] -wp WANDB_PROJECT -we WANDB_ENTITY [-d DATASET] [-e EPOCHS]
[-b BATCH_SIZE] [-l LOSS] [-o OPTIMIZER] [-lr LEARNING_RATE]
[-m MOMENTUM] [-beta BETA] [-beta1 BETA1] [-beta2 BETA2]
[-eps EPSILON] [-w_d WEIGHT_DECAY] [-w_i WEIGHT_INIT]
[-nhl NUM_LAYERS] [-sz HIDDEN_SIZE] [-a ACTIVATION]
options:
-h, --help show this help message and exit
-wp WANDB_PROJECT, --wandb_project WANDB_PROJECT
Wandb project name
-we WANDB_ENTITY, --wandb_entity WANDB_ENTITY
Wandb entity name
-d DATASET, --dataset DATASET
Dataset to use choices=['fashion_mnist', 'mnist']
-e EPOCHS, --epochs EPOCHS
Number of epochs
-b BATCH_SIZE, --batch_size BATCH_SIZE
Batch size
-l LOSS, --loss LOSS Loss function to use choices=['cross_entropy',
'mean_squared_error']
-o OPTIMIZER, --optimizer OPTIMIZER
Optimizer to use choices=['sgd', 'momentum', 'nag', 'rmsprop', 'adam', 'nadam']
-lr LEARNING_RATE, --learning_rate LEARNING_RATE
Learning rate
-m MOMENTUM, --momentum MOMENTUM
Momentum for Momentum and NAG
-beta BETA, --beta BETA
Beta for RMSProp
-beta1 BETA1, --beta1 BETA1
Beta1 for Adam and Nadam
-beta2 BETA2, --beta2 BETA2
Beta2 for Adam and Nadam
-eps EPSILON, --epsilon EPSILON
Epsilon for Adam and Nadam
-w_d WEIGHT_DECAY, --weight_decay WEIGHT_DECAY
Weight decay
-w_i WEIGHT_INIT, --weight_init WEIGHT_INIT
Weight initialization choices=['random', 'xavier']
-nhl NUM_LAYERS, --num_layers NUM_LAYERS
Number of hidden layers
-sz HIDDEN_SIZE, --hidden_size HIDDEN_SIZE
Hidden size
-a ACTIVATION, --activation ACTIVATION
Activation function choices=['sigmoid', 'tanh', 'relu']
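For example, a run with custom hyperparameters might look like this (the values are illustrative):

# Train a 3-layer ReLU network with Adam for 10 epochs
$ python3 train.py -wp <wandb_project_name> -we <wandb_entity_name> -e 10 -b 64 -l cross_entropy -o adam -lr 0.001 -nhl 3 -sz 128 -a relu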
To run a hyperparameter sweep, set the values of count and the project name in sweep_code.py, then run:
$ python3 sweep_code.py
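For reference, a wandb sweep is typically wired up along these lines. This is an illustrative sketch, not the contents of sweep_code.py; the metric name, search space, and `train_fn` are assumptions:

```python
import wandb

# Illustrative sweep configuration; the real search space lives in sweep_code.py
sweep_config = {
    "method": "bayes",
    "metric": {"name": "val_accuracy", "goal": "maximize"},  # metric name assumed
    "parameters": {
        "learning_rate": {"values": [1e-3, 1e-4]},
        "optimizer": {"values": ["adam", "nadam"]},
        "num_layers": {"values": [3, 4, 5]},
    },
}

def train_fn():
    # Stand-in for the actual training entry point: it would read
    # hyperparameters from wandb.config and log metrics with wandb.log
    wandb.init()

sweep_id = wandb.sweep(sweep_config, project="<wandb_project_name>")
wandb.agent(sweep_id, function=train_fn, count=20)  # count = number of runs
```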