Projects

Model Extraction Attack for Video Classification

Research Project For Inter Iit Tech Meet 10.0

Performed Model Extraction of Video Swin Transformer and MoviNet trained on the Kinetics datasets in Black Box and Grey Box settings to obtain competitive results in the task
Used conditional video generator and adverserial crafting along with knowledge distillation based techniques
Won the gold medal in the High Prep event amongst 22 IITs across India
Links : report; code

Papers We read

Contributor | Github Repository

This is an open source repository maintained by Vision And Language Group containing summaries and analysis of recently published research papers
Contributed to summaries from recent Deep Learning Conferences.
Link : code

Semantic Segmentation using U-Net

Open Project | Vision and Language Group

Implemented the U‑Net Architecture for semantic segmentation using PyTorch framework on the HELEN* Dataset for performing pixel‑level classification of human faces.
Tried out various loss functions for model training and evaluated the model on class‑wise F1 scores
Links : code