Nirbhay Sharma

About Me

Hi I am Nirbhay Sharma!

Hi, I am Nirbhay Sharma. I am currently pursuing M.Tech in Artificial Intelligence (AI) from Indian Institute of Science (IISc) Bangalore . I have completed my B.Tech in Computer Science and Engineering (CSE) from Indian Institute of Technology (IIT) Jodhpur
I am a Machine Learning enthusiast and have a good hands on experience in Pytorch and Python I take a keen interest in reading new state-of-the-art research papers and implementing them from scratch fascinates me.
Recent Implementation highlights Representation Learning (Implemented SimCLR, SupCon, BYOL, BarlowTwins, Triplet Margin, SimSiam from scratch) and SSL methods for GNNs (Currently in progress)
Current research on "Leveraging generative modelling for self supervised learning" under prof Venkatesh Babu and prof Chiranjib Bhattacharya

Education

M.Tech Artificial Intelligence, Indian Institute of Science (IISc) Bangalore, CGPA: 8.5 /10, AIR- 6 GATE DA
B.Tech Computer Science and Engineering, Indian Institute of Technology (IIT) Jodhpur, CGPA: 8.97 /10
Class 12, PCM, Dehradun Public School, Percentage: 96.4
Class 10, SD Public School, CGPA: 10 /10

Publications

Aggregation-Assisted Proxyless Distillation: A Novel Approach for Handling System Heterogeneity in Federated Learning, International Joint Conference on Neural Networks (IJCNN) 2024 , [Paper]
An Extremely Lightweight CNN Model For the Diagnosis of Chest Radiographs in Resource-constrained Environments, International Journal of Medical Physics 2023 , [Paper]

Skills

ML/Data Science

Pytorch, Matplotlib, Pandas, Numpy, Sklearn, Deep Learning, Computer Vision,

Languages

Python, C/C++, HTML/CSS, Java, Javascript, Haskell, Prolog, Julia, Bash,

Developement

Docker, Django, Flask, React, Nodejs, Firebase, Mongodb, Mysql, Heroku, Git/Github,

IISC Semester Work

Semester-3 Highlights

Presented literature review on PAC Bayes Concentration Bounds [CI] in Concentration Inequalities course @IISc
Presented Self Supervised Learning Methods [TVA] like SimCLR, BYOL, MoCo, DINO, SupCon etc in topics course @IISc
Presented literature review on Score Matching, Normalizing Flows, Neural ODE [TVA] @IISc
Presented my novel method on Energy Based Representation Learning [TVA] @IISc

Semester-1 @ IISc | ML/DL

Tech: Numpy, Python

The repo contains the first semester assignments and PYQs for Stochastic Models and Applications (STOMA), Linear Algebra and Applications (LAA), Data Structures and Algorithms (DSA), Computational Methods of Optimization (CMO)

Semester-2 @ IISc | ML/DL

Tech: Pytorch, Numpy, Python

The repo contains the second semester assignments and PYQs for Natural Language Processing (NLP), Machine Learning for Signal Processing (MLSP), Game Theory

Semester-3 @ IISc | ML/DL/Theory

Tech: Pytorch, Numpy, Python

The repo contains the third semester assignments and PYQs for Statistical Leaning Theory (SLT), Concentration Inequalities (CI), Topics in Visual Analytics (TVA), Dynamics of Linear System (DLS)

Industry Experience

Mastercard

Data Scientist [Internship]

May2025 - July2025

Multi Teacher GNN To MLP Distillation

Worked with AI Garage on Multi-Teacher GNN to MLP knowledge distillation for efficient inference
Trained multiple teacher GNN models like GCN, GAT, SAGE on node and edge prediction, contrastive tasks
Proposed GMOE2MLP, a graph mixture of expert to MLP distillation method and a novel cluster contrast loss to distill the embeddings from best teacher to MLP

Faaya Astu

ML Engineer [Full Time]

July2023 - July2024

Content Generation, Avatar Generation

Trained Stable Diffusion ControlNet models on Lineart and Colorbox control on VastAI GPU instance and deployed them on RunPod for more flexibility and control on print generation
Trained Low Rank Adaptation (LoRA) models using Kohya_SS for custom face and background generation
Experimented with custom ComfyUI workflows with integrated ControlNet, LoRA, InstantID models
Containerised ComfyUI with Docker and deployed them as Serverless Endpoints on RunPod and exposed endpoint APIs to AWS Lambda to create APIs for APP using AWS API gateway

Exawizards

AI Engineer [Internship]

June2022 - July2022

Split Neural Network Models

Worked on Split Neural Network ML paradigm and Splitted Mask-RCNN, FCN_Resnet50, YOLOv5 models for Instance segmentation, segmentation, face detection tasks
Implemented Autoencoder model for efficient image compression to latent space and setup Pysyft to communicate latents from Jetson Nano to GPU server, preserving data privacy at Jetson Nano

Research Experience

IIT Jodhpur

August2022 - May2023

FedAgPD: Aggregation-Assisted Proxyless Distillation

Supervisor: Dr. Deepak Mishra

Proposed a novel FL Framework FedAgPD to simultaneously handle model and data heterogeneity
Leveraged Deep Mutual Learning at Client and Aggregation followed by Gaussian Noise based data free distillation at the Server, eliminating need of proxy dataset or GAN's
FedAgPD achieved 2x better performance compared to SOTA FL algorithms like FedDF, FedMD, Kt-pfl

IIT Jodhpur

June2021 - May2022

Extremely Lightweight CNN for Chest X-Ray Diagnosis

Supervisor: Dr. Angshuman Paul

Designed a novel Lightweight CNN model (ExLNet) for the abnormal detection of Chest Radiographs
Fused Squeeze and Excitation blocks with Depth-wise convolution to create DCISE layer as a component of ExLNet, which outperforms SOTA models like Mobilenet, Shufflenet on NIH, VinBig medical datasets

IIT Jodhpur

August2022 - May2023

Cell Detection and Classification

Supervisor: Dr. Angshuman Paul

Detected and classified cells data sample into necrotic and apoptotic cells
Finetuned various SOTA object detectors such as YOLO, SSD, RetinaNet, DeTR
Achieved remarkable results using DeTR with a Mean Average Precision (MAP) of 40.0

Projects

Leveraging Generative Modelling for SSL | ML/DL (Ongoing)

Tech: Pytorch, Neural ODE, Python, EBM

Leveraging gnenerative modelling for rich and generalizable self supervised representations

Generative AI Implemenation | ML/DL (Ongoing)

Tech: Pytorch, GANs, Diffusion

Implementing generative AI methods like Noise condition score network, Score matching

Representation Learning Algorithms | ML/DL

Tech: Pytorch, Contrastive Learning, Python

Implemented SSL methods like SimCLR, SupCon, BYOL, Barlow Twins, SimSiam, Triplet Margin and report their performance on CIFAR10/100 with two varients of encoder architecture, ResNet 18/50

Self Supervised Learning Methods for GNNs | ML/DL (Ongoing)

Tech: Pytorch, GNN, Python, SSL

Implementing SSL for GNNs

Image Captioning using Detection Transformer (DeTR) | ML/DL

Tech: Pytorch, Transformers, Python

Implemented modified DeTR from scratch in pytorch for image captioning task. Trained DeTR on Flickr30k dataset for 500 epochs and achieved a BLEU score of 57.36 on Flickr8k dataset

Vision Transformers Implementation | ML/DL

Tech: Pytorch, Vision Transformers, Python

Implemented 11 SOTA research papers on vision transformers variants like Swin Transformer, Pyramid ViT, Convolution ViT etc. for Image Classification from scratch in pytorch

Regularizing Federated Learning via Adversarial Model Perturbations | ML/DL

Tech: FL, Pytorch, Python, AMP

The project aims at implementing the state of the art methods for Federated Learning (FL) like scaffold, FedNTD, FedProx, FedAvg and regularize the client using adversarial model perturbations to reach flat minima.

CNNAlgos-Comparison | ML/DL

Tech: Python, Pytorch

The project consists of various deep CNN architectures (coded from scratch) on Retinal eye disease dataset (kaggle), and performed a comparative study among these deep architectures

Image Colorization | ML/DL

Tech: Python, Pytorch

The project aims at implementing Pix2Pix GAN architecture from scratch on RGB and LAB image format, to convert a black and white image to colored image

Mask-NoMask detection | ML/DL

Tech: Python, Pytorch, Flask, OpenCV

the project is build using pytorch library and the final trained model is then used for real time detection using openCV and also the testing can be done from web application build using flask

PRA-Visulaizer | web-dev

Tech: React, Javascript, HTML/CSS, JSX

The project consists of visualization of various page replacement algorithms such as (FIFO, LRU, OPR) etc. given number of frames and demand pages, the app can visualize how various algorithms handle page replacement.

Connect