Hi, I am Nirbhay Sharma. I am currently pursuing M.Tech in Artificial Intelligence (AI) from Indian Institute of Science (IISc) Bangalore . I have completed my B.Tech in Computer Science and Engineering (CSE) from Indian Institute of Technology (IIT) Jodhpur
I am a Machine Learning enthusiast and have a good hands on experience in Pytorch and Python I take a keen interest in reading new state-of-the-art research papers and implementing them from scratch fascinates me.
Recent Implementation highlights Representation Learning (Implemented SimCLR, SupCon, BYOL, BarlowTwins, Triplet Margin, SimSiam from scratch) and SSL methods for GNNs (Currently in progress)
M.Tech Artificial Intelligence, Indian Institute of Science (IISc) Bangalore, CGPA: 8.5 /10, AIR- 6 GATE DA
B.Tech Computer Science and Engineering, Indian Institute of Technology (IIT) Jodhpur, CGPA: 8.97 /10
Class 12, PCM, Dehradun Public School, Percentage: 96.4
Class 10, SD Public School, CGPA: 10 /10
Publications
Aggregation-Assisted Proxyless Distillation: A Novel Approach for Handling System Heterogeneity in Federated Learning, International Joint Conference on Neural Networks (IJCNN) 2024 , [Paper]
An Extremely Lightweight CNN Model For the Diagnosis of Chest Radiographs in Resource-constrained Environments, International Journal of Medical Physics 2023 , [Paper]
The repo contains the first semester assignments and PYQs for Stochastic Models and Applications (STOMA), Linear Algebra and Applications (LAA), Data Structures and Algorithms (DSA), Computational Methods of Optimization (CMO)
Semester-2 @ IISc | ML/DL
Tech: Pytorch, Numpy, Python
The repo contains the second semester assignments and PYQs for Natural Language Processing (NLP), Machine Learning for Signal Processing (MLSP), Game Theory
Semester-3 @ IISc | ML/DL/Theory
Tech: Pytorch, Numpy, Python
The repo contains the third semester assignments and PYQs for Statistical Leaning Theory (SLT), Concentration Inequalities (CI), Topics in Visual Analytics (TVA), Dynamics of Linear System (DLS)
Industry Experience
Mastercard
Data Scientist [Internship]
May2025 - July2025
Multi Teacher GNN To MLP Distillation
Worked with AI Garage on Multi-Teacher GNN to MLP knowledge distillation for efficient inference
Trained multiple teacher GNN models like GCN, GAT, SAGE on node and edge prediction, contrastive tasks
Proposed GMOE2MLP, a graph mixture of expert to MLP distillation method and a novel cluster contrast loss to distill the embeddings from best teacher to MLP
Faaya Astu
ML Engineer [Full Time]
July2023 - July2024
Content Generation, Avatar Generation
Trained Stable Diffusion ControlNet models on Lineart and Colorbox control on VastAI GPU instance and deployed them on RunPod for more flexibility and control on print generation
Trained Low Rank Adaptation (LoRA) models using Kohya_SS for custom face and background generation
Experimented with custom ComfyUI workflows with integrated ControlNet, LoRA, InstantID models
Containerised ComfyUI with Docker and deployed them as Serverless Endpoints on RunPod and exposed endpoint APIs to AWS Lambda to create APIs for APP using AWS API gateway
Exawizards
AI Engineer [Internship]
June2022 - July2022
Split Neural Network Models
Worked on Split Neural Network ML paradigm and Splitted Mask-RCNN, FCN_Resnet50, YOLOv5 models for Instance segmentation, segmentation, face detection tasks
Implemented Autoencoder model for efficient image compression to latent space and setup Pysyft to communicate latents from Jetson Nano to GPU server, preserving data privacy at Jetson Nano
Proposed a novel FL Framework FedAgPD to simultaneously handle model and data heterogeneity
Leveraged Deep Mutual Learning at Client and Aggregation followed by Gaussian Noise based data free distillation at the Server, eliminating need of proxy dataset or GAN's
FedAgPD achieved 2x better performance compared to SOTA FL algorithms like FedDF, FedMD, Kt-pfl
IIT Jodhpur
June2021 - May2022
Extremely Lightweight CNN for Chest X-Ray Diagnosis
Designed a novel Lightweight CNN model (ExLNet) for the abnormal detection of Chest Radiographs
Fused Squeeze and Excitation blocks with Depth-wise convolution to create DCISE layer as a component of ExLNet, which outperforms SOTA models like Mobilenet, Shufflenet on NIH, VinBig medical datasets
Detected and classified cells data sample into necrotic and apoptotic cells
Finetuned various SOTA object detectors such as YOLO, SSD, RetinaNet, DeTR
Achieved remarkable results using DeTR with a Mean Average Precision (MAP) of 40.0
Projects
Leveraging Generative Modelling for SSL | ML/DL (Ongoing)
Tech: Pytorch, Neural ODE, Python, EBM
Leveraging gnenerative modelling for rich and generalizable self supervised representations
Generative AI Implemenation | ML/DL (Ongoing)
Tech: Pytorch, GANs, Diffusion
Implementing generative AI methods like Noise condition score network, Score matching
Representation Learning Algorithms | ML/DL
Tech: Pytorch, Contrastive Learning, Python
Implemented SSL methods like SimCLR, SupCon, BYOL, Barlow Twins, SimSiam, Triplet Margin and report their performance on CIFAR10/100 with two varients of encoder architecture, ResNet 18/50
Self Supervised Learning Methods for GNNs | ML/DL (Ongoing)
Tech: Pytorch, GNN, Python, SSL
Implementing SSL for GNNs
Image Captioning using Detection Transformer (DeTR) | ML/DL
Tech: Pytorch, Transformers, Python
Implemented modified DeTR from scratch in pytorch for image captioning task. Trained DeTR on Flickr30k dataset for 500 epochs and achieved a BLEU score of 57.36 on Flickr8k dataset
Vision Transformers Implementation | ML/DL
Tech: Pytorch, Vision Transformers, Python
Implemented 11 SOTA research papers on vision transformers variants like Swin Transformer, Pyramid ViT, Convolution ViT etc. for Image Classification from scratch in pytorch
Regularizing Federated Learning via Adversarial Model Perturbations | ML/DL
Tech: FL, Pytorch, Python, AMP
The project aims at implementing the state of the art methods for Federated Learning (FL) like scaffold, FedNTD, FedProx, FedAvg and regularize the client using adversarial model perturbations to reach flat minima.
CNNAlgos-Comparison | ML/DL
Tech: Python, Pytorch
The project consists of various deep CNN architectures (coded from scratch) on Retinal eye disease dataset (kaggle), and performed a comparative study among these deep architectures
Image Colorization | ML/DL
Tech: Python, Pytorch
The project aims at implementing Pix2Pix GAN architecture from scratch on RGB and LAB image format, to convert a black and white image to colored image
Mask-NoMask detection | ML/DL
Tech: Python, Pytorch, Flask, OpenCV
the project is build using pytorch library and the final trained model is then used for real time detection using openCV and also the testing can be done from web application build using flask
PRA-Visulaizer | web-dev
Tech: React, Javascript, HTML/CSS, JSX
The project consists of visualization of various page replacement algorithms such as (FIFO, LRU, OPR) etc. given number of frames and demand pages, the app can visualize how various algorithms handle page replacement.