Work
-
Apple
- Optimized end-to-end MacBook defect detection by improving synthetic data, enhancing unsupervised and supervised ML models. Hosted refined pipeline on internal platform, enabling over ∼ 1000s employees explore model selection.
- Boosted synthetic data quality by ∼ 25% utilizing Apple’s MM1 and fostering cross-functional team collaboration.
- Doubled anomaly detection recall score via a combined student-teacher and autoencoder-based architecture.
- Achieved ∼ 7.5% IoU score improvement with SegFormer (Transformer-based), outperforming Upernet (CNN-based).
- Improved object detection mAP@IoU=0.5 by ∼ 6% using YOLOv8x vs. YOLOv5s, with multi-node distb. training.
-
Carnegie Mellon University School of Computer Science
- Engineered interactive visualizations showcasing SD internals, running entirely on the browser leveraging WebGPU
- Accelerated SD model by 70% through multi-node latent consistency training on 32 V100 GPUs
- Creating a web-based tool for LLM finetuning, to be adopted in the AI curriculum for US high schools under AI4GA
- Streamlining finetuning by eliminating complex installations, enabling a one-click experience with minimal dependencies
-
Hexo
- Finetuned a stable diffusion model on several custom product datasets using dreambooth on AWS EC2.
- Built robust pipelines for inpainting and outpainting, efficiently generating product backgrounds without distortions.
- Improved inference generation through extensive hyperparameter experimentations, ControlNet, and mask modulations.
- Devised custom model performance evaluation procedures through FID (~7.5), UIDS, and PIDS metric calculations.
-
Orangewood Labs
- Built AR tracking for custom objects using BRIEF feature matching, RANSAC homography, and content warping.
- Utilized topological optimization tools to achieve a 40 percent reduction in weight of robotic links.
- Manufactured a heat chamber at 1/10th cost of off-the-shelf products for thermal performance testing of robot.
Education
-
2023.08 - 2025.05 |
Pittsburgh, PA |
Carnegie Mellon University
- Machine Learning
- Deep Learning
- Advanced Computer Vision
Awards
-
2023.07.01
K.C. Mahindra Education Trust
K.C. Mahindra Scholarship is an interest-free loan scholarship for pursuing advanced studies overseas. Awarded to top ~3% of the applicants among a pool of 2000+ applicants.
-
2022.11.01
The RSI International Conference on Robotics and Mechatronics
Paper titled 'An Abstract Model for Onshore Wind turbine Blade Maintenance' won the 'Best Paper Award' at the '10th RSI International Conference on Robotics and Mechatronics (ICRoM)' among a pool of 1000+ submissions across the world.
-
2023.05.01
DAAD (German Academic Exchange Service)
The DAAD-WISE Scholarship funds students to carry out a research internship at a publicly-funded higher education institution in Germany. I was one of the 100-odd students from India to be awarded this scholarship.
-
2023.04.01
National Science Academies (IASc-INSA-NASI)
Summer Research Fellowship Programme (SRPF) is a prestigious research internship programme supported by the three National Science Academies (IASc-INSA-NASI). I was one of the few students to be selected from a pool of over 30 thousand applicants.
Publications
-
10th RSI International Conference on Robotics and Mechatronics (ICRoM)
-
Journal of Physics: Conference Series
-
9th RSI International Conference on Robotics and Mechatronics (ICRoM)
Skills
| Programming Languages |
| Python |
| C++ |
| Js |
| Machine Learning & AI |
| PyTorch |
| Computer Vision |
| NumPy |
| Data Analysis |
| Cloud & Infrastructure |
| AWS |
| GCP |
| Linux |
| Engineering Tools |
| SolidWorks |
| FEA |
| CAD |
Projects
-
2023.09 - 2023.12
- Executed Lucas Kanade tracker with template correction and inverse compositional algorithm, optimizing runtime
- Reconstructed 3D point clouds via F matrix computation, stereo triangulation, and bundle adjustment optimization
-
2022.06 - 2022.11
- Evaluated a classification model of ResNet18 architecture with Levit, Swin, Convnext, and EfficientNetV2 families
- Attained accuracy of 0.98 and F1 score of 0.85 through construction of an ensemble of multiple architectures
- Performed semantic segmentation using transfer learning with pre-trained ImageNet weights for improved performance
- Achieved an Intersection over Union (IoU) of 0.95 using the U-Net framework and ResNet34 architecture