Ojas Taskar

Hey there! I'm Ojas, a Master of Science (MSE) Robotics student at Johns Hopkins University's Laboratory for Computational Sensing & Robotics. I'm currently working on implementing diffusion-based token generation backbones in 5B parameters or lesser Vision-Languages Models, using optimized CUDA kernels for improved pipeline inference, benchmarking against autoregressive models on RTX3060 GPU.

Previously, at IACL, JHU , I worked on bi-directional DDIMs for medical image synthesis with Prof. Jerry Prince. I also worked with Dr. Amir Kheradmand at the VOR Lab, Johns Hopkins Medicine on millimetre-accurate 6 DoF pose estimation. During undegrad, I interned at KIREAP, Inc. with Dr. Omkar Halbe and Mr. Karthik Desai on problems involving motion blur reversal and autonomous vision-based drone landing pipelines. I was also a robotics intern at 1 Martian Way Industries with Mr. Karan Kamdar, on epsilon-greedy deep Q-learning RL algorithms for on-board visual navigation of drones.

Statement of Interest: I like to build robust perception systems, working end-to-end from dataset curation, architectures and infrastructioning, evaluation metrics and heuristics, deployment under real-time constraints and closed-loop performance improvement. I also enjoy incorporating geometric information, diffusion-based generative priors and self-supervised methods into learned models via pre/post-training to improve performance without explicit re-training. I'm also interested in using VLMs/VLAs for robot manipulation, navigation and path-planning, focusing on systems that utilize spatial awareness in these models. I write efficient, CUDA-accelerated code for deploying on edge compute boards, and have worked across dockerized systems, distributed training and model profiling to identify bottlenecks.

e-mail / linkedin / github / resume / showcase

I have also been a part of:

1 Martian Way Industries

June 2023 - Aug 2023

Robotics Intern
report

• Formulated a Python GUI to simplify working with Microsoft AirSim command line interface tools.
• Designed a GRU architecture to detect cracks in concrete from drone's onboard video, with accuracy of 88%.
• Created a epsilon-greedy RL algorithm with stable baselines3 for a power-line surveying drone, improving fault-finder efficiency by 26%.
• Executed hardware-in-loop simulations using PX4 and ArduPilot on a onboard flight computer to detect compile-time bugs reliably.

News

Research Experience

I have also been a part of:

Projects

Education

Teaching Experience

Technical Skills