Learning-based control
Safe navigation and continuous control for mobile robots, trained end-to-end with deep reinforcement learning.
Working at the intersection of deep reinforcement learning, computer vision, robotics, and production AI — from safe mobile-robot navigation to real-time perception infrastructure.
My work spans the full loop of an intelligent machine — sensing the world, learning a policy, and shipping it reliably. These threads reinforce each other.
Safe navigation and continuous control for mobile robots, trained end-to-end with deep reinforcement learning.
Object & anomaly detection, OCR, lane and 3D scene understanding — from point clouds to industrial imaging.
Production-grade ML: GPU inference, backend services, containers and CI/CD that move research into the real world.
Peer-reviewable contributions in robot navigation and efficient perception. Full text on arXiv.
This study trains a deep reinforcement learning agent to enable a wheeled mobile robot to navigate autonomously and collision-free toward a target in complex, obstacle-rich environments. Using only raw LiDAR observations and the goal pose, a deep neural network maps perception directly to continuous velocity commands. The work compares DDPG and PPO and introduces an enhanced PPO network architecture together with a tailored reward function, validated in both obstacle and obstacle-free Gazebo environments.
This study develops an optimized, lightweight CNN for lane detection that combines binary segmentation with Affinity Fields. Lane instances are recovered by clustering the segmentation and affinity outputs, which lets the method handle a varying number of lanes and lane-change scenarios without a fixed lane-count assumption. Using a less complex network than existing approaches, the method is demonstrated on the TuSimple dataset.
Research, perception, and systems work — built, trained, and shipped.
Mapless mobile-robot navigation: deep-RL motion planners — PPO, DDPG, SAC and TD3 — that drive a TurtleBot to a goal from sparse LiDAR and the target pose, with no SLAM and no map.
From-scratch reimplementations of PointNet, PointNet++, DGCNN and PointMLP for point-cloud classification and part segmentation.
An end-to-end 3D object-detection pipeline on RGB-D data, comparing a from-scratch model against a fine-tuned EfficientNet-B3 backbone under a small-data regime.
A Vision-Transformer autoencoder for industrial inspection: reconstruct the image, then localize defects from patch-wise reconstruction error.
A real-time object-detection pipeline built with YOLOv3 and non-max suppression on the COCO dataset.
A from-scratch ray tracer in C (miniLibX): ray–object intersections, Phong shading, and an .rt scene parser producing photorealistic renders.