Blog

Beyond the Papers: Scaling Laws and Data Requirements in End-to-End Robotics
March 31, 2025
After implementing ACT, Diffusion Policy, and 3D Diffusion Policy for various manipulation tasks, I've discovered significant gaps between research papers and real-world performance. This blog examines the overfitting problems in current approaches and explores a critical question: Can scaling laws from language models apply to robotics?
Can we do the Stochastic Modeling in Visual Place Recognition?
February 1, 2025
Inspired by VAEs, I had this thought: "Hey, if uncertainty modeling works so well there, why not try it in other areas?" Since I'm pretty familiar with visual place recognition and noticed almost no one was doing this kind of stochastic modeling there, I thought it'd be a perfect playground for experimentation. This blog records my journey - from how I formulated the problem to what happened when I actually tried to make it work. Spoiler alert: things didn't go quite as planned, but the insights were fascinating!
Understanding the Dynamic Balance in Variational Autoencoders
January 18, 2025
This blog explores the fascinating antagonistic process in Variational Autoencoders (VAEs) between the encoder's predicted variance and decoder quality. We examine how the variance adapts throughout training, creating an automatic curriculum that balances reconstruction accuracy with latent space exploration. The post includes mathematical proofs and visualizations to illustrate this dynamic equilibrium.
Building Stable and Consistent Robot Control for Learning
December 23, 2024
This blog introduces a system for stable and consistent robot control that combines velocity control, delta pose representations, and null-space optimization. The framework ensures smooth motion generation and frame-independent movements, producing high-quality training data for advanced learning algorithms like Diffusion Policies and reinforcement learning models.