Machine Learning

This section is not an overview of machine learning (ML) techniques nor an introduction to the field of machine learning. Instead, it focuses on the use cases of ML in robotics and specific implementations where ML is applied to enhance robotic systems. From perception to decision-making, this section explores practical guides and real-world applications.

This section demonstrates how machine learning enhances robotic systems by enabling functionalities like object detection, natural language understanding, and decision-making. By focusing on specific implementations and integration techniques, it bridges the gap between theoretical ML concepts and practice.

Key Subsections and Highlights

Creating Custom Semantic Segmentation Data Explains how to prepare and annotate custom datasets for semantic segmentation tasks. Includes guidelines for outsourcing labeling tasks and examples using GIMP.
Introduction to Reinforcement Learning Covers reinforcement learning concepts and Bellman equations. Discusses methods like dynamic programming, Monte Carlo, and temporal difference learning, with an emphasis on robotic applications.
Generative Modeling High-level overview of major generative modeling families including VAEs, GANs, flow-based models, diffusion models, and autoregressive approaches.
Offline Reinforcement Learning Introduces offline RL, why fixed-dataset learning matters in robotics, and commonly used methods such as CQL, behavior regularization, and IQL.
Introduction to Diffusion Models and Diffusion Policy Comprehensive introduction to diffusion models and their application in robotics through diffusion policies. Covers ODE and SDE formulations, their practical implications, and how diffusion policies enable multi-modal action learning for complex robotic tasks.
GRPO for Diffusion Policies in Robotics Introduces Group Relative Policy Optimization (GRPO) and its application to diffusion policies using SDE formulation for stochastic sampling. Covers GRPO’s origins in LLMs, the mathematical framework, and practical implementation strategies for optimizing robot policies with reward-based learning.
Mediapipe: Real-Time ML for Robotics Introduces MediaPipe for live ML inference on various platforms, including Android, iOS, and IoT. Highlights body pose tracking, hand tracking, and object detection pipelines.
NLP for Robotics Explores how natural language processing (NLP) enables robots to understand and respond to human language. Includes an overview of transformer models and HuggingFace library usage.
Python Libraries for Reinforcement Learning A comparison of popular RL libraries like OpenAI’s Spinning Up, Stable Baselines, and RLlib. Provides guidance on choosing the right library based on scalability and ease of use.
YOLO Integration with ROS and GPU Acceleration Step-by-step tutorial for integrating the YOLO object detection framework with ROS. Includes GPU acceleration setup with CUDA for real-time performance.
Training Darknet on Custom Datasets Explains how to train YOLO (Darknet) models on custom datasets. Covers data preparation, configuration, and tips to improve detection performance.
YOLOv5 on NVIDIA Jetson Platforms Guides training YOLOv5 and deploying it on Jetson edge devices using TensorRT. Includes environment setup, ONNX export, and TensorRT integration with ROS.
Distributed Training With PyTorch A tutorial on scaling deep learning models across multiple GPUs and nodes using PyTorch’s DistributedDataParallel (DDP).
Imitation Learning With a Focus on Humanoids Covers the foundations of imitation learning and provides a practical guide for data collection and policy deployment on humanoid robots.
6D Pose Estimation with YOLO and ZED Practical implementation notes for fusing YOLO detections with ZED depth data for 6D pose and velocity tracking workflows.
Practical Guide to Model Quantization and TensorRT Optimization Techniques for reducing model footprint and accelerating inference with quantization strategies and TensorRT deployment.
Installing YOLO on ARM Architecture Devices Deployment-focused guide for getting YOLO running efficiently on ARM-based platforms.
Comprehensive Guide to Albumentations Overview of augmentation pipelines and practical transform choices for robust vision model training.
Kornia Technical Guide Technical overview of using Kornia for differentiable computer vision operations inside deep learning pipelines.
Integrating OLLAMA LLMs with Franka Arm Integration approach for connecting local LLM inference through OLLAMA to Franka robot control workflows.
Multi-task Learning: A Starter Guide Introductory guide to shared-representation multi-task model design and training considerations.
Understanding Kalman Filters and Visual Tracking Explains Kalman filtering fundamentals and their practical role in visual tracking pipelines.
Knowledge Distillation Practical Implementation Guide Applied walkthrough for transferring performance from larger teacher models into smaller deployable students.
Neural Network Optimization Using Model Pruning Practical pruning methods to reduce model complexity while preserving task performance.
Deep Learning Techniques for 3D Datasets Survey of methods and implementation patterns for deep learning over point clouds and other 3D data modalities.
Optical Flow: Classical to Deep Learning Implementation Comparative guide from traditional optical-flow methods to modern deep-learning implementations.

Key Subsections and Highlights

Resources