RIOS Intelligent Machines

AI Perception & Computer Vision

Real-Time GPU Inference Pipeline

Licensable→ Validated

GPU-accelerated YOLO (v8/v11) with PyTorch and TensorRT backends. Multi-camera multiplexing, Prometheus metrics, Kubernetes-native deployment. Production inference at <20ms across 30+ camera streams.

Pluggable Perception Framework

Licensable→ Production

C++/Python observer architecture with pluggable algorithm pipeline. Three-stage depth processing (plane segmentation → corner estimation → full veneer estimation). 14 observer instances in production, 20+ diagnostic failure codes.

Real-Time Segmentation Suite

Licensable→ Research

SAM2 and EfficientSAM extended for real-time video — camera predictors, streaming inference, multi-object tracking, mask hole filling. 4× speed improvement to 25ms/frame. ~4,000 LOC on top of the foundation models.

6DoF Pose Estimation Pipeline

Open source↗ Production

Full training-to-deployment pipeline (Pix2Pose, ICCV 2019). Multi-camera batch inference with Detectron2, ClearML experiment tracking, BOP-format datasets, Docker reproducibility.

→pix2pose_train · pix2pose_ros · pix2pose_ros_msgs · pix2pose-clearml

Wood & Defect Ontology Models

Licensable→ Production

Domain-specific YOLO11-seg models — 10 object classes for wood products. mAP 0.778–0.923 across modal, amodal, and binary segmentation. Production defect detection and top-sheet segmentation against similar-colored backgrounds.

Industrial Camera Drivers

Open source↗ Production

ROS / ROS2 drivers for LUCID Triton RGB (Arena SDK), Lucid Helios ToF depth, FLIR Spinnaker, Luxonis OAK-D. Hardware triggering, timestamp filtering, buffered subscriptions with stale-data detection. Deployed across 4 production workcells.

→arena_camera_ros · arena_camera_ros2 · rios-depthai-ros · spinnaker_camera_ros2 · camera_control_msgs · dx_camera · dx_depth_camera · dx_calibration

Food Quality Classification

Licensable→ Production

Real-time food defect classification on conveyor lines. MobileNetV3 / ResNet50 (6 architectures benchmarked). 3-class output with GradCAM debugging, 27ms inference, windowed majority voting. 50+ versioned datasets, deployed across 6 production lines.

YOLO ROS Deployment

Licensable→ Production

YOLO detection wrapped as a first-class ROS node — published bounding boxes, batched multi-camera inference, parameterized model swap. Drives downstream tracking and grasp targeting.

Visual Object Tracking

Open source↗ Research

PySOT single-object tracker wrapped for ROS. Bounding-box tracking with detection re-initialization for moving-conveyor scenarios.

→pysot_ros

Robot Control & Motion

FANUC Direct Control Integration

Licensable→ Production

Custom KAREL server and TCP/IP RMI client for direct FANUC controller access. Stream motion with spline trajectory generation. Joint and cartesian state streaming. Five-state robot status model (NOT_CONNECTED → CONNECTED → ENABLED → ACTIVE → FAULTED). Bypasses the standard teach-pendant workflow. Deployed across 8 FANUC R-2000iC / 220U robots.

Application State Machine Framework

Licensable→ Production

Composable FSM architecture (20 states) for industrial automation. Base framework with launcher, commander, event synchronization. Domain-agnostic — works for any sequential automation process. Multi-robot coordination for pick / place / QC with PLC-synchronized fault recovery.

Real-Time Conveyor Tracking

Licensable→ Production

50Hz encoder-driven tracking with TF frame publishing. Five encoder strategies (VirtualCSI, Float, IO, PLC, IOGate). Enables picking from and placing onto a continuously moving conveyor without line stops. Validated at 8.5–10 BPM, >100 ft/min.

PLC Integration Layer

Open source↗ Production

Allen-Bradley PLC communication via EtherNet/IP (libplctag). Bidirectional tag read/write published as ROS topics. YAML-driven configuration. CLI utility already published on PyPI.

→dx_plc · dx_io

Multi-Paper Grasping System

Open source↗ Validated

Integrated robotic grasping combining three published approaches — GR-ConvNet (IROS 2020), MVP Grasp, and Visual Pushing-Grasping. RealSense depth, 4 model variants, full training pipeline with ROS integration.

→agnet

Tactile Sensing & Grasp Intelligence

Licensable→ Validated

Piezoelectric tactile sensor (XTS-1) — 14×37 taxel array at 3–17kHz, picoCoulomb force measurement, 10 thermistors, 13 strain gauges. Three-layer driver stack with PIMPL pattern. TactileNet (dual-finger 3D CNN) classifies grasp events from 100ms windows. Integrated with Robotiq 2-finger gripper and custom Cypress FX3 + PSoC + FPGA firmware.

Motor & Hopper Control

Open source↗ Production

Generic motor driver abstractions, feed-hopper control logic, and the interface message packages shared across the dx_* stack — motor commands and status, hopper coordination, common message definitions.

→dx_motor · dx_motor_driver_msgs · dx_msgs · dx_behavior_msgs · hopper_control_msgs

Robot Geometry & Kinematics

Open source↗ Production

FANUC robot URDF descriptions (LR Mate 200iD/7L, R-1000iA/130F, R-2000iC/220U), MoveIt ikfast kinematics plugins, geometry primitives shared across the dx_* stack. URDFs are independently available from FANUC and ROS-Industrial; this is the integrated RIOS bundle as deployed in production.

→dx_geometry

Edge AI Platform

Omni-Vision — MCP Inference Backend

Open source↗ Validated

Production v2.0.0 — MCP and FastAPI server for YOLOE object detection. Six prompt modes (prompt-free, text, visual cross-image). Async GPU inference, web playground, Prometheus / Grafana monitoring. Powers interactive video analysis for customer evaluations.

→omni-vision-backend

Omni-Agent Studio

Open source↗ Validated

Multi-LLM agent studio (OpenAI, Anthropic, Ollama) and interactive video-analysis app for industrial automation workflows. Upload video, draw monitoring zones on an interactive canvas, analyze content through AI chat. Object detection runs through MCP with six prompt modes — prompt-free, text, and visual cross-image reference — at real-time WebSocket frame rates. TypeScript / Next.js. Composes with the Omni-Vision backend so agents can call vision tools to drive inspection, analysis, and operator-facing reasoning on the factory floor.

→omni-agent

Edge AI Agent Core

Licensable→ Production

Containerized edge agent for industrial deployments — camera management, GPU inference (YOLO via Ultralytics / TensorRT), ROS2, MQTT, live streaming, telemetry. Two internal frameworks underneath: a worker-orchestration engine (15+ YAML-configured worker types wired into a channel-based pipeline) and a high-performance C++ data/inference engine. Runs on industrial-rugged NVIDIA Jetson Orin hardware (CERA) — ≥30 fps capture, up to a week of onboard offline storage, NVIDIA DeepStream / GStreamer real-time inference, Allen-Bradley / EtherNet-IP PLC integration, LTE fallback. Deployed under Helm-managed clusters across 25+ edge sites.

Sense & Respond — No-Code Event Logic

Licensable→ Validated

Visual builder where customers define regions of interest (box, polygon, line), set ML-prediction-based event triggers, and generate KPI reports — turning raw model predictions into structured semantic events without writing code, then deploying that logic straight to the edge. Output to PLC, ROS2, or HTTP. Customer-configured monitoring across wood products, metals, and construction materials.

Cloud Platform & Operations

Licensable→ Production

Multi-tenant web and mobile platform organized as Company → Sites → Workcell Groups → Clusters → Camera Feeds, with role-based access, deployment history, drift indicators, configuration management, and reporting. Underneath: Helm-based GitOps with Rancher Fleet, a Kubernetes operator for edge CRDs, HashiCorp Vault for secrets, and RClone for data sync — keeping every cluster in sync with cloud-managed configuration across geographically distributed customer sites.

Teta — Dev Tooling Ecosystem

Open source↗ Production

Domain-agnostic developer tooling — project bootstrapping, app scaffolding, camera test harness, configuration management, diagnostics, code generation, and MCP integration for AI-assisted workflows. Seven repos forming a coherent dev environment.

→teta · teta-app · teta-camera · teta-config · teta-diag · teta-generate · teta-mcp

Edge Runtime Infrastructure

Open source↗ Production

Low-level glue for the edge stack — pinned CUDA installer scripts, ROS-with-GUI Docker images, Kubernetes NodePort controller for cluster-internal services, file sync for offline-tolerant deployments, and udev rule generation for industrial USB devices.

→cuda_installers · ros-docker-gui · rios-nodeport-controller · rios-file-sync · udevcreator

Citadel Foundation Libraries

Open source↗ Production

Shared foundation libraries underneath the dx_* ROS1 stack — Python bindings (pybind11), YAML configuration utilities, ROS communication primitives, event-handling interface, and a rosbridge utility layer. Closure-clean: the rest of the foundation builds without any held package.

→dx_rios_utils · dx_rios_pybind · dx_rios_yaml · dx_rios_communication · dx_events · rios-ros-utilities

ML Pipeline & Data Infrastructure

Foundation-to-Edge Auto-Annotation (FAST)

Licensable→ Validated

End-to-end pipeline from SAM3 foundation-model auto-annotation → pseudo-labeling → YOLO distillation to edge models, with roughly 100× speedup from teacher to edge student. A self-improving annotation loop (auto-annotate with fine-tuned checkpoints → human review/correction → fine-tune → iterate) runs as a weekly 7-step automated process: validate → annotate → curate → export → train → evaluate → report. Production YOLO11-seg models reach mAP 0.778 (modal), 0.651 (amodal), and 0.923 (2-class) across a 10-class wood/defect ontology, with versioned edge deployment and integrity verification. 50+ versioned datasets form a compounding data flywheel behind every production model.

Foreman — Large-Scale Video Auto-Labeling

Licensable→ Research

Automated annotation pipeline for manufacturing video at TB scale. C++ frame extraction at 500+ fps across 64 cores, semantic deduplication, 246-dimension feature engineering, and GPU-accelerated interactive learning via a web UI for tool-assisted labeling (click / rect / polygon with live retraining). Intelligent stratified sampling compresses millions of frames at 1000:1, then generates YOLO training sets with teacher/student distillation.

Annotation Review & Curation (GRADE)

Licensable→ Research

Full-stack web app for reviewing ML-generated annotations. Accept / reject workflows, lasso selection, keypoint annotation, COCO export. Closes the human-in-the-loop for auto-annotation pipelines.

Edge Model Deployment

Licensable→ Research

Model registry with versioned deployment to edge devices. S3 distribution with SHA256 integrity verification. TensorRT optimization for Jetson / CUDA targets. Continuous model updates to 25+ deployed edge clusters.

ROS Bag & Data Pipeline

Open source↗ Production

Production data capture and curation — multi-topic rosbag recording, metadata extraction and indexing, configurable data collector for triggered captures, and shared utilities for parsing and replaying production traces.

→rios_rosbag · rosbag_metadata · dx_data_collector · rios_data_utils

Model Benchmarking Harness

Open source↗ Validated

Automated benchmarking harness for object detection models against Roboflow-hosted datasets. Reproducible eval reports, drift tracking across model versions, quality-metric interface messages for downstream consumption.

→rios-roboflow-benchmarking

Synthetic OCR Training Data

Open source↗ Research

Programmatic generator for OCR training corpora — printed text on procedurally varied backgrounds, distortion / blur / noise pipelines, multilingual dictionaries. Fork of Belval/TRDG (MIT) with RIOS extensions.

→TextRecognitionDataGenerator

Production Operations

Operational Failure Detection Suite

Licensable→ Validated

163 systematic experiments, 9 specialized detectors for robotic pick-and-place failures (double picks, partial picks, misalignment). Zero false positives on normal operations, 100% detection on confirmed failures. Interactive threshold tuning UI (SvelteKit + FastAPI). 97.7% false-positive reduction via state-conditional anomaly detection.

Operational Intelligence Platform

Licensable→ Production

PLC event → synchronized multi-camera video (UniFi Protect NVR integration) for root-cause analysis. Structured failure categories, operator intervention tracking, cross-cluster pod management, Prometheus metrics, production database logging. Central to achieving 80% line availability.

Multi-Robot Operator Interface

Licensable→ Production

Real-time HMI (Rails 7) for 8-robot operations — safety status, workcell control (start / stop / namespace cycling / go-home), recipe management (3PLY, 4PLY, 5PLY, Double 3PLY), diagnostics. WebSocket to ROS via rosbridge_suite. Primary operator interface for 4-shift, 24/7 production.

Factory-Floor Kubernetes

Open source↗ Production

K3s / Helm deployment across 4 workcells. Earthly hermetic builds, multi-arch Docker images, air-gapped OCI registry with pull-through cache, Tailscale remote access. 40+ PRs merged during live production with same-day deploys.

→rios_github_actions · rios-github-workflow-templates · ros-focal2noble

Edge Health Monitoring

Licensable→ Production

Camera FOV drift detection via multi-algorithm keypoint tracking, image quality assessment, critical-zone monitoring, ML-based template matching for baseline identification. Proactive issue detection across 25+ deployed edge devices.

Diagnostic & Doc Tooling

Open source↗ Production

Operator-facing infrastructure — rosbridge load tester for HMI capacity planning, Slab → Notion migration tool that moved the entire engineering knowledge base, Docusaurus templates for product documentation sites.

→rosbridge-loadtester · slab_to_notion_importer · docusaurus-templates

Foundation Diagnostics & Ground-Truth

Open source↗ Production

ROS rate diagnostics for monitoring topic publication health across the dx_* stack, plus a ground-truth testing harness for production validation. Foundation tooling for the operations team running 24/7 production.

→dx_rate_diagnostics · dx_ground_truth_testing

Published Research

Transparent Object Reconstruction (TNSR)

Open source↗ Research

Neural Surface Refinement for transparent and reflective objects (CVPR 2024). Novel approach to a notoriously difficult 3D reconstruction problem. Foundation for industrial inspection of transparent surfaces.

→TNSR · anu_nerf · tnsr-pages

3D Pose Estimation (Pose3R)

Open source↗ Research

Novel architecture for 3D object pose estimation from images. Advances state-of-the-art for robotic manipulation planning.

→Pose3R

Ray Deformation Networks

Open source↗ Research

Novel ray-based deformation approach for 3D scene understanding (WACV 2024). Extended 3D perception capabilities for industrial environments.

→raydeform-pages

Robotic Grasping (3 papers)

Open source↗ Validated

Integrated implementation of GR-ConvNet (IROS 2020), MVP Grasp, and Visual Pushing-Grasping — combined into a single production-ready system. Research-to-production pipeline for grasping across object types.

→agnet

Dense Correspondence Networks

Open source↗ Research

PyTorch implementation of dense correspondence learning (DC2) wrapped for ROS — pixel-level descriptors for category-level manipulation. Foundation work toward generalizable robotic grasping.

→pytorch_dc2_ros

RIOS 2018–2026

What was built.

AI Perception & Computer Vision

Real-Time GPU Inference Pipeline

Pluggable Perception Framework

Real-Time Segmentation Suite

6DoF Pose Estimation Pipeline

Wood & Defect Ontology Models

Industrial Camera Drivers

Food Quality Classification

YOLO ROS Deployment

Visual Object Tracking

Robot Control & Motion

FANUC Direct Control Integration

Application State Machine Framework

Real-Time Conveyor Tracking

PLC Integration Layer

Multi-Paper Grasping System

Tactile Sensing & Grasp Intelligence

Motor & Hopper Control

Robot Geometry & Kinematics

Edge AI Platform

Omni-Vision — MCP Inference Backend

Omni-Agent Studio

Edge AI Agent Core

Sense & Respond — No-Code Event Logic

Cloud Platform & Operations

Teta — Dev Tooling Ecosystem

Edge Runtime Infrastructure

Citadel Foundation Libraries

ML Pipeline & Data Infrastructure

Foundation-to-Edge Auto-Annotation (FAST)

Foreman — Large-Scale Video Auto-Labeling

Annotation Review & Curation (GRADE)

Edge Model Deployment

ROS Bag & Data Pipeline

Model Benchmarking Harness

Synthetic OCR Training Data

Production Operations

Operational Failure Detection Suite

Operational Intelligence Platform

Multi-Robot Operator Interface

Factory-Floor Kubernetes

Edge Health Monitoring

Diagnostic & Doc Tooling

Foundation Diagnostics & Ground-Truth

Published Research

Transparent Object Reconstruction (TNSR)

3D Pose Estimation (Pose3R)

Ray Deformation Networks

Robotic Grasping (3 papers)

Dense Correspondence Networks

Inquiries.

Open source

General inquiries

RIOS
2018–2026