GPU-accelerated YOLO (v8/v11) with PyTorch and TensorRT backends. Multi-camera multiplexing, Prometheus metrics, Kubernetes-native deployment. Production inference at <20ms across 30+ camera streams.
From 2018 to 2026, RIOS developed and deployed cutting-edge AI and robotics across industries — e-commerce, manufacturing, food processing, and wood products. The IP is now available for licensing and as open source.
GPU-accelerated YOLO (v8/v11) with PyTorch and TensorRT backends. Multi-camera multiplexing, Prometheus metrics, Kubernetes-native deployment. Production inference at <20ms across 30+ camera streams.
C++/Python observer architecture with pluggable algorithm pipeline. Three-stage depth processing (plane segmentation → corner estimation → full veneer estimation). 14 observer instances in production, 20+ diagnostic failure codes.
SAM2 and EfficientSAM extended for real-time video — camera predictors, streaming inference, multi-object tracking, mask hole filling. 4× speed improvement to 25ms/frame. ~4,000 LOC on top of the foundation models.
Full training-to-deployment pipeline (Pix2Pose, ICCV 2019). Multi-camera batch inference with Detectron2, ClearML experiment tracking, BOP-format datasets, Docker reproducibility.
Domain-specific YOLO11-seg models — 10 object classes for wood products. mAP 0.778–0.923 across modal, amodal, and binary segmentation. Production defect detection and top-sheet segmentation against similar-colored backgrounds.
ROS / ROS2 drivers for LUCID Triton RGB (Arena SDK), Lucid Helios ToF depth, FLIR Spinnaker, Luxonis OAK-D. Hardware triggering, timestamp filtering, buffered subscriptions with stale-data detection. Deployed across 4 production workcells.
Real-time food defect classification on conveyor lines. MobileNetV3 / ResNet50 (6 architectures benchmarked). 3-class output with GradCAM debugging, 27ms inference, windowed majority voting. 50+ versioned datasets, deployed across 6 production lines.
YOLO detection wrapped as a first-class ROS node — published bounding boxes, batched multi-camera inference, parameterized model swap. Drives downstream tracking and grasp targeting.
PySOT single-object tracker wrapped for ROS. Bounding-box tracking with detection re-initialization for moving-conveyor scenarios.
Custom KAREL server and TCP/IP RMI client for direct FANUC controller access. Stream motion with spline trajectory generation. Joint and cartesian state streaming. Five-state robot status model (NOT_CONNECTED → CONNECTED → ENABLED → ACTIVE → FAULTED). Bypasses the standard teach-pendant workflow. Deployed across 8 FANUC R-2000iC / 220U robots.
Composable FSM architecture (20 states) for industrial automation. Base framework with launcher, commander, event synchronization. Domain-agnostic — works for any sequential automation process. Multi-robot coordination for pick / place / QC with PLC-synchronized fault recovery.
50Hz encoder-driven tracking with TF frame publishing. Five encoder strategies (VirtualCSI, Float, IO, PLC, IOGate). Enables picking from and placing onto a continuously moving conveyor without line stops. Validated at 8.5–10 BPM, >100 ft/min.
Allen-Bradley PLC communication via EtherNet/IP (libplctag). Bidirectional tag read/write published as ROS topics. YAML-driven configuration. CLI utility already published on PyPI.
Integrated robotic grasping combining three published approaches — GR-ConvNet (IROS 2020), MVP Grasp, and Visual Pushing-Grasping. RealSense depth, 4 model variants, full training pipeline with ROS integration.
Piezoelectric tactile sensor (XTS-1) — 14×37 taxel array at 3–17kHz, picoCoulomb force measurement, 10 thermistors, 13 strain gauges. Three-layer driver stack with PIMPL pattern. TactileNet (dual-finger 3D CNN) classifies grasp events from 100ms windows. Integrated with Robotiq 2-finger gripper and custom Cypress FX3 + PSoC + FPGA firmware.
Generic motor driver abstractions, feed-hopper control logic, and the interface message packages shared across the dx_* stack — motor commands and status, hopper coordination, common message definitions.
FANUC robot URDF descriptions (LR Mate 200iD/7L, R-1000iA/130F, R-2000iC/220U), MoveIt ikfast kinematics plugins, geometry primitives shared across the dx_* stack. URDFs are independently available from FANUC and ROS-Industrial; this is the integrated RIOS bundle as deployed in production.
Production v2.0.0 — MCP and FastAPI server for YOLOE object detection. Six prompt modes (prompt-free, text, visual cross-image). Async GPU inference, web playground, Prometheus / Grafana monitoring. Powers interactive video analysis for customer evaluations.
Multi-LLM agent studio (OpenAI, Anthropic, Ollama) and interactive video-analysis app for industrial automation workflows. Upload video, draw monitoring zones on an interactive canvas, analyze content through AI chat. Object detection runs through MCP with six prompt modes — prompt-free, text, and visual cross-image reference — at real-time WebSocket frame rates. TypeScript / Next.js. Composes with the Omni-Vision backend so agents can call vision tools to drive inspection, analysis, and operator-facing reasoning on the factory floor.
Containerized edge agent for industrial deployments — camera management, GPU inference (YOLO via Ultralytics / TensorRT), ROS2, MQTT, live streaming, telemetry. Two internal frameworks underneath: a worker-orchestration engine (15+ YAML-configured worker types wired into a channel-based pipeline) and a high-performance C++ data/inference engine. Runs on industrial-rugged NVIDIA Jetson Orin hardware (CERA) — ≥30 fps capture, up to a week of onboard offline storage, NVIDIA DeepStream / GStreamer real-time inference, Allen-Bradley / EtherNet-IP PLC integration, LTE fallback. Deployed under Helm-managed clusters across 25+ edge sites.
Visual builder where customers define regions of interest (box, polygon, line), set ML-prediction-based event triggers, and generate KPI reports — turning raw model predictions into structured semantic events without writing code, then deploying that logic straight to the edge. Output to PLC, ROS2, or HTTP. Customer-configured monitoring across wood products, metals, and construction materials.
Multi-tenant web and mobile platform organized as Company → Sites → Workcell Groups → Clusters → Camera Feeds, with role-based access, deployment history, drift indicators, configuration management, and reporting. Underneath: Helm-based GitOps with Rancher Fleet, a Kubernetes operator for edge CRDs, HashiCorp Vault for secrets, and RClone for data sync — keeping every cluster in sync with cloud-managed configuration across geographically distributed customer sites.
Domain-agnostic developer tooling — project bootstrapping, app scaffolding, camera test harness, configuration management, diagnostics, code generation, and MCP integration for AI-assisted workflows. Seven repos forming a coherent dev environment.
Low-level glue for the edge stack — pinned CUDA installer scripts, ROS-with-GUI Docker images, Kubernetes NodePort controller for cluster-internal services, file sync for offline-tolerant deployments, and udev rule generation for industrial USB devices.
Shared foundation libraries underneath the dx_* ROS1 stack — Python bindings (pybind11), YAML configuration utilities, ROS communication primitives, event-handling interface, and a rosbridge utility layer. Closure-clean: the rest of the foundation builds without any held package.
End-to-end pipeline from SAM3 foundation-model auto-annotation → pseudo-labeling → YOLO distillation to edge models, with roughly 100× speedup from teacher to edge student. A self-improving annotation loop (auto-annotate with fine-tuned checkpoints → human review/correction → fine-tune → iterate) runs as a weekly 7-step automated process: validate → annotate → curate → export → train → evaluate → report. Production YOLO11-seg models reach mAP 0.778 (modal), 0.651 (amodal), and 0.923 (2-class) across a 10-class wood/defect ontology, with versioned edge deployment and integrity verification. 50+ versioned datasets form a compounding data flywheel behind every production model.
Automated annotation pipeline for manufacturing video at TB scale. C++ frame extraction at 500+ fps across 64 cores, semantic deduplication, 246-dimension feature engineering, and GPU-accelerated interactive learning via a web UI for tool-assisted labeling (click / rect / polygon with live retraining). Intelligent stratified sampling compresses millions of frames at 1000:1, then generates YOLO training sets with teacher/student distillation.
Full-stack web app for reviewing ML-generated annotations. Accept / reject workflows, lasso selection, keypoint annotation, COCO export. Closes the human-in-the-loop for auto-annotation pipelines.
Model registry with versioned deployment to edge devices. S3 distribution with SHA256 integrity verification. TensorRT optimization for Jetson / CUDA targets. Continuous model updates to 25+ deployed edge clusters.
Production data capture and curation — multi-topic rosbag recording, metadata extraction and indexing, configurable data collector for triggered captures, and shared utilities for parsing and replaying production traces.
Automated benchmarking harness for object detection models against Roboflow-hosted datasets. Reproducible eval reports, drift tracking across model versions, quality-metric interface messages for downstream consumption.
Programmatic generator for OCR training corpora — printed text on procedurally varied backgrounds, distortion / blur / noise pipelines, multilingual dictionaries. Fork of Belval/TRDG (MIT) with RIOS extensions.
163 systematic experiments, 9 specialized detectors for robotic pick-and-place failures (double picks, partial picks, misalignment). Zero false positives on normal operations, 100% detection on confirmed failures. Interactive threshold tuning UI (SvelteKit + FastAPI). 97.7% false-positive reduction via state-conditional anomaly detection.
PLC event → synchronized multi-camera video (UniFi Protect NVR integration) for root-cause analysis. Structured failure categories, operator intervention tracking, cross-cluster pod management, Prometheus metrics, production database logging. Central to achieving 80% line availability.
Real-time HMI (Rails 7) for 8-robot operations — safety status, workcell control (start / stop / namespace cycling / go-home), recipe management (3PLY, 4PLY, 5PLY, Double 3PLY), diagnostics. WebSocket to ROS via rosbridge_suite. Primary operator interface for 4-shift, 24/7 production.
K3s / Helm deployment across 4 workcells. Earthly hermetic builds, multi-arch Docker images, air-gapped OCI registry with pull-through cache, Tailscale remote access. 40+ PRs merged during live production with same-day deploys.
Camera FOV drift detection via multi-algorithm keypoint tracking, image quality assessment, critical-zone monitoring, ML-based template matching for baseline identification. Proactive issue detection across 25+ deployed edge devices.
Operator-facing infrastructure — rosbridge load tester for HMI capacity planning, Slab → Notion migration tool that moved the entire engineering knowledge base, Docusaurus templates for product documentation sites.
ROS rate diagnostics for monitoring topic publication health across the dx_* stack, plus a ground-truth testing harness for production validation. Foundation tooling for the operations team running 24/7 production.
Neural Surface Refinement for transparent and reflective objects (CVPR 2024). Novel approach to a notoriously difficult 3D reconstruction problem. Foundation for industrial inspection of transparent surfaces.
Novel architecture for 3D object pose estimation from images. Advances state-of-the-art for robotic manipulation planning.
Novel ray-based deformation approach for 3D scene understanding (WACV 2024). Extended 3D perception capabilities for industrial environments.
Integrated implementation of GR-ConvNet (IROS 2020), MVP Grasp, and Visual Pushing-Grasping — combined into a single production-ready system. Research-to-production pipeline for grasping across object types.
PyTorch implementation of dense correspondence learning (DC2) wrapped for ROS — pixel-level descriptors for category-level manipulation. Foundation work toward generalizable robotic grasping.
Filing an issue on the relevant repository is the fastest path. For anything else open-source related — questions, contributions, mirroring:
info@rios.aiLicensing, acquisition, research partnerships, data access — everything that isn't open source routes through here.
licensing@rios.ai