Cybersecurity is a two-way street where both vendors and researchers must act responsibly. At Alias we're committed to improving the robotics & cybersecurity industry response times to security bugs — and to spread the word, we disclose our research openly as it happens.
25+ publications across cybersecurity AI, robot security, agent heuristics, datasets and game-theoretic security — accepted at the world's top research and security venues. 30+ CVE IDs issued as a CVE Numbering Authority since 2020.
What is the best agent harness for cybersecurity? Benchmarking five heterogeneous scaffolds (CSI::Claude, ::Codex, ::Mistral, ::GCAI, ::CAI) on 33 Cybench challenges with a fixed model (alias2-mini), no single scaffold dominates — each contributes a unique solve the others miss. A blackboard multi-agent architecture that lets scaffolds exchange typed findings reaches 19/33 (57.6%), a 27% relative gain over the best individual scaffold (15/33, 45.5%) — 25% faster at comparable cost. Heterogeneity of scaffolds is the structural lever toward cybersecurity superintelligence.
A 14-month corpus of cybersecurity LLM trajectories collected through the open-source CAI framework: 230,935 session logs, 26 M user prompts from 16,768 source IPs across 123 countries, totalling 18.07 TB of durable storage. Hands-on by construction (36% offensive, 20% attacker-intent, 28% business, 4% defensive) — the largest described corpus of LLM-driven hacker trajectories. It exposes how operators routinely paste live credentials, hostnames and tokens into prompts they know are being logged, motivating the case for on-premise, operator-trust-boundary cybersecurity LLMs.
LLM-powered cyberattack agents are rapidly outperforming traditional benchmarks like Jeopardy-style CTFs and static cyber ranges, exposing the need for more adaptive evaluation environments. Dynamic Cyber Ranges—augmented with AI-driven defender agents—significantly reduce attack success (down to 0–55%) and maintain robustness as models evolve, with even smaller on-premise models proving highly effective at defense.
Has generative AI compromised robot cybersecurity? What historically required deep knowledge of ROS, ROS 2, and robotic system internals can now be automated by anyone using AI tools. Three case studies demonstrate vulnerabilities in consumer robots: an autonomous lawnmower, a powered exoskeleton, and a window-cleaning robot, with CAI discovering 38 vulnerabilities automatically.
Cybersecurity superintelligence -- artificial intelligence exceeding the best human capability in both speed and strategic reasoning -- represents the next frontier in security. This paper documents the emergence of such capability through three major contributions that have pioneered the field of AI Security.
Can we make Cybersecurity AI agents better by guiding them strategically with context-additions that resemble how "humans think" using Game Theory? Our algorithm, Generative Cut-the-Rope (G-CTR) implements this and reduces ambiguity, collapses the LLM’s search space, suppresses hallucinations, and keeps the model tightly anchored to the most strategically relevant parts of the problem.
Are Capture-the-Flag competitions obsolete? In 2025, Cybersecurity AI (CAI) systematically conquered some of the world's most prestigious hacking competitions, achieving Rank #1 at multiple events and consistently outperforming thousands of human teams. Across five major circuits have become a solved game for well-engineered AI agents.
Practical insights learned from participating in the Dragos OT CTF 2025 using the Cybersecurity AI (CAI) framework, highlighting strengths and limitations of AI agents when operating in real OT challenge environments and detailing actionable lessons for defensive and offensive workflows.
Existing benchmarks assess isolated skills rather than integrated performance. To address this limitation, we present the Cybersecurity AI Benchmark (CAIBench), a modular meta-benchmark framework that allows evaluating LLM models and agents across offensive and defensive cybersecurity domains, taking a step towards meaningfully measuring their labor-relevance.
Empirical evaluation of AI systems in cybersecurity Attack/Defense CTFs reveals defensive agents achieve 54.3% patching success versus 28.3% offensive initial access, though operational constraints eliminate this advantage, providing first controlled evidence challenging AI attacker superiority claims.
In cooperation with other researchers, this book stipulates the inclusion of security in robotics from the earliest design phases onward. We advocate for quantitative methods of security management, cover vulnerability scoring systems and account for the highly distributed nature of robots.
We show how simple attacks are feasible in OT and how an industrial cybersecurity solution is not capable of capturing the complexity of modern robot interactions. We extend one of such solutions with a robot-specific Endpoint Protection Platform (EPP) and successfully protect the robot from attacks.
Alias Robotics has been a CVE Numbering Authority (CNA) since February 2020 — a status shared with Microsoft, Google, Cisco and ~250 organizations worldwide, but unique to us for robots and robotic components. 30+ CVE IDs issued, two CISA ICS advisories co-authored, and a decade of robot-security research backing every line of CAI & alias. This is the field-validated security background that separates a cybersecurity AI lab from a cybersecurity AI app.
11 CVE IDs · CISA ICSA-21-280-02 · thousands of MiR100/200/250/500/1000 affected
CVE-2020-10269 — Hardcoded WiFi access-point credentialsCVE-2020-10270 — Default Control Dashboard credentialsCVE-2020-10271 — Unauthenticated ROS APIs CVSS 9.8CVE-2020-10272 — ROS computational graph exposed CVSS 8.8CVE-2020-10273 — No encryption on stored artifactsCVE-2020-10274/10275 — REST API default-credential bypassCVE-2020-10276 — Default SICK safety-PLC password (E-stop bypass)CVE-2020-10277 — No BIOS passwordCVE-2020-10280 — Wireless interface insecurity13 CVE IDs · CISA ICSA-21-315-02 · 6 DDS vendors · co-research w/ Trend Micro, ADLINK, TXOne
CVE-2021-38427 — RTI Connext stack-based buffer overflowCVE-2021-38429 — OCI OpenDDS network amplification DoSCVE-2021-38441 — CycloneDDS XML write-what-whereCVE-2021-38443 — CycloneDDS invalid-structure handlingCVE-2021-38445 — FastDDS PID_BUILTIN_ENDPOINT_QOS crashCVE-2021-38487 — RTI Connext network amplification80+ flaws filed in RVD · 76% rated High/Critical · Akerbeltz ransomware POC
CVE-2016-6210 — OpenSSH password DoS (UR CB 3.1, fw 3.10–3.13)Cooperative disclosure with BSI (Germany) & INCIBE (Spain)
First CVE batch · foundational ROS communication-graph flaws · RVD founding 2019
CVE-2019-19625CVE-2019-19626CVE-2019-1962738 vulnerabilities discovered by CAI in 7 hours · vs ~33 human-effective hours
Disclosure policy: 90-day responsible disclosure, inspired by Google Project Zero · RVD: the Robot Vulnerability Database, founded & sponsored by Alias Robotics · Reach our CNA: cve@aliasrobotics.com
Alias Robotics is a research-driven company. We are committed to advancing the state of the art in robot cybersecurity and we are proud to be part of the research community.
Key research initiatives, workshops, and focused actions that advance the field of robot cybersecurity through collaborative research, practical demonstrations, and research workshops and collaborations.