What do these badges mean?
- 🚀ShippingCode exists. Multiple GitHub repos already reference this paper — people are building on it.
- 📈ClimbingCitation velocity is rising. Researchers are starting to pick it up.
- 💤QuietPublished but no notable signal yet. Most papers live here — could become anything later.
- 🎭HypeHeavy social buzz but no shipping signal. The counter-signal — defer until Twitter/X data is wired up.
- 2605.18727·May 18, 2026·~11 mincs.ROcs.AI
DexHoldem: Playing Texas Hold'em with Dexterous Embodied System
Feng Chen, Tianzhe Chu, Li Sun, Pei Zhou, +5
ELI5A robot hand learns to play Texas Hold'em by manipulating cards and chips on a table. The benchmark tests whether it can see what's happening, decide what move to make, physically execute it with a dexterous hand, and keep the game playable for the next move.
Problem solvedReal-world robot systems need evaluation beyond single skills—they must perceive dynamic scenes, make context-aware decisions, execute complex hand movements, and maintain usable environments. Existing benchmarks don't test this full closed-loop capability, leaving gaps between what robots can do in isolation and what they can do in practice.
- 2605.18704·May 18, 2026·~10 mineess.SPcs.LG
Learned Memory Attenuation in Sage-Husa Kalman Filters for Robust UAV State Estimation
Kenan Majewski, Marcin Żugaj
ELI5Instead of a fixed knob that trades off speed vs. stability, this system learns a smart adaptive knob for Kalman filters that can handle noisy sensors and dropouts on drones by watching patterns in sensor errors.
Problem solvedDrones lose GPS, vibrate, and face changing noise patterns—standard Kalman filters assume fixed noise that doesn't change. A single forgetting factor can't simultaneously handle sudden sensor glitches and long-term drift without breaking.
- 2605.18617·May 18, 2026·~12 mincs.ROcs.AIcs.CV
ManiSoft: Towards Vision-Language Manipulation for Soft Continuum Robotics
Ziyu Wei, Luting Wang, Chen Gao, Li Wen, +1
ELI5A new benchmark and simulator that teaches robots with bendy, flexible arms to manipulate objects using vision and language instructions. Unlike rigid robot arms, soft arms can squeeze through tight spaces, but they're harder to control because you can't tell exactly where they are—this work builds tools to train and test policies that work with these floppy arms.
Problem solvedMost robot manipulation research focuses on stiff arms with precise sensors; soft arms are more adaptable for cluttered spaces but lack reliable position feedback and have complex distributed control. This benchmark and simulator let researchers develop and test manipulation policies specifically designed for deformable robots, filling a gap in the field.
- 2605.18593·May 18, 2026·~12 mincs.CRcs.AIcs.RO
Not What You Asked For: Typographic Attacks in Household Robot Manipulation
Ali Iranmanesh, Peng Liu
ELI5Researchers show that stickers with printed text can trick household robots into grabbing the wrong objects. By exploiting how vision systems (like CLIP) read both images and text, adversarial stickers cause robots to physically manipulate incorrect items—a real safety problem that's never been tested in actual robot tasks before.
Problem solvedHousehold robots using vision-language models are vulnerable to cheap, easy-to-deploy adversarial attacks that cause them to perform wrong physical actions. Prior work only tested these attacks in simulations or navigation; this exposes the threat in actual manipulation tasks where safety failures have real consequences.
- 2605.18556·May 18, 2026·~12 mincs.ROcs.AI
Key-Gram: Extensible World Knowledge for Embodied Manipulation
Jingjing Fan, Siyuan Li, Botao Ren, Zhidong Deng
ELI5A robot learns to follow instructions better by storing word meanings in a separate memory bank instead of mixing language understanding with visual perception. Think of it like giving a robot a dictionary it can look up while watching what's in front of it, rather than trying to understand words and see at the same time.
Problem solvedRobot manipulation policies struggle when instructions and visual understanding compete for the same computational resources, making it hard to reuse knowledge across tasks or add new instructions without retraining. This separates language knowledge so robots can reason about what they see while quickly pulling up relevant instruction meanings.
- 2605.18553·May 18, 2026·~14 mincs.CVcs.AI
StableHand: Quality-Aware Flow Matching for World-Space Dual-Hand Motion Estimation from Egocentric Video
Huajian Zeng, Chaohua Yao, Yuantai Zhang, Jiaqi Yang, +2
ELI5A system that reconstructs what both your hands are doing in 3D space from a head-mounted camera, even when hands leave the frame or are hidden behind objects. It learns which frames have reliable hand tracking and uses that confidence to fill in the gaps.
Problem solvedRobot learning from human demonstrations needs accurate hand positions and poses, but egocentric video loses hands constantly (turning your head, picking things up). Existing methods treat all frames equally, causing errors that break robot imitation learning.
- 2605.18460·May 18, 2026·~6 mincs.AIcs.LGcs.NE
When Fireflies Cluster; Enhancing Automatic Clustering via Centroid-Guided Firefly Optimization
MKA Ariyaratne, Azwirman Gusrialdi, Yury Nikulin, Jaakko Peltonen
ELI5A new clustering algorithm inspired by how fireflies group together automatically finds the right number of clusters and their shapes without needing humans to guess first, solving problems where K-Means fails on oddly-shaped or unevenly-dense data.
Problem solvedK-Means requires you to specify cluster count beforehand and struggles with non-uniform shapes and densities. This method automatically determines optimal clusters and handles complex spatial patterns, making sensor networks and spatial data easier to organize without manual tuning.
- 2605.18407·May 18, 2026·~10 mincond-mat.mes-hallcond-mat.mtrl-scics.AI
Qumus: Realization of An Embodied AI Quantum Material Experimentalist
Lihan Shi, Zhaoyi Joy Zheng, Xinzhe Juan, Yimin Wang, +13
ELI5A robot lab that uses AI to independently run physics experiments—it generates ideas, plans procedures, executes them with robotic hands, analyzes results, and learns from mistakes, successfully creating graphene and nanoscale devices on its own.
Problem solvedScientists spend months on repetitive materials experiments with slow feedback loops. This system compresses that cycle by having an AI agent autonomously execute, monitor, and refine experiments in real time without human intervention between steps.
- 2605.18385·May 18, 2026·~9 mincs.ROcs.AI
Towards Ubiquitous Mapping and Localization for Dynamic Indoor Environments
Halim Djerroud, Nico Steyn, Olivier Rabreau, Patrick Bonnin, +1
ELI5Instead of robots figuring out where they are using their own cameras, you put cameras on the walls throughout a building that create and maintain a shared map for all robots to use. This offloads the hard work from the robots themselves.
Problem solvedRobots typically struggle with mapping and localization in changing indoor spaces and need expensive onboard sensors. Fixed ceiling/wall cameras can provide reliable global positioning for multiple robots cheaply, letting simpler robots work effectively in shared spaces.
- 2605.18373·May 18, 2026·~12 mincs.ROcs.LGmath.DS
Dynamic robotic cloth folding with efficient Koopman operator-based model predictive control
Edoardo Caldarelli, Franco Coltraro, Adrià Colomé, Lorenzo Rosasco, +1
ELI5A robot learns to fold cloth quickly by using a math trick that converts the messy, nonlinear physics of cloth into a simpler linear model, then uses this simplified model to plan fast folding motions in real time.
Problem solvedRobots struggle to fold cloth quickly and accurately because cloth dynamics are complex and hard to simulate. This approach makes planning 10–100x faster by swapping the expensive physics simulator for a learned linear model during motion planning.
- 2605.18303·May 18, 2026·~10 mincs.LGcs.AIcs.CV
PH-Dreamer: A Physics-Driven World Model via Port-Hamiltonian Generative Dynamics
Xueyu Luan, Chenwei Shi
ELI5Instead of learning physics from scratch, this method teaches a world model to follow physics principles like energy conservation. It imagines future states more accurately by embedding real physical laws into its internal reasoning, like adding guardrails that keep simulations realistic.
Problem solvedAI agents learning through simulation often produce jerky, inefficient movements and unrealistic imagined futures because they don't respect physics principles. This causes poor real-world transfer and wasted energy. The method fixes this by baking in conservation laws, making agents plan smoother, more realistic behavior.
- 2605.18197·May 18, 2026·~12 mincs.ROcs.AIcs.CV
RGB-only Active 3D Scene Graph Generation for Indoor Mobile Robots
Giorgia Modi, Davide Buoso, Giuseppe Averta, Daniele De Martini
ELI5A robot with just a regular camera can now build a detailed 3D map of a room showing where objects are and how they relate to each other, and it can intelligently choose where to look next to find more things—without needing expensive depth sensors like LiDAR.
Problem solvedRobots have been limited to expensive depth cameras (LiDAR, RGB-D) for 3D mapping, and they waste time exploring randomly instead of strategically looking where they're likely to find new objects. This lets any robot with just a camera do better 3D scene understanding.
- 2605.18184·May 18, 2026·~10 mincs.ROcs.AIcs.CV
Fixed External Cameras as Common Prior Maps for Active 3D Scene Graph Generation
Giorgia Modi, Davide Buoso, Giuseppe Averta, Daniele De Martini
ELI5A robot uses fixed cameras around a room (like security cameras) as a head start to understand what's in the space, then actively explores to fill in gaps and build a complete 3D map of objects and their relationships.
Problem solvedRobots waste time exploring environments from scratch when fixed external cameras could give them context first. This lets them explore more efficiently by knowing where to look next, cutting exploration time and improving what they discover.