
research note
LabOSBench — Benchmarking Computer Use Agents for Scientific Instrument Control
LabOSBench addresses the challenge of evaluating computer-use AI agents on scientific instrument GUIs, where controlling sophisticated devices involves complex, procedural, feedback-driven workflows










