Windows Agent Arena – Evaluating Multi-Modal OS Agents at Scale (microsoft.github.io)
1 point by albertzeyer 9 hours ago | 0 comments