Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper โข 2412.04454 โข Published Dec 5, 2024 โข 59
TextDiffuser: Diffusion Models as Text Painters Paper โข 2305.10855 โข Published May 18, 2023 โข 3