Vision-driven Android device automation using Midscene. Operates entirely from screenshots — no DOM or accessibility labels required. Can interact with all v...
**Expanded command set and more precise device automation features.** - Added new commands: `launch` (to open apps or URLs), `runadbshell` (to execute arbitrary ADB shell commands), and `assert` (for natural-language state verification). - Allowed reference image-based UI targeting using the `tap --locate` option with JSON configuration. - Documented full list of actions supported by `act` commands (tap, long-press, scroll, type, key events, navigation, etc.). - Improved best practices and workflow guidance, including the use of `-y` for non-interactive execution with npx. - Expanded description and trigger keywords to cover more automation and QA use cases. - Added instructions for using and converting detailed HTML/Markdown automation reports.