Repository navigation

#

omniparser

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Python
6481
11 天前

Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural language to make computers work by themselves

Python
3161
19 天前
OpenAdaptAI/OpenAdapt

Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models

Python
1239
1 个月前

OmniMCP uses Microsoft OmniParser and Model Context Protocol (MCP) to provide AI models with rich UI context and powerful interaction capabilities.

Python
31
12 天前

AI agent that controls a computer

Python
29
2 个月前

AI-powered computer control for automated testing. FactifAI uses vision models (Claude, GPT-4o, Gemini) to interact with applications naturally - clicking, typing, and verifying results just like a human would.

TypeScript
22
3 天前

Cappuccino is an GUI Agent based on desktop screen. It is a Manus-like AI Agent that can be deployed locally.

Python
21
25 天前

🤖 deploy OmniParser v2 model on Amazon SageMaker with async inference endpoint

Python
7
2 个月前

Placeholder for Omniparser Schemas used by universal-etl-parser

1
24 天前

Effortless Deployment and Integration for SOTA Screenshot Parsing and Action Models

1
2 个月前

Docker implementation of the OmniParser screen parsing tool

1
4 个月前