Agentic AI Catalog

AI Agents Directory

Compare autonomous coding agents, orchestration frameworks, visual builders, and evaluation and observability tools by autonomy level and model support.
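The comparison described above amounts to filtering entries by category, autonomy level, and supported model. The following is a minimal sketch of that idea, not the directory's actual implementation: the `Autonomy` enum, the `Entry` dataclass, and `filter_entries` are hypothetical names, the four levels mirror the "Lvl 1" to "Lvl 4" labels used in the listings, and treating "Model-agnostic" entries as matching any model is an assumption.

```python
from dataclasses import dataclass
from enum import IntEnum
from typing import Iterable, List, Optional, Tuple


class Autonomy(IntEnum):
    """Autonomy levels as labelled in the listings (hypothetical enum)."""
    ASSISTIVE = 1
    DIRECTED = 2
    AUTONOMOUS = 3
    MULTI_AGENT = 4


@dataclass(frozen=True)
class Entry:
    """One directory entry; field names are illustrative."""
    name: str
    category: str
    autonomy: Autonomy
    models: Tuple[str, ...]


# A few entries copied from the listings below.
CATALOG = [
    Entry("Devin", "Coding Agents", Autonomy.AUTONOMOUS,
          ("Claude 3.5 Sonnet", "GPT-4o")),
    Entry("Aider", "Coding Agents", Autonomy.DIRECTED,
          ("Claude 3.5 Sonnet", "DeepSeek-V3", "GPT-4o")),
    Entry("Cursor", "Coding Agents", Autonomy.ASSISTIVE,
          ("Claude 3.5 Sonnet", "GPT-4o", "Cursor-Small")),
    Entry("LangGraph", "Developer Frameworks", Autonomy.MULTI_AGENT,
          ("Model-agnostic",)),
]


def filter_entries(
    entries: Iterable[Entry],
    category: Optional[str] = None,
    min_autonomy: Optional[Autonomy] = None,
    model: Optional[str] = None,
) -> List[Entry]:
    """Return entries matching every filter that is not None."""
    result = []
    for entry in entries:
        if category is not None and entry.category != category:
            continue
        if min_autonomy is not None and entry.autonomy < min_autonomy:
            continue
        # Assumption: model-agnostic tools match any requested model.
        if (model is not None
                and model not in entry.models
                and "Model-agnostic" not in entry.models):
            continue
        result.append(entry)
    return result


if __name__ == "__main__":
    for entry in filter_entries(CATALOG, category="Coding Agents",
                                min_autonomy=Autonomy.DIRECTED):
        print(entry.name, "Lvl", int(entry.autonomy))
```

Using an `IntEnum` keeps the levels ordered, so a "Lvl 2 or higher" query is a plain comparison.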

Devin

Coding Agents

Lvl 3: Autonomous

Billed as the first fully autonomous AI software engineer, capable of handling complex multi-step programming tasks, environment configuration, bug fixing, and refactoring remotely.

License: Proprietary

Type: subscription

Models: Claude 3.5 Sonnet, GPT-4o

Cline

Coding Agents

Lvl 2: Directed

An autonomous coding agent that runs inside VS Code. Cline can read/write files, edit settings, install packages, and execute terminal commands with user permission.

License: Open-Source (Apache-2.0)

Type: free

Models: Claude 3.5 Sonnet, DeepSeek-V3, GPT-4o

Aider

Coding Agents

Lvl 2: Directed

A popular command-line AI coding agent that lets programmers edit code in local git repositories and automatically creates a git commit after each completed edit.

License: Open-Source (Apache-2.0)

Type: free

Models: Claude 3.5 Sonnet, DeepSeek-V3, GPT-4o

Claude Code

Coding Agents

Lvl 2: Directed

Anthropic's official terminal-based coding assistant. Claude Code executes CLI commands, inspects local directories, writes files, and performs fast semantic code searches.

License: Proprietary

Type: pay-as-you-go

Models: Claude 3.5 Sonnet

Cursor

Coding Agents

Lvl 1: Assistive

An AI-first code editor forked from VS Code. It features Composer multi-file editing, chat-driven workspace commands, and autocomplete.

License: Proprietary

Type: freemium

Models: Claude 3.5 Sonnet, GPT-4o, Cursor-Small

Windsurf

Coding Agents

Lvl 1: Assistive

An AI IDE designed around the concept of a 'FlowState', allowing seamless transitions between autonomous editing, interactive chats, and terminal executions.

License: Proprietary

Type: freemium

Models: Claude 3.5 Sonnet, GPT-4o

OpenHands

Coding Agents

Lvl 3: Autonomous

Formerly OpenDevin, OpenHands is a highly autonomous open-source agent platform that executes complex coding workflows inside sandboxed Docker containers.

License: Open-Source (MIT)

Type: free

Models: Claude 3.5 Sonnet, GPT-4o, Llama 3.3

Claude Cowork

Productivity & General

Lvl 2: Directed

Anthropic's collaborative enterprise agent. It integrates directly into Slack workspaces and dashboard portals to review team documentation, draft releases, and execute workflows.

License: Proprietary

Type: subscription

Models: Claude 3.5 Sonnet

ChatGPT Agent

Productivity & General

Lvl 2: Directed

OpenAI's assistant agent, which uses custom GPT tools, Python code execution, and web search to manage workflows, files, and queries autonomously.

License: Proprietary

Type: subscription

Models: GPT-4o, o1, o3-mini

MultiOn

Productivity & General

Lvl 3: Autonomous

An autonomous web agent that uses a browser to log in, fill in forms, complete shopping-cart checkouts, and fetch data from web portals.

License: Proprietary

Type: subscription

Models: Custom Agentic Router, GPT-4o

Lindy

Productivity & General

Lvl 3: Autonomous

An autonomous business helper designed to handle daily administrative tasks, email management, meeting coordination, and CRM updates.

License: Proprietary

Type: subscription

Models: GPT-4o, Claude 3.5 Sonnet

Microsoft Copilot Studio

Productivity & General

Lvl 3: Autonomous

Microsoft's enterprise platform for building custom, conversational agent copilots connected to internal corporate databases, SharePoint repositories, and CRM platforms.

License: Proprietary

Type: paid

Models: GPT-4o, Azure OpenAI Models

Julius AI

Data & Research

Lvl 2: Directed

An advanced data analysis agent that writes Python scripts, generates charts, cleans data sheets, and builds regression plots automatically.

License: Proprietary

Type: subscription

Models: GPT-4o, Claude 3.5 Sonnet

Consensus

Data & Research

Lvl 2: Directed

An AI search engine that acts as a research agent, scanning over 200 million academic papers to synthesize scientific findings and deliver consensus reports.

License: Proprietary

Type: freemium

Models: GPT-4o

Snowflake Cortex

Data & Research

Lvl 2: Directed

Snowflake's warehouse-native AI platform, enabling users to execute natural language database queries (Text-to-SQL), perform classification, and run sentiment analyses directly inside secure database tables.

License: Proprietary

Type: pay-as-you-go

Models: Llama 3.1 405B, Mistral Large, Snowflake Arctic

ThoughtSpot

Data & Research

Lvl 2: Directed

An enterprise search-driven business intelligence tool that uses generative AI to translate natural language queries into SQL database lookups and generate interactive business dashboards.

License: Proprietary

Type: paid

Models: GPT-4o

LangGraph

Developer Frameworks

Lvl 4: Multi-Agent

An open-source library built by LangChain for composing stateful multi-agent systems as graphs that may contain cycles, suited to complex autonomous behaviors.

License: Open-Source (MIT)

Type: free

Models: Model-agnostic

CrewAI

Developer Frameworks

Lvl 4: Multi-Agent

A popular developer framework for building collaborative multi-agent workspaces where agents are assigned specific roles (e.g. Writer, Researcher) and coordinate to execute projects.

License: Open-Source (MIT)

Type: free

Models: Model-agnostic

AutoGen

Developer Frameworks

Lvl 4: Multi-Agent

Microsoft's framework for orchestrating multi-agent conversations. Supports customizable conversation rules, Python code execution, and human-in-the-loop triggers.

License: Open-Source (MIT)

Type: free

Models: Model-agnostic

LlamaIndex

Developer Frameworks

Lvl 3: Autonomous

A data framework designed to connect private data sources to LLMs, featuring advanced RAG tools, semantic vector database loaders, and data parsing agents.

License: Open-Source (MIT)

Type: free

Models: Model-agnostic

LangChain

Developer Frameworks

Lvl 2: Directed

A pioneering framework for building LLM applications, offering standardized abstractions for prompts, models, vector databases, memory, and chains.

License: Open-Source (MIT)

Type: free

Models: Model-agnostic

visual-builder

Visual Builders

Lvl 3: Autonomous

A workflow automation platform with a node-based visual editor, incorporating specialized AI nodes for prompt chaining, vector databases, and agent memory.

License: Open-Source (Sustainable-Use-License)

Type: freemium

Models: Model-agnostic

Dify

Visual Builders

Lvl 3: Autonomous

An open-source LLM app development platform combining visual workflow design, RAG management, and monitoring dashboards inside a unified developer UI.

License: Open-Source (Apache-2.0)

Type: freemium

Models: Model-agnostic

Gumloop

Visual Builders

Lvl 3: Autonomous

Formerly known as AgentHub, Gumloop is a visual workflow automation builder that lets developers run complex web scraping, file parsing, and bulk prompt jobs visually.

License: Proprietary

Type: freemium

Models: Model-agnostic

Flowise

Visual Builders

Lvl 3: Autonomous

An open-source visual node tool built on LangChain, letting developers build LLM pipelines, prompt trees, and RAG systems through simple drag-and-drop actions.

License: Open-Source (Apache-2.0)

Type: free

Models: Model-agnostic

LangSmith

Observability & Eval

Lvl 1: Assistive

LangChain's production platform for tracing, debugging, testing, and evaluating agent behaviors, providing complete visibility into prompt histories and token counts.

License: Proprietary

Type: freemium

Models: Model-agnostic

Arize Phoenix

Observability & Eval

Lvl 1: Assistive

An open-source AI evaluation and observability platform, enabling developers to run local trace databases, audit RAG queries, and detect hallucinations using standard OpenTelemetry protocols.

License: Open-Source (Apache-2.0)

Type: free

Models: Model-agnostic

DeepEval

Observability & Eval

Lvl 1: Assistive

An open-source evaluation framework for unit testing LLM applications. DeepEval acts like a 'pytest' for AI systems, evaluating toxicity, accuracy, and hallucination scores.

License: Open-Source (Apache-2.0)

Type: free

Models: Model-agnostic

Braintrust

Observability & Eval

Lvl 1: Assistive

An enterprise-grade software stack for evaluating, logging, and optimizing AI applications, helping teams test prompts and track logs in unified dashboard systems.

License: Proprietary

Type: paid

Models: Model-agnostic
