NodeTool Core

Powerful, Flexible Node-Based Workflows for AI Applications

📚 Overview

NodeTool Core is a powerful Python library for building and running AI workflows using a modular, node-based approach. It provides the foundation for the NodeTool ecosystem, enabling developers to create sophisticated AI applications with minimal code.

✨ Key Features

🔄 Node-based workflow system - Compose complex workflows from simple building blocks
🤖 Multi-provider AI support - Seamless integration with OpenAI, Anthropic, Ollama, and more
🧩 Modular architecture - Easily extend with custom nodes and functionality
⚡ High-performance execution engine - Run workflows efficiently on CPU or GPU
🔄 Workflow streaming API - Get real-time updates on workflow progress
🧠 Advanced agent system - Create intelligent agents with specialized capabilities
💾 Storage and persistence - Save and manage workflows and results
📊 Type system - Strong typing for workflow validation and documentation

🚀 Quick Start

Installation

# Install using pip
pip install nodetool-core

# Or with Poetry
poetry add nodetool-core

Basic Usage

import asyncio
from nodetool.dsl.graph import graph, run_graph
from nodetool.dsl.providers.openai import ChatCompletion
from nodetool.metadata.types import OpenAIModel

# Create a simple workflow
g = ChatCompletion(
    model=OpenAIModel(model="gpt-4"),
    messages=[{"role": "user", "content": "Explain quantum computing in simple terms"}]
)

# Run the workflow
result = asyncio.run(run_graph(graph(g)))
print(result)

📖 Documentation

Comprehensive documentation is available at docs.nodetool.ai.

🧩 Examples

    context = ProcessingContext()

    provider = get_provider(Provider.OpenAI)
    model = "gpt-4o"

    retrieval_tools = [
        GoogleSearchTool(context.workspace_dir),
        BrowserTool(context.workspace_dir),
    ]

    agent = Agent(
        name="Research Agent",
        objective="""
        Research the competitive landscape of AI code assistant tools.
        1. Use google search and browser to identify a list of AI code assistant tools
        2. For each tool, identify the following information:
            - Name of the tool
            - Description of the tool
            - Key features of the tool
            - Pricing information
            - User reviews
            - Comparison with other tools
        3. Summarize the findings in a table format
        """,
        provider=provider,
        model=model,
        tools=retrieval_tools,
        output_schema={
            "type": "object",
            "properties": {
                "tools": {
                    "type": "array",
                    "items": {
                        "type": "object",
                        "properties": {
                            "name": {"type": "string"},
                            "description": {"type": "string"},
                            "key_features": {"type": "string"},
                            "pricing": {"type": "string"},
                            "user_reviews": {"type": "string"},
                            "comparison_with_other_tools": {"type": "string"},
                        },
                    },
                },
            },
        },
    )
    async for item in agent.execute(context):
        if isinstance(item, Chunk):
            print(item.content, end="", flush=True)

    print(f"\nWorkspace: {context.workspace_dir}")
    print(f"Results: {agent.results}")

More examples can be found in the examples directory.

🏗️ Architecture

NodeTool's architecture is designed to be flexible and extensible.

graph TD
A[NodeTool Editor<br>ReactJS] -->|HTTP/WebSocket| B[API Server]
A <-->|WebSocket| C[WebSocket Runner]
B <-->|Internal Communication| C
C <-->|WebSocket| D[Worker with ML Models<br>CPU/GPU<br>Local/Cloud]
D <-->|HTTP Callbacks| B
E[Other Apps/Websites] -->|HTTP| B
E <-->|WebSocket| C
D -->|Optional API Calls| F[OpenAI<br>Replicate<br>Anthropic<br>Others]

    classDef default fill:#e0eee0,stroke:#333,stroke-width:2px,color:#000;
    classDef frontend fill:#ffcccc,stroke:#333,stroke-width:2px,color:#000;
    classDef server fill:#cce5ff,stroke:#333,stroke-width:2px,color:#000;
    classDef runner fill:#ccffe5,stroke:#333,stroke-width:2px,color:#000;
    classDef worker fill:#ccf2ff,stroke:#333,stroke-width:2px,color:#000;
    classDef api fill:#e0e0e0,stroke:#333,stroke-width:2px,color:#000;
    classDef darkgray fill:#a9a9a9,stroke:#333,stroke-width:2px,color:#000;

    class A frontend;
    class B server;
    class C runner;
    class D worker;
    class E other;
    class F api;

🤝 Contributing

We welcome contributions from the community! Please see our Contributing Guidelines for more information on how to get involved.

Development Setup

Clone the repository

git clone https://github.com/yourusername/nodetool-core.git
cd nodetool-core

Install dependencies with Poetry
```
poetry install
```
Install pre-commit hooks
```
pre-commit install
```
Run tests
```
poetry run pytest
```

📄 License

AGPL License

📚 Learn More

Example 2: PDF Indexing for RAG Applications

This example shows how to create a workflow that loads a PDF document, extracts text, splits it into sentences, and indexes the chunks in a ChromaDB vector database for later retrieval:

import asyncio
import os
from nodetool.dsl.graph import graph, run_graph
from nodetool.dsl.chroma.collections import Collection
from nodetool.dsl.chroma.index import IndexTextChunks
from nodetool.dsl.lib.data.langchain import SentenceSplitter
from nodetool.dsl.lib.file.pymupdf import ExtractText
from nodetool.dsl.nodetool.os import LoadDocumentFile
from nodetool.metadata.types import FilePath, LlamaModel

# Set up paths
dirname = os.path.dirname(__file__)
file_path = os.path.join(dirname, "deepseek_r1.pdf")

# Create indexing workflow
g = IndexTextChunks(
    collection=Collection(name="papers"),
    text_chunks=SentenceSplitter(
        text=ExtractText(
            pdf=LoadDocumentFile(path=FilePath(path=file_path)),
        ),
        document_id=file_path,
    ),
)

# Run the workflow
asyncio.run(run_graph(graph(g)))

Key Concepts

When using NodeTool programmatically, keep these key concepts in mind:

Nodes: Each node represents a specific operation or function. Nodes have inputs and outputs that can be connected to form a workflow.
Graph: A collection of nodes and their connections, representing the entire workflow.
DSL (Domain-Specific Language): NodeTool provides a Python DSL for creating workflows, with specialized modules for different domains (e.g., nodetool.dsl.google.mail, nodetool.dsl.chroma.collections).
Execution: Workflows are executed using the run_graph function, which takes a graph object created with the graph function.

Workflow Execution Architecture

NodeTool Core includes a sophisticated workflow execution engine that processes directed graphs of computational nodes. Understanding how workflows are executed can help you build more efficient and effective workflows.

WorkflowRunner

The WorkflowRunner class is the heart of NodeTool's execution engine. It handles:

Parallel execution of independent nodes
GPU resource management with ordered locking
Result caching for cacheable nodes
Error handling and retry logic for GPU OOM situations
Progress tracking and status updates
Support for both regular nodes and group nodes (subgraphs)

Execution Process

When you run a workflow, the following steps occur:

Initialization: The runner is initialized with a job ID and automatically detects the available device (CPU, CUDA, or MPS).
Graph Loading: The workflow graph is loaded from the request, and nodes are instantiated.
Input Processing: Input parameters are assigned to the corresponding input nodes.
Graph Validation: The graph is validated to ensure all edges are valid and all required inputs are provided.
Node Initialization: All nodes in the graph are initialized.
Graph Processing:
- Nodes without incoming edges are processed first
- As nodes complete, messages are sent to downstream nodes
- Nodes are processed when all their required inputs are available
- GPU-intensive nodes acquire a lock before execution to manage resources
Result Collection: Results from output nodes are collected and returned.
Finalization: Resources are cleaned up, and the final status is reported.

Advanced Features

Parallel Execution: Independent nodes are executed in parallel using asyncio.
GPU Management: The runner intelligently manages GPU resources, with retry logic for out-of-memory situations.
Subgraph Support: Group nodes can contain entire subgraphs, enabling hierarchical workflows.
Progress Tracking: The runner provides real-time progress updates during execution.

Using the Workflow API 🔌

NodeTool provides a powerful Workflow API that allows you to integrate and run your AI workflows programmatically.

You can use the API locally now, api.nodetool.ai access is limited to Alpha users.

API Usage

Loading Workflows

const response = await fetch("http://localhost:8000/api/workflows/");
const workflows = await response.json();

Running a Workflow

HTTP API

curl -X POST "http://localhost:8000/api/workflows/<workflow_id>/run" \
-H "Content-Type: application/json" \
-d '{
    "params": {
        "param_name": "param_value"
    }
}'

const response = await fetch(
  "http://localhost:8000/api/workflows/<workflow_id>/run",
  {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      params: params,
    }),
  }
);

const outputs = await response.json();
// outputs is an object with one property for each output node in the workflow
// the value is the output of the node, which can be a string, image, audio, etc.

Streaming API

The streaming API is useful for getting real-time updates on the status of the workflow.

See run_workflow_streaming.js for an example.

These updates include:

job_update: The overall status of the job (e.g. running, completed, failed, cancelled)
node_update: The status of a specific node (e.g. running, completed, error)
node_progress: The progress of a specific node (e.g. 20% complete)

The final result of the workflow is also streamed as a single job_update with the status "completed".

const response = await fetch(
  "http://localhost:8000/api/workflows/<workflow_id>/run?stream=true",
  {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      params: params,
    }),
  }
);

const reader = response.body.getReader();
const decoder = new TextDecoder();

while (true) {
  const { done, value } = await reader.read();
  if (done) break;

  const lines = decoder.decode(value).split("\n");
  for (const line of lines) {
    if (line.trim() === "") continue;

    const message = JSON.parse(line);
    switch (message.type) {
      case "job_update":
        console.log("Job status:", message.status);
        if (message.status === "completed") {
          console.log("Workflow completed:", message.result);
        }
        break;
      case "node_progress":
        console.log(
          "Node progress:",
          message.node_name,
          (message.progress / message.total) * 100
        );
        break;
      case "node_update":
        console.log(
          "Node update:",
          message.node_name,
          message.status,
          message.error
        );
        break;
    }
  }
}

WebSocket API

The WebSocket API is useful for getting real-time updates on the status of the workflow. It is similar to the streaming API, but it uses a more efficient binary encoding. It offers additional features like canceling jobs.

See run_workflow_websocket.js for an example.

const socket = new WebSocket("ws://localhost:8000/predict");

const request = {
  type: "run_job_request",
  workflow_id: "YOUR_WORKFLOW_ID",
  params: {
    /* workflow parameters */
  },
};

// Run a workflow
socket.send(
  msgpack.encode({
    command: "run_job",
    data: request,
  })
);

// Handle messages from the server
socket.onmessage = async (event) => {
  const data = msgpack.decode(new Uint8Array(await event.data.arrayBuffer()));
  if (data.type === "job_update" && data.status === "completed") {
    console.log("Workflow completed:", data.result);
  } else if (data.type === "node_update") {
    console.log("Node update:", data.node_name, data.status, data.error);
  } else if (data.type === "node_progress") {
    console.log("Progress:", (data.progress / data.total) * 100);
  }
  // Handle other message types as needed
};

// Cancel a running job
socket.send(msgpack.encode({ command: "cancel_job" }));

// Get the status of the job
socket.send(msgpack.encode({ command: "get_status" }));

API Demo

Download the html file
Open in a browser locally.
Select the endpoint, local or api.nodetool.ai (for alpha users)
Enter API token (from Nodetool settings dialog)
Select workflow
Run workflow
The page will live stream the output from the local or remote API

Installation

# Install using Poetry
poetry install

Development

Setup

Clone the repository
Install dependencies with Poetry:
```
poetry install
```

Testing

Run tests with pytest:

poetry run pytest

Code Style

This project uses Black for code formatting:

poetry run black .

License

AGPL License

Name		Name	Last commit message	Last commit date
Latest commit History 110 Commits
.github		.github
docs		docs
examples		examples
src/nodetool		src/nodetool
tests		tests
tmp		tmp
tools		tools
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.pylintrc		.pylintrc
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
changelog.py		changelog.py
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements-dev.txt		requirements-dev.txt
setup.cfg		setup.cfg

License

nodetool-ai/nodetool-core

Folders and files

Latest commit

History

Repository files navigation

NodeTool Core

📚 Overview

✨ Key Features

🚀 Quick Start

Installation

Basic Usage

📖 Documentation

🧩 Examples

🏗️ Architecture

🤝 Contributing

Development Setup

📄 License

📚 Learn More

Example 2: PDF Indexing for RAG Applications

Key Concepts

Workflow Execution Architecture

WorkflowRunner

Execution Process

Advanced Features

Using the Workflow API 🔌

API Usage

Loading Workflows

Running a Workflow

HTTP API

Streaming API

WebSocket API

API Demo

Installation

Development

Setup

Testing

Code Style

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages