comfyui_LLM_party
Run ComfyUI Easily with InstaSD
Skip the complex setup. InstaSD helps creative professionals build workflows and deploy them to the world:
- One-click deployment
- Any model, any node
- Powerful GPUs for rapid iteration
Available Nodes
browser_use_tool
Browser Automatic Control Tool Node Documentation
Overview
The Browser Automatic Control Tool Node is a part of the ComfyUI LLM Party, designed to automate tasks using language models. This node allows you to connect with a browser to perform specific tasks based on instructions given through a language model. It automates browser interactions, enabling seamless automation and integration into workflows.
What This Node Does
This node leverages large language models (LLMs) to automatically execute tasks in a browser based on provided task descriptions. It processes the task through the configured LLM provider, carries out the task in a browser, and then returns the results.
Inputs
The Browser Automatic Control Tool Node accepts the following inputs:
- Model Name: The name of the language model to be used, such as "gpt-4o".
- Provider: The provider of the language model. The options include:
- OpenAI
- Azure
- Anthropic
- Base URL: An optional base URL for the API endpoint. If not specified, it defaults based on the chosen provider and model.
- API Key: An optional API key for authenticating with the selected model provider. Alternatively, the key can be specified in a configuration file.
- Is Enable: A toggle to enable or disable the node. If set to false, the node will not execute.
Outputs
This node produces the following outputs:
- Tool: A JSON string that describes the function performed by the node. It includes details about the task to be executed and expects a task description as input to perform the desired automation in the browser.
Usage in ComfyUI Workflows
In ComfyUI workflows, this node can be used to automate various web-based tasks and integrate them into existing automation processes. Users can:
- Set up a workflow where a task description is fed to this node.
- Use the node to interact with a browser and execute the task described.
- Retrieve the results and utilize them in subsequent steps of the workflow.
Examples of tasks could include automating data retrieval from websites, performing online actions like form submissions, or any custom task that can be described in natural language and executed via a browser.
Special Features or Considerations
- Language Model Integration: The node supports integration with multiple LLM providers, including OpenAI, Azure, and Anthropic. Each provider may require specific API keys or URLs, which can be set within the node or through a configuration file.
- Flexible Configuration: Users can easily configure the model, provider, and API details within the node settings, allowing for tailored automation solutions.
- Internationalization Support: The display name for the node changes based on the user's locale settings. It automatically adapts to either English or Chinese display names, providing a more personalized user experience.
The Browser Automatic Control Tool Node provides an efficient way to automate browser tasks, integrating seamlessly into ComfyUI workflows and expanding automation capabilities through the power of language models.