Core Concepts
Bytebot differs from conventional web scraping and automation tools. It uses AI to learn from your interactions and build flexible, intent-driven automations that keep working when web pages change. This page outlines the core concepts behind that approach.
What is Bytebot?
Bytebot uses large language models (LLMs) to translate user-provided descriptions (e.g., “Click on the cart”) into precise browser actions. Each time you interact with a web page, Bytebot interprets your clicks and inputs, learns from them, and builds a repeatable workflow from those actions. This makes Bytebot a good fit for web scraping, testing, data entry, and other browser automation tasks where page structure changes over time.
How Bytebot Works
Bytebot builds workflows from natural language in four stages:
- Ingesting Natural Language Prompts: Bytebot understands plain-language instructions, such as descriptions of clicks or text inputs.
- Learning from Actions: Bytebot observes your interactions and learns the intent behind each step, converting it into reusable actions.
- Generating Browser Actions: Using AI, Bytebot translates your actions into structured steps, like clicking or entering text, and adds these to a growing workflow.
- Executing the Workflow: Bytebot then executes the actions as an automated sequence, creating a reliable workflow that adapts to changes in page layout.
By combining your recorded actions with AI-driven predictions, Bytebot generates automations that stay aligned with your goals, even as web pages change over time.
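The four stages above can be pictured with a small sketch. This is not Bytebot's API: `Step`, `Workflow`, and `toy_translate` are hypothetical names, and a keyword lookup stands in for the LLM so the example runs without any external service. It only illustrates how plain-language descriptions might accumulate into an ordered workflow that is then executed step by step.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Step:
    description: str  # the plain-language instruction
    action_type: str  # e.g. "Click" or "ExtractTable"
    xpath: str        # where on the page the action applies


def toy_translate(description: str) -> Step:
    """Hypothetical stand-in for the LLM stage: a keyword lookup, not real AI."""
    rules = {
        "cart": ("Click", "//a[@id='cart']"),
        "table": ("ExtractTable", "//table[@id='prices']"),
    }
    for keyword, (action_type, xpath) in rules.items():
        if keyword in description.lower():
            return Step(description, action_type, xpath)
    raise ValueError(f"no rule matches: {description!r}")


@dataclass
class Workflow:
    steps: List[Step] = field(default_factory=list)

    def add(self, description: str) -> None:
        # Stages 1-3: ingest the prompt, translate it, append to the workflow.
        self.steps.append(toy_translate(description))

    def run(self, perform: Callable[[Step], None]) -> None:
        # Stage 4: execute the steps in the order they were added.
        for step in self.steps:
            perform(step)


workflow = Workflow()
workflow.add("Click on the cart")
workflow.add("Extract the prices table")

# A real executor would drive a browser; here we only record what would run.
log = []
workflow.run(lambda step: log.append((step.action_type, step.xpath)))
```

Passing the executor in as a callback keeps the workflow itself a plain list of steps, which is what makes it repeatable: the same steps can be replayed later against a live page.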
What are Browser Actions?
At the core of Bytebot’s automations are browser actions: the steps Bytebot creates based on your instructions. Each browser action includes:
- Action Type: The type of interaction, such as “Click”, “CopyAttribute”, or “ExtractTable”.
- XPath: An XPath expression, a path-like query that identifies the target element on the page. Bytebot generates it for you, so you don’t have to write selectors yourself.
While browser actions are created automatically by Bytebot, they are visualized in the interface so you can confirm that the right steps are in place. Bytebot’s AI interprets your intent and handles the specifics, letting you focus on what you want to achieve.
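To make the two fields concrete, here is a minimal sketch of a browser action as a data structure and of how an XPath locates an element. `BrowserAction` and `resolve` are hypothetical names rather than Bytebot's own types, the page is a made-up snippet, and Python's standard `xml.etree.ElementTree` (which supports only a subset of XPath) stands in for a real browser.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass(frozen=True)
class BrowserAction:
    action_type: str  # e.g. "Click", "CopyAttribute", "ExtractTable"
    xpath: str        # expression that selects the target element


# Made-up, well-formed page used only for this illustration.
PAGE = """<html><body>
  <a id="cart" href="/cart">Cart</a>
  <table id="prices"><tr><td>Widget</td><td>9.99</td></tr></table>
</body></html>"""


def resolve(action: BrowserAction, root: ET.Element) -> list:
    """Return every element that the action's XPath selects."""
    return root.findall(action.xpath)


root = ET.fromstring(PAGE)

# ElementTree expects relative paths, hence the leading "." before "//".
click_cart = BrowserAction("Click", ".//a[@id='cart']")
matches = resolve(click_cart, root)

# Assumption: an attribute-copying action would read a value such as href.
href = matches[0].get("href")

# Assumption: a table-extracting action would collect the cell text row by row.
extract = BrowserAction("ExtractTable", ".//table[@id='prices']")
table = resolve(extract, root)[0]
rows = [[cell.text for cell in row.findall("td")] for row in table.findall("tr")]
```

Because the XPath here selects by `id` rather than by position, the action keeps resolving to the same element even if unrelated parts of the page move around.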