Skip to main content

AutoPOM-Agent Documentation

AutoPOM-Agent is an AI autonomous spider that crawls web applications and generates Playwright Page Object Models in Java, JavaScript, or TypeScript.

What you will find here​

  • Configuration to run controlled crawls and manage credentials.
  • Architecture covering the agentic loop, state graph, and synthesis pipeline.
  • Core guides for selector mapping, self-healing, and multi-language POM output strategy.
  • Tutorials for first runs and authenticated exploration.
  • Use cases for practical implementation patterns in enterprise contexts.

Core workflow​

  1. Start with a base URL.
  2. Agent observes and decides next action.
  3. Interactive elements are extracted and semantically named.
  4. Selectors are validated and healed.
  5. POM code is generated from structured JSON page models in the selected language.

What's new​

  • Interactive Capture Mode: Manually navigate/authenticate before mapping a page.
  • Existing Browser Capture: Attach to any running browser via CDP for surgical POM generation.
  • Chrome Profile Support: Reuse your existing browser sessions, cookies, and credentials.
  • Locator Storage Strategy: Choose between inline or external storage for generated locators.
  • Configurable pom_language with support for java, javascript, and typescript.
  • Configurable browser_adapter with mock and live playwright modes.
  • Language-specific output folders under output/<language>/.
  • Live progress logs during execution ([CRAWL], [MAP], [SKIP]).
  • Execution summary reports for managerial visibility.