AutoPOM-Agent Documentation
AutoPOM-Agent is an autonomous AI spider that crawls web applications and generates Playwright Page Object Models (POMs) in Java, JavaScript, or TypeScript.
What you will find here
- Configuration to run controlled crawls and manage credentials.
- Architecture covering the agentic loop, state graph, and synthesis pipeline.
- Core guides for selector mapping, self-healing, and multi-language POM output strategy.
- Tutorials for first runs and authenticated exploration.
- Use cases for practical implementation patterns in enterprise contexts.
Core workflow
- Start with a base URL.
- The agent observes the current page and decides the next action.
- Interactive elements are extracted and semantically named.
- Selectors are validated and healed.
- POM code is generated from structured JSON page models in the selected language.
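The workflow above can be sketched in a few lines of Python. This is only an illustration under loud assumptions, not AutoPOM-Agent's implementation: `MOCK_SITE`, `semantic_name`, and `crawl` are hypothetical names, the agent's decision step is replaced by a plain breadth-first walk, and selector validation and healing are left out. It shows how a base URL can lead to structured JSON page models with semantically named elements.

```python
from collections import deque
import json
import re

# Hypothetical stand-in for a live browser: URL -> (interactive elements, outgoing links).
MOCK_SITE = {
    "/login": (
        [{"tag": "input", "label": "User name"}, {"tag": "button", "label": "Sign in"}],
        ["/home"],
    ),
    "/home": ([{"tag": "a", "label": "My Orders"}], ["/login"]),
}

# Assumed naming convention: camelCase label followed by a suffix for the element kind.
SUFFIXES = {"input": "Input", "button": "Button", "a": "Link"}


def semantic_name(element):
    """Derive a camelCase identifier from an element's visible label and tag."""
    words = [w.lower() for w in re.findall(r"[A-Za-z0-9]+", element["label"])] or ["unnamed"]
    head, *tail = words
    suffix = SUFFIXES.get(element["tag"], "Element")
    return head + "".join(w.capitalize() for w in tail) + suffix


def crawl(base_url, site):
    """Walk the site breadth-first from base_url and return one page model per page."""
    queue, seen, models = deque([base_url]), {base_url}, []
    while queue:
        url = queue.popleft()
        elements, links = site[url]
        models.append({
            "url": url,
            "elements": [
                {"name": semantic_name(e), "tag": e["tag"], "label": e["label"]}
                for e in elements
            ],
        })
        for link in links:
            if link not in seen:  # visit each page only once
                seen.add(link)
                queue.append(link)
    return models


page_models = crawl("/login", MOCK_SITE)
print(json.dumps(page_models[0], indent=2))
```

A page model in this shape is language-neutral, which is one plausible reason to generate POM code from JSON rather than directly from the DOM: the same model can feed a Java, JavaScript, or TypeScript template.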
What's new
- Interactive Capture Mode: Manually navigate/authenticate before mapping a page.
- Existing Browser Capture: Attach to any running browser via CDP for surgical POM generation.
- Chrome Profile Support: Reuse your existing browser sessions, cookies, and credentials.
- Locator Storage Strategy: Choose between `inline` and `external` storage for generated locators.
- Configurable `pom_language` with support for `java`, `javascript`, and `typescript`.
- Configurable `browser_adapter` with `mock` and live `playwright` modes.
- Language-specific output folders under `output/<language>/`.
- Live progress logs during execution (`[CRAWL]`, `[MAP]`, `[SKIP]`).
- Execution summary reports for managerial visibility.
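
To make the configurable options listed above concrete, here is a minimal Python sketch of how such settings could be validated and mapped to an output folder. The `pom_language` and `browser_adapter` keys and their allowed values come from the list above. Everything else is an assumption for illustration: the `CrawlSettings` class, the `locator_storage` key name, and the default values are hypothetical and may not match AutoPOM-Agent's real configuration format.

```python
from dataclasses import dataclass
from pathlib import PurePosixPath

# Allowed values as listed in this documentation.
ALLOWED = {
    "pom_language": {"java", "javascript", "typescript"},
    "browser_adapter": {"mock", "playwright"},
    "locator_storage": {"inline", "external"},  # hypothetical key name
}


@dataclass(frozen=True)
class CrawlSettings:
    """Hypothetical settings holder; the defaults are assumptions, not documented values."""
    pom_language: str = "typescript"
    browser_adapter: str = "mock"
    locator_storage: str = "inline"

    def __post_init__(self):
        # Reject any value outside the documented set for its key.
        for key, allowed in ALLOWED.items():
            value = getattr(self, key)
            if value not in allowed:
                raise ValueError(f"{key}={value!r} is not one of {sorted(allowed)}")

    @property
    def output_dir(self):
        """Language-specific output folder, following the output/<language>/ layout."""
        return PurePosixPath("output") / self.pom_language


settings = CrawlSettings(pom_language="java", browser_adapter="playwright")
print(settings.output_dir)  # output/java
```

Validating the values up front means a typo such as `pom_language="ruby"` fails before a crawl starts rather than at code-generation time.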