Update CLAUDE.md
This commit is contained in:
18
CLAUDE.md
18
CLAUDE.md
@@ -36,19 +36,21 @@ Key features:
|
||||
|
||||
**Use this when:** Working with sites that already provide feeds but need enhancement (e.g., content modification, filtering, or expanding truncated content).
|
||||
|
||||
## WebDriverAbstract - JavaScript/XHR-Heavy Sites
|
||||
## WebDriverAbstract - JavaScript/XHR-Heavy Sites (NOT AVAILABLE)
|
||||
|
||||
**File:** [Bridge_Docs/04_WebDriverAbstract.md](Bridge_Docs/04_WebDriverAbstract.md)
|
||||
|
||||
For websites that heavily rely on JavaScript or XMLHttpRequest (XHR) to load content. Uses Selenium WebDriver with a real browser instance.
|
||||
**IMPORTANT: DO NOT USE WebDriverAbstract!**
|
||||
|
||||
Important notes:
|
||||
- Requires a running Selenium server (Docker image or local ChromeDriver)
|
||||
- Very resource-intensive - only use when other methods fail
|
||||
- Must explicitly wait for elements to appear after page loads or interactions
|
||||
- Always call `$this->cleanUp()` in a finally block
|
||||
This server does not have Selenium WebDriver available. WebDriverAbstract cannot be used for bridge creation.
|
||||
|
||||
**Use this when:** The target website loads content dynamically via JavaScript and cannot be scraped with standard HTML parsing methods.
|
||||
**If a website requires WebDriverAbstract** (i.e., content is loaded entirely via JavaScript and cannot be accessed through other methods), **state that a bridge cannot be created for this website** due to the lack of Selenium support on the server.
|
||||
|
||||
Alternative approaches to try before giving up:
|
||||
- Check if the website has a hidden API or JSON endpoint that can be accessed directly
|
||||
- Look for RSS/Atom feeds that might exist
|
||||
- Examine network requests in browser dev tools to find data sources
|
||||
- Try XPathAbstract or BridgeAbstract with different parsing strategies
|
||||
|
||||
## XPathAbstract - XPath-Based Bridges
|
||||
|
||||
|
||||
Reference in New Issue
Block a user