Connector suggestion: anybrowse for live web content ingestion with Cloudflare bypass #4288

## Context

Unstructured handles document parsing beautifully, but getting live web content into the pipeline is still a pain, especially for Cloudflare-protected sites.

## Suggestion

[anybrowse](https://anybrowse.dev) could work as a web source connector: it fetches URLs with real browser rendering and returns clean markdown that Unstructured can then partition or chunk.

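```python
import requests
from unstructured.partition.text import partition_text

def fetch_url(url: str) -> str:
    """Fetch a page through anybrowse and return its markdown ("" if absent)."""
    # Browser rendering can be slow, so set an explicit (generous) timeout.
    r = requests.post("https://anybrowse.dev/scrape", json={"url": url}, timeout=60)
    r.raise_for_status()  # surface HTTP errors instead of parsing an error body
    return r.json().get("markdown", "")

# Then pass the markdown to an Unstructured partition function
elements = partition_text(text=fetch_url("https://example.com"))
```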

Works on Cloudflare-protected sites, JS-rendered pages, and standard HTML.
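For a connector that ingests many URLs, one failed fetch probably should not abort the whole run. Below is a minimal sketch of a batch wrapper, assuming a `fetch` callable that behaves like `fetch_url` above (returns markdown, raises on error). The name `ingest_urls` and its return shape are hypothetical and not part of anybrowse or Unstructured.

```python
from typing import Callable, Dict, Iterable, List, Set, Tuple

def ingest_urls(
    urls: Iterable[str],
    fetch: Callable[[str], str],
) -> Tuple[Dict[str, str], List[Tuple[str, str]]]:
    """Fetch each distinct URL once.

    Returns (markdown keyed by URL, list of (url, error message)).
    `ingest_urls` is a hypothetical helper, not an existing connector API.
    """
    pages: Dict[str, str] = {}
    failures: List[Tuple[str, str]] = []
    seen: Set[str] = set()
    for url in urls:
        if url in seen:
            continue  # each distinct URL is attempted only once
        seen.add(url)
        try:
            markdown = fetch(url)
        except Exception as exc:  # e.g. network, HTTP, or JSON decoding errors
            failures.append((url, str(exc)))
            continue
        if not markdown.strip():
            # An empty body is recorded as a failure rather than passed downstream.
            failures.append((url, "empty markdown"))
            continue
        pages[url] = markdown
    return pages, failures
```

The values in `pages` could then be handed to `partition_text(text=...)` one at a time, while `failures` can be logged or retried later.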

- Free: 10 scrapes/day, no API key
- Paid: $5 for 3,000 scrapes
- Docs: https://anybrowse.dev/docs
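Given the free-tier limit listed above, a connector might want to track its own usage before sending requests. This is a rough sketch of a local counter only. How and when anybrowse actually resets or enforces the quota is not stated here, so the calendar-day reset and the `DailyQuota` class are assumptions for illustration.

```python
import datetime as dt
from typing import Optional

class DailyQuota:
    """Local per-day request counter (hypothetical helper, not an anybrowse API)."""

    def __init__(self, limit: int = 10) -> None:
        self.limit = limit
        self._day: Optional[dt.date] = None
        self._used = 0

    def try_acquire(self, today: Optional[dt.date] = None) -> bool:
        """Reserve one request for `today`; return False once the limit is reached."""
        today = today or dt.date.today()
        if today != self._day:
            # Assumption: the quota resets when the local calendar day changes.
            self._day, self._used = today, 0
        if self._used >= self.limit:
            return False
        self._used += 1
        return True
```

A caller would check `quota.try_acquire()` before each scrape and skip or defer the URL when it returns `False`.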
