cyberneticlibrary

Extract web data at scale reliably

universal-scraping-architectskillsetup L317,464
alirezarezvani/claude-skills
What it does

Extract structured data from any website via adaptive scraping and schema inference

Best for

Data engineers needing flexible web scraping without writing per-site parsers.

Inputs
  • · website URL
  • · optional: desired data fields
Outputs
  • · JSON or CSV with fields, inferred schema, cache for re-runs
Requires
  • · Playwright or BeautifulSoup
  • · LLM for schema inference
Preconditions

Website accessible

Failure modes
  • · IP rate limiting
  • · robots.txt blocks
  • · schema inference hallucination
Trust signals
  • · Adaptive scraper adjusts method (static vs JS-rendered)