Skill_Seekers now automatically detects and uses llms.txt files when available, providing 10x faster documentation ingestion.
The llms.txt convention is a growing standard where documentation sites provide pre-formatted, LLM-ready markdown files:
llms-full.txt - Complete documentationllms.txt - Standard balanced versionllms-small.txt - Quick referenceNo config changes needed. Just run normally:
python3 cli/doc_scraper.py --config configs/hono.json
Optionally specify llms.txt URL:
{
"name": "hono",
"llms_txt_url": "https://hono.dev/llms-full.txt",
"base_url": "https://hono.dev/docs"
}
| Method | Time | Requests |
|---|---|---|
| HTML Scraping (20 pages) | 20-60s | 20+ |
| llms.txt | < 5s | 1 |
Sites known to provide llms.txt:
If llms.txt download or parsing fails, automatically falls back to HTML scraping with no user intervention required.