Sokjan Crawler

Private research crawler operated by the Nicodemus project stack.
Non-commercial • Respectful

About

Sokjan is a focused web crawler used for private research, search experimentation, and language-model tooling. It runs from a small self-hosted lab environment and is not a mass-indexing or advertising crawler.

If you see requests from this bot, they are part of limited-scope experiments: indexing public pages, testing search relevance, and feeding a private assistant with fresher data.

User-Agent
SokjanBot/1.0 (+https://sokjan.net;/)
Contact

For block requests configure robots.txt as described.

Crawl & Respect Policy

Sokjan obeys robots.txt, honours standard crawl-delay directives where present, and is tuned for low request rates.

The crawler is target-scoped and can be fully excluded from your site using standard robots rules:

User-agent: SokjanBot
Disallow: /

To allow Sokjan but keep it gentle, you can instead use, for example:

User-agent: SokjanBot
Crawl-delay: 10
Robots-aware Low-rate No resale of data Private lab crawler