...
Provider URL | Counting Clause | Multi-Click Time Span | User Identification | Crawler Clause | Crawler Identification | Crawler Count Report |
---|---|---|---|---|---|---|
HTTP Status Code is 200 or 304. | for HTML 10s; for PDF 30s | at least IP, preferably Session | robots, prefetches, caching, federated searches(n.a.) | Black-List, client HTTP header | separate report | |
HTTP Status Code is 200, 206, 301, 302 or 304. | one calendar month | IP | robots, automated downloads (wget) | Access of robots.txt; # of requests 10,000 items/month; C-Class access 10% of stock; known robot-Domain/IP | separate column in report | |
HTTP Status code is 200 on abstract or full-text page | 24 hours | IP | search engine crawlers + automated | AWStats' black list | discarded | |
Default: HTTP Status codes (200;304) | Default: 1 hour | IP | search engine crawlers | Black-List | separate column in report | |
HTML: Tracking Pixel; Other: bytes transferred 95% of file size | Each Pageview is counted only once per visit. Visit means series of clicks coming from one IP-Number/Session-ID less than 30 minutes apart. | IP+User-Agent; Cookie-Session, Login-Session | search engine crawlers; automated downloads (optional) | proprietary Blacklist | discarded |