Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 4.0

...

Provider URL

Counting Clause

Multi-Click Time Span

User Identification

Crawler Clause

Crawler Identification

Crawler Count Report

Counter Code of Practice Draft 3

HTTP Status Code is 200 or 304.

for HTML 10s; for PDF 30s

at least IP, preferably Session

robots, prefetches, caching, federated searches(n.a.)

Black-List, client HTTP header

separate report

About LogEc

HTTP Status Code is 200, 206, 301, 302 or 304.

one calendar month

IP

robots, automated downloads (wget)

Access of robots.txt; # of requests 10,000 items/month; C-Class access 10% of stock; known robot-Domain/IP

separate column in report

Interoperable Repository Statistics

HTTP Status code is 200 on abstract or full-text page

24 hours

IP

search engine crawlers + automated

AWStats' black list

discarded

AWStats

Default: HTTP Status codes (200;304)

Default: 1 hour

IP

search engine crawlers

Black-List

separate column in report

IFABC

HTML: Tracking Pixel; Other: bytes transferred 95% of file size

Each Pageview is counted only once per visit. Visit means series of clicks coming from one IP-Number/Session-ID less than 30 minutes apart.

IP+User-Agent; Cookie-Session, Login-Session

search engine crawlers; automated downloads (optional)

proprietary Blacklist

discarded