1. NWebCrawler
This is a web crawler program written in C#. Features:
Configurable: thread count, waiting time, connection timeout, allowed MIME types and their priorities, download folders.
Statistics: URL count, total downloaded files, total downloaded bytes, CPU utilization and available memory.
Prioritized crawling: users can set a priority for each MIME type (high, above, normal, below, low).
Robust: 10+ URL normalization rules, crawler trap [...]
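The MIME-type priority feature can be pictured as a priority queue of pending URLs. The following is only a minimal sketch in Python, not NWebCrawler's actual C# code: the names Frontier, MIME_PRIORITY and crawl_order, the numeric ranks, and the sample MIME settings are all assumptions made for illustration. It also assumes the MIME type of a URL is already known (or guessed) when the URL is queued.

```python
import heapq
import itertools

# The five level names come from the feature list; the numeric ranks are an assumption.
PRIORITY_RANK = {"high": 0, "above": 1, "normal": 2, "below": 3, "low": 4}

# Hypothetical per-MIME-type settings, for illustration only.
MIME_PRIORITY = {
    "text/html": "high",
    "application/pdf": "normal",
    "image/jpeg": "low",
}


class Frontier:
    """Queue of pending URLs, ordered by the priority of their MIME type."""

    def __init__(self, mime_priority, default="normal"):
        self._mime_priority = mime_priority
        self._default = default          # level used for unlisted MIME types
        self._heap = []
        self._counter = itertools.count()  # tie-breaker: keeps FIFO order per level

    def push(self, url, mime_type):
        level = self._mime_priority.get(mime_type, self._default)
        entry = (PRIORITY_RANK[level], next(self._counter), url)
        heapq.heappush(self._heap, entry)

    def pop(self):
        _, _, url = heapq.heappop(self._heap)
        return url

    def __len__(self):
        return len(self._heap)


def crawl_order(items, mime_priority=MIME_PRIORITY):
    """Return the URLs from (url, mime_type) pairs in the order they would be fetched."""
    frontier = Frontier(mime_priority)
    for url, mime_type in items:
        frontier.push(url, mime_type)
    return [frontier.pop() for _ in range(len(frontier))]
```

With these assumed settings, HTML pages are fetched before PDFs and unlisted types, and JPEG images come last; URLs at the same level keep the order in which they were discovered.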
