5 Interesting Open Source Projects

1、NWebCrawler

This is a web crawler program written in C#.Features:

Configuable: thread count, waiting time, connection timeout, allow MIME types and priorities, download folders.
Statstics information: URL count, total downloaded files, total downloaded bytes, CPU utility and available memory.
Preferential crawler: user can set priority for MIME types (high, above, normal, below, low).
Robust: 10+ URL normalization rules, crawler trap [...]

12 Open Source Microblogging Applications

1. StatusNet(Laconica)

StatusNet is a Twitter clone, built with PHP + MySQL. It’s an opensource microblogging application, widely used by lots of microblog sites, such as Twitterfeed, Hellotxt and Gravity. It’s mature and it offers a smooth usage. The source can be found here.

2. Jaiku Engine

Jaiku was the microblogging site of Google. However, they decided no [...]

HTML Parser Libraries for .Net

HTMLParser.Net

 HTMLParser.Net is a .Net library built on codebase of popular javabased HTMLParser available on sourceforge.net. If you are building applications that involve screen scrapping of HTML pages or data extraction from the web sites, then you definitely want to have a tool like HTMLParser.Net in your arsenal. Parsing of a page is as simple as [...]