<$BlogRSDUrl$> Marcus P. Zillman, M.S., A.M.H.A. Author/Speaker/Consultant
Marcus P. Zillman, M.S., A.M.H.A. Author/Speaker/Consultant
Internet Happenings, Events and Sources


Sunday, December 05, 2004  

YACY Distributed Web Crawler
http://freshmeat.net/projects/yacyproxy/?branch_id=51198&release_id=179952

YACY is a distributed Web crawler and also a caching HTTP/HTTPS proxy. Pages that pass through the proxy are indexed and can be searched using a built-in HTTP server. YACY peers connect each other and form a P2P-based index exchange network based on distributed hash tables. Explicit web crawls can be done locally or collaboratively, forming a global search and distributed indexing engine for the Web. It also provides URL filtering with blacklist sharing among other proxy peers, individual Web/servlet page hosting, a file sharing zone, and a database engine. This has been added to the search engines section of the 2005 Internet MiniGuides.

posted by Marcus Zillman | 4:15 AM
archives
subject tracers™