cobweb 1.2.1
Cobweb is a web crawler that can use resque to cluster crawls to quickly crawl extremely large sites which is much more performant than multi-threaded crawlers. It is also a standalone crawler that has a sophisticated statistics monitoring interface to monitor the progress of the crawls.
Gemfile:
=
instalar:
=
Runtime Dependencies (9):
addressable
>= 2.3.8
haml
>= 4.0.7
json
>= 1.8.3
nokogiri
>= 1.6.0
rake
>= 0
redis
>= 3.2.1
redis-namespace
>= 1.6.0
sinatra
>= 1.4.6
slop
>= 4.2.0