-, nutch.apache.org.
webcrawler : , , . webcrawler URL-, -, , 101 . , , and-or-the, , , .
, . , , - .
, Google - , , google, . Pagerank http://en.wikipedia.org/wiki/PageRank , -, , . , xml sitemap, - . . gsitecrawler.com/- .
- Google , Google , , , , , - google, .
, - , SEO, , seomoz.com ... , .
, !, .