. "Access-Control-Allow-Origin", , .
The easiest way to implement such a crawler is to write addon (firefox) or extension (chrome), and then enter your javascript code on the visited page. Thus, you will see exactly what the author of the document sees. You can simply call the document, body.innerText, and then send the content to your server for indexing.
I have a crawler that works with multiple browsers on different IP address traversals.
source
share