I am trying to create a web crawler.
I need 2 things:
- Convert HTML to a DOM object.
- Run the page's existing JavaScript on demand.
The result I expect is a DOM object in which the JavaScript that runs on load has already been executed.
In addition, I need an option to execute additional JavaScript on request (for handlers such as onMouseOver, onMouseClick, etc.).
First of all, I could not find a good source of documentation.
I looked at the WebKit homepage, but could not find much information for users of the package, nor any useful examples. In addition, on some forums I saw advice not to use the WebKit interface for crawlers, but to use the internal DOM and JavaScript packages directly.
I am looking for:
- Documentation
- Code examples
- Any recommendations for proper use
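To make the first requirement concrete, here is a minimal sketch of what "HTML to a DOM object" plus "find the handlers to run on request" could look like. It uses only Python's standard `html.parser` module and is not WebKit code. The names `Node`, `TreeBuilder`, `parse_html` and `inline_handlers` are hypothetical and chosen just for this illustration. It only builds the tree and lists the inline `on*` attributes; it does not execute any JavaScript, which is the part that would still need a real engine.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so the builder must not descend into them.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class Node:
    """A very small DOM-like node (hypothetical, for illustration only)."""

    def __init__(self, tag, attrs=None, parent=None):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.parent = parent
        self.children = []
        self.text = ""


class TreeBuilder(HTMLParser):
    """Builds a tree of Node objects from the parser's tag events."""

    def __init__(self):
        super().__init__()
        self.root = Node("#document")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs, self.current)
        self.current.children.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_endtag(self, tag):
        # Climb to the nearest open element with this tag; ignore stray end tags.
        node = self.current
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent

    def handle_data(self, data):
        self.current.text += data


def parse_html(html):
    """Parse an HTML string and return the root of the Node tree."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


def inline_handlers(node):
    """Yield (tag, event attribute, script source) for every on* attribute."""
    for name, value in node.attrs.items():
        if name.startswith("on"):
            yield node.tag, name, value
    for child in node.children:
        yield from inline_handlers(child)


if __name__ == "__main__":
    page = ('<html><body onload="init()">'
            '<a href="#" onMouseOver="highlight(this)">link</a><br>'
            '</body></html>')
    for tag, event, script in inline_handlers(parse_html(page)):
        print(tag, event, script)
```

Note that `html.parser` lowercases attribute names, so `onMouseOver` is reported as `onmouseover`. The scripts found this way are plain strings; running them on load or on request is exactly the piece I am missing.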
Workspace: