Register a function to be called on every HTML element matched by the selector parameter.
To process different kinds of HTML elements, register multiple onHTML callbacks, one per selector.
element selector
callback function
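The registration pattern described above can be sketched as follows. This is a minimal illustration, not the module's implementation: `MiniCrawler` and its `on_html` method are hypothetical stand-ins for the documented onHTML, and the "selector" is assumed to be a plain tag name rather than a full selector syntax.

```python
from html.parser import HTMLParser


class MiniCrawler(HTMLParser):
    """Hypothetical stand-in for the crawler; selectors are plain tag names only."""

    def __init__(self):
        super().__init__()
        self._html_callbacks = []  # (selector, callback) pairs, in registration order

    def on_html(self, selector, callback):
        # Mirrors the documented onHTML(selector, callback) registration.
        self._html_callbacks.append((selector, callback))

    def handle_starttag(self, tag, attrs):
        # Invoke every callback whose selector matches this element.
        for selector, callback in self._html_callbacks:
            if selector == tag:
                callback(tag, dict(attrs))


links, images = [], []
crawler = MiniCrawler()
crawler.on_html("a", lambda tag, attrs: links.append(attrs.get("href")))
crawler.on_html("img", lambda tag, attrs: images.append(attrs.get("src")))
crawler.feed('<p><a href="/docs">Docs</a><img src="/logo.png"><a href="/faq">FAQ</a></p>')
crawler.close()
```

Each element type is handled by its own callback, so link handling and image handling stay independent of each other.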
Register a function to be called on every request.
With the onRequest callback function, you can customize the HTTP request before it is executed.
callback function
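As a rough sketch of customizing a request before it is sent, the example below runs registered callbacks against a request object. `DraftRequest`, its `headers` field, and `run_request_callbacks` are assumptions made for illustration; the module's real request object may expose different fields.

```python
class DraftRequest:
    """Hypothetical request object; the real module's fields may differ."""

    def __init__(self, url):
        self.url = url
        self.headers = {}


def run_request_callbacks(request, callbacks):
    # Apply every registered callback, in order, before the request would be sent.
    for callback in callbacks:
        callback(request)
    return request


def set_user_agent(request):
    request.headers["User-Agent"] = "example-crawler/0.1"


def prefer_english(request):
    request.headers["Accept-Language"] = "en"


prepared = run_request_callbacks(
    DraftRequest("https://example.com/"),
    [set_user_agent, prefer_english],
)
```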
Register a function to be called on every response.
The onResponse callback function is called after the response has been received.
callback function
Register a function to be called on every response once the headers and status have been received but before the body is read.
As in onRequest, you can call request.abort to abort the transfer. This is useful if, for example, you are following all hyperlinks but want to avoid downloading files.
callback function
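The "follow every link but skip file downloads" case can be sketched as below. The transfer is simulated: `PendingRequest`, `fetch`, and the callback signature `(request, headers)` are hypothetical, and only the idea of calling an abort method before the body is read comes from the description above.

```python
class PendingRequest:
    """Hypothetical request object exposing an abort() method."""

    def __init__(self, url):
        self.url = url
        self.aborted = False

    def abort(self):
        self.aborted = True


def skip_non_html(request, headers):
    # Header-stage callback: abort anything that is not an HTML page.
    content_type = headers.get("Content-Type", "")
    if not content_type.startswith("text/html"):
        request.abort()


def fetch(url, headers, read_body, on_response_headers):
    # Simulated transfer: headers arrive first; the body is read only if not aborted.
    request = PendingRequest(url)
    on_response_headers(request, headers)
    if request.aborted:
        return None
    return read_body()


page = fetch("https://example.com/", {"Content-Type": "text/html; charset=utf-8"},
             lambda: "<html></html>", skip_non_html)
archive = fetch("https://example.com/data.zip", {"Content-Type": "application/zip"},
                lambda: "zip bytes", skip_non_html)
```

In this sketch the body of the zip file is never read, because the callback aborts the request as soon as the headers reveal its content type.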
Register a function to be called at the end of scraping. The function is executed after the onHTML callbacks, as the final step of scraping.
callback function
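The ordering described above, with the end-of-scraping function running only after all HTML callbacks, might look like the following sketch. `LifecycleSketch`, `on_html`, `on_scraped`, and `scrape` are hypothetical names chosen for illustration; the description does not name the real method.

```python
class LifecycleSketch:
    """Hypothetical model of callback ordering within one scrape."""

    def __init__(self):
        self._html_callbacks = []
        self._scraped_callbacks = []

    def on_html(self, callback):
        self._html_callbacks.append(callback)

    def on_scraped(self, callback):
        self._scraped_callbacks.append(callback)

    def scrape(self, elements):
        # HTML callbacks run first, once per element.
        for element in elements:
            for callback in self._html_callbacks:
                callback(element)
        # End-of-scraping callbacks run last, regardless of registration order.
        for callback in self._scraped_callbacks:
            callback()


events = []
sketch = LifecycleSketch()
sketch.on_scraped(lambda: events.append("scraped"))  # registered first, still runs last
sketch.on_html(lambda element: events.append("html:" + element))
sketch.scrape(["h1", "p"])
```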
This is the crawler module's main class.
Example