I want to clear the homepage of one of the new stackexchange websites: https://webapps.stackexchange.com/ (only once and for just a few pages, nothing that should bother the servers). If I wanted this from stackoverflow, I know there is a database dump, but for the new stackexchange they do not exist yet.
Here is what I want to do.
Step 1: Choose URL
URL <- "https://webapps.stackexchange.com/"
Step 2: read the table
readHTMLTable(URL)
Step 2: try it with XML this time
htmlTreeParse(URL)
So, I was able to read the page, but now the structure is in a div. How can it be used now to create the same as readHTMLTable?
source
share