Since this is easy in R, I use the rvest HTML parsing package to extract information from a website.
I am wondering what my User-Agent (if any) is during the request, since the User-Agent is assigned to the Internet browser or is there a way to install it somehow?
My code that opens a session and extracts information from HTML is below:
library(rvest) se <- html_session( "http://www.wp.pl" ) %>% html_nodes("[data-st-area=Glonews-mozaika] li:nth-child(7) a") %>% html_attr( name = "href" )
source share