C# WebClient - View source question

I am using the C# WebClient to post information to a login page and read back all the results.

The page I'm trying to load includes Flash (which is translated into HTML in the browser). I assume it is done this way so that search engines don't pick it up.

The Flash I'm interested in is just text (not images, video, etc.), and when I use "View Selection Source" in Firefox, I do see the text I want within the HTML.

(Interestingly, when I view the source for the whole page, I don't see the text I want within the HTML. Could this be related?)

Currently, after I have posted my login details and loaded the HTML that comes back, I see a page that does NOT show the Flash HTML (as if I had viewed the source for the whole page).

Thanks in advance,

Jim

PS: I should point out that the POST is actually working; my login is successful.

2 answers

Fiddler ( ) , . fiddler, , , . , , , -, , , HTML, .

( "scraping 101" ) - , . , , , , .

, :

  • / ., , cookie / , ( ) . , cookie. , cookie ( , "cookieless sessions" ), " cookie", ( ). , cookie, , , , ( cookie).
  • - cookie . , ( cookie) .
  • POST vs. GET, , , HTTP-, .
  • ( !) , , , , . , HTML.
  • HTTP-., , , , (-cookie) . , , . .
  • . script (, " Flash, -" ), . WebRequest: WebRequest ContentType = "application/xhtml + xml, text/xml, text/html; charset = utf-8" ? , -. : .NET , HttpWebRequest ( WebClient) , , WebClient cookie (-). . .
  • (, ajax, flash ..). ( HTTP-) , . , , HTTP , , , , . , ajax, script . , , , , script.
  • ordering - : HTTP- , -. , (, ). , text/html, HTML ajax/flash/etc. .
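The point in the list above about WebClient, HttpWebRequest and cookies can be illustrated with a minimal sketch. WebClient does not carry cookies from one request to the next by default, so a login POST followed by a second request may arrive without the session cookie. The class name `CookieAwareWebClient`, the helper `FetchAfterLogin`, and its parameters are hypothetical and not taken from the question; this is one possible approach, not necessarily the fix for this particular site.

```csharp
using System;
using System.Collections.Specialized;
using System.Net;

// Hypothetical helper: a WebClient that attaches one shared
// CookieContainer to every HTTP request it creates, so cookies set by
// the login response are sent again on later requests.
public class CookieAwareWebClient : WebClient
{
    public CookieContainer Cookies { get; } = new CookieContainer();

    protected override WebRequest GetWebRequest(Uri address)
    {
        WebRequest request = base.GetWebRequest(address);
        if (request is HttpWebRequest http)
        {
            // Reuse the same container for every request from this client.
            http.CookieContainer = Cookies;
        }
        return request;
    }

    // Hypothetical usage: POST the login form, then fetch a second page
    // with the same cookies. URLs and form field names are supplied by
    // the caller because they are not given in the question.
    public static string FetchAfterLogin(
        string loginUrl, string resultUrl, NameValueCollection loginForm)
    {
        using (var client = new CookieAwareWebClient())
        {
            client.UploadValues(loginUrl, "POST", loginForm);
            return client.DownloadString(resultUrl);
        }
    }
}
```

Note that this only affects what is sent over HTTP. If the text is inserted into the page by script after loading, it will still be absent from the downloaded HTML.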

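(Interestingly, when I view the source for the whole page, I don't see the text I want within the HTML. Could this be related?)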

, DOM javascript . javascript , .


Source: https://habr.com/ru/post/1747998/