Scraping a site using the login page

I am currently logging into the site with the following script.

from selenium import webdriver
import time

browser = webdriver.Chrome('E:/Shared Folders/Users/runnerjp/chromedriver/chromedriver.exe')
browser.get("https://www.timeform.com/horse-racing/account/sign-in?returnUrl=%2Fhorse-racing%2F")
time.sleep(3)  # wait for the page to load
username = browser.find_element_by_id("EmailAddress")
password = browser.find_element_by_id("Password")
username.send_keys("usr")
password.send_keys("pass")
login_attempt = browser.find_element_by_xpath("//input[@type='submit']")
time.sleep(3)
login_attempt.submit()

It works, but I find that the Chrome web driver hogs my processor. Is there alternative code I could use that does not require physically loading the login page?

9 answers

It depends on the site and on how its login works.
If the login page relies on javascript/ajax, you need something that actually executes javascript. In that case, a "headless" alternative is phantomjs: it loads the HTML and runs the javascript like a real browser, but without Chrome's overhead.

"", requests. , - "" , http ( ):

import requests
requests.get('https://api.github.com/user', auth=('user', 'pass'))
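The snippet above uses HTTP basic auth; the site in the question uses a form login instead. Below is a minimal, hedged sketch of that form POST with requests.Session. The field names "EmailAddress" and "Password" come from the question's Selenium code; whether the form also requires a hidden anti-forgery token is an assumption you must verify against the real page source.

```python
# Hedged sketch of a form-based login with requests.Session.
# Field names are taken from the question's Selenium code; check the
# page source for any additional hidden fields the form may require.
import requests

SIGN_IN_URL = ("https://www.timeform.com/horse-racing/account/sign-in"
               "?returnUrl=%2Fhorse-racing%2F")

def build_login_payload(email, password):
    """Form fields the sign-in page appears to expect."""
    return {"EmailAddress": email, "Password": password}

def login(email, password):
    session = requests.Session()   # keeps cookies between requests
    session.get(SIGN_IN_URL)       # collect any session cookies first
    resp = session.post(SIGN_IN_URL,
                        data=build_login_payload(email, password))
    resp.raise_for_status()
    return session                 # reuse for authenticated requests
```

After `login()` returns, `session.get(...)` on protected pages should carry the authentication cookies automatically.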

CodeMonkey



The requests library is straightforward to use. For example:

import requests
requests.get('https://api.github.com/user', auth=('user', 'pass'))

Documentation: http://docs.python-requests.org/en/master/user/authentication/


Take a look at TestCafe.


TestCafe is a free, open-source framework for end-to-end (e2e) web testing. TestCafe is built on Node.js and does not use WebDriver.

TestCafe interacts with the page directly in the browser. To work with DOM elements, TestCafe provides Selectors. To run your own JavaScript on the page, TestCafe offers the ClientFunction API (see the documentation).

TestCafe does not require WebDriver or browser plugins, so setup is quick.

To install and run TestCafe:

1) Install Node.js (if you do not already have it).

2) To install TestCafe, open cmd and run:

npm install -g testcafe

Now write a test. For example: 1) Create a file "test.js"

import { Selector } from 'testcafe';

fixture `Getting Started`
    .page `http://devexpress.imtqy.com/testcafe/example`;

test('My first test', async t => {
    await t
        .typeText('#developer-name', 'John Smith')
        .click('#submit-button')
        .expect(Selector('#article-header').innerText).eql('Thank you, John Smith!');
});

2) To run the test (in Chrome, for example), run in cmd:

testcafe chrome test.js

3) Check the test results in the console.

TestCafe can run tests in many browsers: local ones (including headless browsers, browsers on a Raspberry Pi, and Safari on iOS), cloud browsers (e.g. Sauce Labs), and headless engines (e.g. Nightmare). TestCafe also integrates with Continuous Integration systems.

You can use the same approach to scrape data and save it to a file easily.

Another option is the 'grab' library:

from grab import Grab

g = Grab()
g.go('https://www.timeform.com/horse-racing/account/sign-in?returnUrl=%2Fhorse-racing%2F')
g.doc.set_input('EmailAddress','some@email.com')
g.doc.set_input('Password','somepass')
g.doc.submit()

print(g.doc.body)

With mechanize, logging in took about 3.2 seconds:

from mechanize import Browser
import time    #just to check elapsed time and check performance
started_time = time.time()

browser = Browser()
url = 'https://www.timeform.com/horse-racing/account/sign-in?returnUrl=%2Fhorse-racing%2F'
browser.open(url)
browser.select_form(nr = 0)
browser["EmailAddress"] = 'putyouremailhere'
browser["Password"] = 'p4ssw0rd'

logged = browser.submit()
redirected_url = logged.read()
print(redirected_url)

#you can delete this section:
elapsed_time = time.time() - started_time
print(elapsed_time, 'seconds')

Good luck! :)



Remember that whenever you log in to a site, your browser simply sends an HTTP request (usually a POST) with your credentials to some URL.

You could build that request yourself with urllib.

That said, urllib is fairly verbose for this kind of work, so I would use requests instead; its API is much simpler.
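For comparison, here is a minimal standard-library sketch of such a form POST with urllib. The URL and field names are illustrative placeholders, not the real site's.

```python
# Building a form POST with only the standard library (urllib).
# The URL and field names below are illustrative placeholders.
from urllib.parse import urlencode
from urllib.request import Request

def make_login_request(url, fields):
    """Encode form fields and wrap them in a POST request."""
    data = urlencode(fields).encode("utf-8")
    return Request(url, data=data)  # a body makes urllib send a POST

req = make_login_request("https://example.com/sign-in",
                         {"user_id": "id", "user_pwd": "pwd"})
print(req.get_method())  # POST
```

Sending it would then be a matter of `urllib.request.urlopen(req)`; cookie handling requires extra plumbing (`http.cookiejar`), which is exactly why requests is the more convenient choice.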

Requests

For example:

Note: suppose the page shows two fields, id and pwd, but under the hood the form actually submits them as user_id and user_pwd; then you must send user_id and user_pwd, not id and pwd. Likewise, inspect the form (e.g. with your browser's developer tools) to find the "submit URL" it posts to.

import requests

dataToSend = {'user_id': 'id you want to pass', 'user_pwd': 'specify pwd here'}
# Specify headers and cookies here if the site requires them
response = requests.post(url, data=dataToSend,
                         headers={'content-type': 'specify if required',
                                  'user-agent': 'chrome...'})

if response.status_code == 200:
    contentReceived = response.content
    # Inspect the received content; most of the time it will be JSON,
    # so you may need to decode it here.
    if contentReceived == 'Response is same that you have expected':
        print("Success")
    else:
        print("Failed")
else:
    print("Failed")

Depending on the site, you may also need to send the right headers and cookies; check what the browser sends and mirror it.
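To illustrate that point, here is a hedged sketch of attaching headers and cookies explicitly with requests; the header and cookie values are made-up examples, not what the site actually requires.

```python
# Attaching headers and cookies to a request explicitly.
# The values below are made-up examples for illustration only.
import requests

req = requests.Request(
    "POST", "https://example.com/login",
    data={"user_id": "id"},
    headers={"User-Agent": "Mozilla/5.0"},
    cookies={"sessionid": "value-from-an-earlier-response"},
)
prepared = req.prepare()  # resolves headers and the cookie jar
print(prepared.headers["User-Agent"])             # Mozilla/5.0
print("sessionid" in prepared.headers["Cookie"])  # True
```

In practice you rarely build requests by hand like this; `requests.Session` manages cookies for you, and per-request `headers=` / `cookies=` keyword arguments cover the rest.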


Source: https://habr.com/ru/post/1678511/

