Python xpath query does not return text value

Question

Python xpath query does not return text value

I am trying to clear the data from the next page using the lxml module in Python: http://www.thehindu.com/todays-paper/with-afspa-india-has-failed-statute-amnesty/article7376286.ece . I want to get the text in the first paragraph, but the following code returns a null value

from lxml import html
import requests

page = requests.get('http://www.thehindu.com/todays-paper/with-afspa-india-has-failed-statute-amnesty/article7376286.ece')
tree = html.fromstring(page.text)
data = tree.xpath('//*[@id="left-column"]/div[6]/p[1]/text()')
print data

I do not understand what I'm doing wrong here. Please suggest if there are better ways to do what I'm trying to do.

+1

python xpath web-scraping lxml

Saharsh agarwal Jul 9 '15 at 15:20

source share

2 answers

Brent d · Answer 1 · 2015-07-09T16:26:16+0000

Try //div[class='article-text']/p/text()

Piyush · Answer 2 · 2015-11-03T11:15:47+0000

you can use xpath as follows:

div[@class='article-text']/p[1]/text()

Python xpath query does not return text value

More articles: