I want to provide an alternative solution based on lxml.
The solution uses the function provided in lxml.cssselect
import urllib import lxml.html from lxml.cssselect import CSSSelector connection = urllib.urlopen('http://www.yourTargetURL/') dom = lxml.html.fromstring(connection.read()) selAnchor = CSSSelector('a') foundElements = selAnchor(dom) print [e.get('href') for e in foundElements]
source share