How can I use BeautifulSoup to search for all links on a page pointing to a specific domain?

Question

How can I use BeautifulSoup to search for all links on a page pointing to a specific domain?

+4

python beautifulsoup

Juanjo conti Jan 28 '10 at 0:10

source share

1 answer

viksit · Answer 1 · 2010-01-28T00:23:30+0000

Use SoupStrainer,

from BeautifulSoup import BeautifulSoup, SoupStrainer import re # Find all links links = SoupStrainer('a') [tag for tag in BeautifulSoup(doc, parseOnlyThese=links)] linkstodomain = SoupStrainer('a', href=re.compile('example.com/'))

Edit: Modified example from white paper.

How can I use BeautifulSoup to search for all links on a page pointing to a specific domain?

More articles: