This is because the same links appear more than once in the page source.
You can easily make them unique by putting them in a set:
>>> print set(re.findall("res/[0-9]+", html))
set(['res/3833795', 'res/3837945', 'res/3835377', 'res/3837941', 'res/3837942',
'res/3837950', 'res/3100203', 'res/3836997', 'res/3837643', 'res/3835174'])
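Note that a set does not keep the order in which the links appear in the page. If the order matters to you, here is a minimal sketch of one way to handle it. It is written for Python 3, and the `html` string is a made-up sample standing in for your page source:

```python
import re

# Hypothetical sample; in practice `html` is the page source you downloaded.
html = (
    '<a href="res/3833795">a</a> '
    '<a href="res/3837945">b</a> '
    '<a href="res/3833795">a again</a>'
)

matches = re.findall(r"res/[0-9]+", html)

# dict keys are unique and keep insertion order (Python 3.7+),
# so this drops duplicates while preserving first-seen order.
unique_links = list(dict.fromkeys(matches))
print(unique_links)  # ['res/3833795', 'res/3837945']
```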
But if you are going to do something more complex, I recommend that you use a library that can parse HTML, such as BeautifulSoup or lxml.
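To give a rough idea of what the parser-based approach looks like, here is a sketch that uses `html.parser` from the Python 3 standard library instead of BeautifulSoup or lxml, only so that it runs without installing anything. The class name `ResLinkCollector` and the sample `html` string are made up for illustration, and it assumes each link's `href` is exactly `res/<digits>`:

```python
import re
from html.parser import HTMLParser


class ResLinkCollector(HTMLParser):
    """Hypothetical helper: collects unique hrefs of the form res/<digits>."""

    def __init__(self):
        super().__init__()
        self.links = set()

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the tag.
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value and re.fullmatch(r"res/[0-9]+", value):
                self.links.add(value)


# Made-up sample input. The text "res/999" is not inside an href,
# so the parser-based version ignores it, unlike a plain regex.
html = (
    '<p>res/999</p>'
    '<a href="res/3833795">x</a>'
    '<a href="res/3833795">y</a>'
    '<a href="other/1">z</a>'
)

collector = ResLinkCollector()
collector.feed(html)
print(sorted(collector.links))  # ['res/3833795']
```

The advantage over the plain regex is that only real `href` attributes on `<a>` tags are considered, so stray text that merely looks like a link is not picked up.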