How to pass data to another function from a class (in HTMLParser)?

Question

I'm just starting to learn Python. I'm using Python 3.1.

I have never studied OOP before, so I'm confused by HTMLParser.

    from html.parser import HTMLParser

    class parser(HTMLParser):
        def handle_data(self, data):
            print(data)

    p = parser()
    page = """<html><h1>title</h1><p>I'm a paragraph!</p></html>"""
    p.feed(page)

This prints:

title
I'm a paragraph!

Instead of printing this data, I want to pass it to another function. How should I do that?

Sorry for my poor English and thanks for your help!

+4

function python class

zjk Feb 02 '10 at 15:08

2 answers

Just an example:

    def my_global_fun(data):
        print "processing", data

    class parser(HTMLParser):
        def my_member_fun(self, data):
            print "processing", data

        def handle_data(self, data):
            self.my_member_fun(data)  # or: my_global_fun(data)
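The snippet above uses Python 2 print statements. Since the question mentions Python 3.1, here is a rough Python 3 sketch of the same idea, in which the target function is handed to the parser as a callback. The names `process`, `CallbackParser`, `on_text` and `results` are made up for illustration and are not part of `html.parser`.

```python
from html.parser import HTMLParser

def process(data):
    # Hypothetical stand-in for "another function": tag the text and return it.
    return "processing " + data

class CallbackParser(HTMLParser):
    def __init__(self, on_text):
        super().__init__()      # let HTMLParser set up its own state
        self.on_text = on_text  # the function that should receive each text chunk
        self.results = []       # whatever that function returns is kept here

    def handle_data(self, data):
        # HTMLParser calls this for each run of text between tags.
        self.results.append(self.on_text(data))

p = CallbackParser(process)
p.feed("""<html><h1>title</h1><p>I'm a paragraph!</p></html>""")
print(p.results)
```

Passing the function in, rather than hard-coding its name inside `handle_data`, lets the same parser class be reused with different functions.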

Good luck learning OOP!

+2

ron Feb 02 '10 at 15:13

sberry · Accepted Answer · 2010-02-02T15:35:25+0000

I have not looked at the HTMLParser module itself, but I can see that feed ends up calling handle_data, which in your derived class prints the data. @ron's answer passes the data directly to your function, which is perfectly fine. However, since you are new to OOP, you might want to take a look at this code.

This is Python 2.x, but I think the only thing that changes in 3.x is the location of the module: html.parser instead of HTMLParser.

    from HTMLParser import HTMLParser

    class MyParser(HTMLParser):
        def handle_data(self, data):
            self.output.append(data)

        def feed(self, data):
            self.output = []
            HTMLParser.feed(self, data)

    p = MyParser()
    page = """<html><h1>title</h1><p>I'm a paragraph!</p></html>"""
    p.feed(page)
    print p.output

Output:

    ['title', "I'm a paragraph!"]

Here I override HTMLParser's feed method. Now, when p.feed(page) is called, it runs my method, which creates an instance variable called output, sets it to an empty list, and then calls the feed method of the base class (HTMLParser), which goes on to do what it normally does. So by overriding feed I was able to do some extra work (adding the new output variable). handle_data is likewise an overridden method. In fact, HTMLParser's own handle_data does nothing at all (as per the docs).

So, just to clarify ...

You call p.feed(page), which calls the MyParser.feed method. MyParser.feed sets self.output to an empty list, then calls HTMLParser.feed. As the base class parses the page, it calls handle_data for each piece of text, and handle_data appends that string to the end of the output list.

You now have access to the data through the p.output attribute, and you can pass it to any function you like.
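For Python 3, which the question uses, the same approach might look roughly like the sketch below. It swaps the explicit HTMLParser.feed(self, data) call for super().feed(data) and uses print as a function. `shout` is a hypothetical function, included only to show the collected data being handed to other code.

```python
from html.parser import HTMLParser

class MyParser(HTMLParser):
    def feed(self, data):
        # Start each feed() call with a fresh list, then let the base class parse.
        self.output = []
        super().feed(data)

    def handle_data(self, data):
        # Called by the base class for each run of text between tags.
        self.output.append(data)

def shout(chunks):
    # Hypothetical consumer of the parsed text: upper-case every chunk.
    return [chunk.upper() for chunk in chunks]

p = MyParser()
page = """<html><h1>title</h1><p>I'm a paragraph!</p></html>"""
p.feed(page)
print(p.output)

loud = shout(p.output)
print(loud)
```

Because output is reset inside feed, each call to p.feed() discards the results of the previous one. Moving the reset into __init__ would instead accumulate text across several feed() calls.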
