What is the safest way to extract <title> from an HTML file using xpath?

Question

What is the safest way to extract <title> from an HTML file using xpath?

Here is my current xpath code "/html/head/title".

But you know that in the real world environment html the code format is usually interrupted, for example. The tag is <html>missing, it may cause an exception. So, I would like to know if there is a safe way to extract the tag <title>? (something like getElementByTagName)

+3

html xpath

silent Aug 18 '10 at 1:20

source share

5 answers

Due to the naughty nature of html markup, you should use the html parsing library. You did not specify a platform or language, but there are a number of open source libraries there.

+3

Paul sasik Aug 18 '10 at 1:25

source share

/html/head/title , , :

title;
HTML , ;
HTML- HTML .

HTML, /html/head/title[1], , .

+2

Alohci 18 . '10 8:13

javascript, :

document.title

+1

Topera 18 . '10 1:26

-, XML ( HTML, XPath), //title .

0

jwismar 18 . '10 1:26

meder omuraliev · Accepted Answer · 2010-08-18T01:25:14+0000

"//title" perhaps?

What is the safest way to extract <title> from an HTML file using xpath?

More articles: