Read documentation View can be found on IE test drive
In the link you will find more specific recommendations on viewing the type of reading during extraction to determine the title of the article, date, author, publisher, image, signature and copyright. Below is an example that I copy and paste from a test disk.
Date Read View will provide publisher information and dates together on a single line with an additional style to highlight this information. The publication date of the article will be displayed exactly as it appears on the line. Reading View is not converted to a specific date format.
How results viewing works
Once the website is set up to read access to the view, the read view uses a series of heuristics to identify and then extract the appropriate content from the page to create a new page (in memory). The algorithm was developed using an example network to provide the highest possible coverage and accuracy. These heuristics look at HTML tags, node depth, image size and word count to determine what content on the page is the "main" content.
source share