I recommend you consider simple_html_dom . It will be very easy.
Here is a working example of how to pull out the title and the first image.
<?php require 'simple_html_dom.php'; $html = file_get_html('http://www.google.com/'); $title = $html->find('title', 0); $image = $html->find('img', 0); echo $title->plaintext."<br>\n"; echo $image->src; ?>
Here is a second example that will do the same without an external library. I should note that using regex in HTML is NOT a good idea.
<?php $data = file_get_contents('http://www.google.com/'); preg_match('/<title>([^<]+)<\/title>/i', $data, $matches); $title = $matches[1]; preg_match('/<img[^>]*src=[\'"]([^\'"]+)[\'"][^>]*>/i', $data, $matches); $img = $matches[1]; echo $title."<br>\n"; echo $img; ?>
source share