I want to cancel some information on a web page. It uses a table layout structure.
I want to extract a third table inside a layout of a nested table that contains a series of nested tables. Each publication result. But the code does not work.
include('simple_html_dom.php'); $url = 'http://exams.keralauniversity.ac.in/Login/index.php?reslt=1'; $html = file_get_contents($url); $result =$html->find("table", 2); echo $result;
I used Curl to retrieve a website, but the problem is that its tags are not working, so it cannot be retrieved with a simple dom element.
function curl($url) { $ch = curl_init(); // Initialising cURL curl_setopt($ch, CURLOPT_URL,$url); // Setting cURL URL option with the $url variable passed into the function curl_setopt($ch, CURLOPT_RETURNTRANSFER, TRUE); // Setting cURL option to return the webpage data $data = curl_exec($ch); // Executing the cURL request and assigning the returned data to the $data variable curl_close($ch); // Closing cURL return $data; // Returning the data from the function } function scrape_between($data, $start, $end){ $data = stristr($data, $start); // Stripping all data from before $start $data = substr($data, strlen($start)); // Stripping $start $stop = stripos($data, $end); // Getting the position of the $end of the data to scrape $data = substr($data, 0, $stop); // Stripping all data from after and including the $end of the data to scrape return $data; // Returning the scraped data from the function } $scraped_page = curl($url); // Executing our curl function to scrape the webpage http://www.example.com and return the results into the $scraped_website variable $scraped_data = scrape_between($scraped_page, ' </html>', '</table></td><td></td></tr> </table>'); echo $scraped_data; $myfile = fopen("newfile.html", "w") or die("Unable to open file!"); fwrite($myfile, $scraped_data); fclose($myfile);
How to clear result and save PDF
source share