Simple web crawler/spider
I am just developing a very simple web spider/crawler. Here is the code:
<?php
$seed = "http://www.akosblog.com";
$html = file_get_contents($seed);
echo "Page : " . $seed;
preg_match_all("/http:\/\/[^\"\s']+/", $html, $matches, PREG_SET_ORDER);
foreach ($matches as $val) {
echo "<br><font color=red>links :</font> " . $val[0] . "\r\n";
}
?>
So how could I do that?
Regards,
Akos
"Jamroom is a Profile Centric CMS system suitable as a development framework for building entire communities. Highly modular in concept. Suitable for enterprise level development teams or solo freelancers."
Android Project Ideas
Affordable, Wordpress plugins & Web Applications
Ralph Smith
Mercenary development and deployment.
:)
Professional Web Developer providing high quality Ecommerce Website Designing.