This small php bot goes on a page and retrieves its content, than parse it and finds infrmation. Has some problems with some redirects. You can find PHP source code on the related post: Bot that retrieves url meta data and other infos