This small PHP bot fetches a page and retrieves its content, then strips away the HTML and keeps only the text: Retrieve all text from a URL
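The original script is not reproduced here, so the following is only a minimal sketch of how such a bot might work. The function names `html_to_text` and `fetch_text` are hypothetical, and the fetch step assumes `allow_url_fopen` is enabled so that `file_get_contents` can read a URL.

```php
<?php
// Hypothetical helper: reduce an HTML string to its visible text.
function html_to_text(string $html): string
{
    // Remove <script> and <style> blocks entirely; strip_tags() alone
    // would drop the tags but keep their contents.
    $html = preg_replace('#<(script|style)\b[^>]*>.*?</\1>#is', ' ', $html) ?? $html;

    // Put a space before each tag so text from adjacent elements
    // (e.g. </h1><p>) does not get glued together once tags are removed.
    $html = str_replace('<', ' <', $html);

    // Strip the remaining tags, then decode entities such as &amp;.
    $text = strip_tags($html);
    $text = html_entity_decode($text, ENT_QUOTES | ENT_HTML5, 'UTF-8');

    // Collapse runs of whitespace into single spaces.
    return trim(preg_replace('/\s+/', ' ', $text) ?? $text);
}

// Hypothetical helper: download a page and return its text,
// or null if the request fails. Assumes allow_url_fopen is on.
function fetch_text(string $url): ?string
{
    $html = @file_get_contents($url);
    return $html === false ? null : html_to_text($html);
}
```

Calling `fetch_text('https://example.com/')` would then return the page text as one string, or `null` on failure. A regex-based cleanup like this is a rough approximation; for malformed or complex pages, a real parser such as `DOMDocument` is likely more robust.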