[development] Drupal module for scraping information from an HTML/XML document

John Fiala jcfiala at gmail.com
Tue Nov 30 19:06:33 UTC 2010


These days, if I'm going to be trying to extract data from html/xml,
I'd use querypath.  Give it a try!

On Tue, Nov 30, 2010 at 11:56 AM, James Benstead
<james.benstead at gmail.com> wrote:
> What I'd like to do once a Resource has been added to the site is to scrape
> certain information from it: at this point I'm thinking the Title of the
> page the link points to and the provider of the resource - e.g., which
> Drupal shop originally created the resource. What's the best way to go about
> doing this? I'm pretty sure there's not a Drupal module that solves the
> problem out of the box.

-- 
John Fiala
www.jcfiala.net


More information about the development mailing list