[consulting] Drupal Data Mining

paola.dimaio at gmail.com paola.dimaio at gmail.com
Thu Aug 30 16:05:44 UTC 2007


Sounds like you want to map an ontology?


> I am working on a project where I need to do some data mining within Drupal.
> Specifically, I need to index nodes to find keyworks and associate those
> nodes automatically with taxonomy terms. In addition, I need to store
> relationships between taxonomy terms when they appear in the same nodes.
>
First, you may want to establish exactly what kind of relationship you
want to map
(the granularity)

I expect you cannot easily find meaniGnful sets of 'keywords' without
some of kind of pre-existing schema - you may need extract all the
terms that apear to the be central (say that are in the title, or in
the first 2 paragraphs) , then get a human expert to 'edit' that list
into a shortlist of keywords that are representative of the
domain/data you are tyring to model

then you may want to parse the whole database against that list

similarly, you may want to articulate the keyword according to
meanigful patters, and do the same

afaik, its best done semi-manually (no machine can do that yet) unless
you already have an 'ontology' to start with

sorry if it sounds circular

I need someone to help with this task, and I know more or less how to
design the process
but need someone to help me do it

let me know if you are available to work together on this
cheers
PDM

Paola Di Maio

>
>
> Has anyone done something similar in the past or have any suggestions on
> where to start? At the most basic level, I could write a script that does a
> regex for each taxonomy term on each article entered, but that sounds like a
> processing beast and I would prefer to go another route.
>
> Thank you,
>  Michael Haggerty
>  Managing Partner
>  Trellon, LLC
>  http://www.trellon.com
>  (p) 301-577-6162
>  (c) 240-643-6561
>  (f) 413-691-9114
>  (aim) haggerty321
>
>
> _______________________________________________
> consulting mailing list
> consulting at drupal.org
> http://lists.drupal.org/mailman/listinfo/consulting
>
>


-- 
Paola Di Maio
School of IT
www.mfu.ac.th
*********************************************


More information about the consulting mailing list