[development] Xpath query for class matching

nitin gupta nitingupta.iitg at gmail.com
Fri Oct 2 21:41:43 UTC 2009


Yeah, I understand . it is the thumbnail maker for google images page. But
if I want to have a specific script for this conversion, one can never be
sure. If you notice, there are two colons in the query, which if percent
encoded will render the URL invalid.
Thanks for all your inputs.

--
Regards,
Nitin Kumar Gupta
http://publicmind.in/blog/


On Sat, Oct 3, 2009 at 3:06 AM, Domenic Santangelo <domenic at workhabit.com>wrote:

>  nitin gupta wrote:
>
> @URL: Maybe in hurry, I did not explain it properly. This was exactly what
> I was doing. look at the project,
> https://sourceforge.net/projects/absoluteurl/
>  until I came across this image url on the google images page:
>
>
> http://t3.gstatic.com/images?q=tbn:7i1D2KAZcCd8yM:http://www.flash-slideshow-maker.com/images/help_clip_image004.jpg
>
>  What would you consider here the query string or the path? Anything we
> do, the URL is going to break unless we do nothing. Any ideas?(may be just
> for encode the spaces).
>
>  scheme: http
> host: t3.gstatic.com<http://t3.gstatic.com/images?q=tbn:7i1D2KAZcCd8yM:http://www.flash-slideshow-maker.com/images/help_clip_image004.jpg>
> path: images<http://t3.gstatic.com/images?q=tbn:7i1D2KAZcCd8yM:http://www.flash-slideshow-maker.com/images/help_clip_image004.jpg>
> query:
> q=tbn:7i1D2KAZcCd8yM:http://www.flash-slideshow-maker.com/images/help_clip_image004.jpg<http://t3.gstatic.com/images?q=tbn:7i1D2KAZcCd8yM:http://www.flash-slideshow-maker.com/images/help_clip_image004.jpg>
>
> That url is some sort of thumbnail maker or something... compare to
> http://www.flash-slideshow-maker.com/images/help_clip_image004.jpg
>
> -D
>
>
>
>  --
> Regards,
> Nitin Kumar Gupta
> http://publicmind.in/blog/
>
>
> On Sat, Oct 3, 2009 at 2:22 AM, Domenic Santangelo <domenic at workhabit.com>wrote:
>
>>  nitin gupta wrote:
>>
>> Thanks for your help. but this query will probably select "abcfooxyz" as
>> well when "foo" is supplied. (untested), although it will definitely select
>> "foo" in "foo bar" (tested)
>> $xpath->query("//div[contains(@class, '" . $class . "')]")
>>
>>  How can we be more specific?
>>
>>  Try this example:
>> http://westhoffswelt.de/blog/0036_xpath_to_select_html_by_class.html
>>
>>  @URL: Actually I am maintaining the module feedapi imagegrabber, which
>> downloads images from external websites. Now sometimes the url I parse has
>> spaces, so I am unable to decide whether or not  to percentage encode the
>> URL, because percentage encoding will make this URL valid but will break the
>> following URL:
>> http://www.google.com/search?q=hello
>> by converting it to
>> http://www.google.com/search?q%3Dhello
>>
>>  "q=hello" is a query string, not strictly part of the path. I would strip
>> the url to its component parts (parse_url) and encode the path, then
>> re-append the query string.
>>
>> http://us2.php.net/manual/en/function.parse-url.php
>> http://www.faqs.org/rfcs/rfc1738.html
>>
>> HTH,
>>
>> -D
>>
>>
>>  Looking forward.
>>
>> --
>> Regards,
>> Nitin Kumar Gupta
>> http://publicmind.in/blog/
>>
>>
>> On Sat, Oct 3, 2009 at 1:30 AM, Domenic Santangelo <domenic at workhabit.com>
>> wrote:
>> >
>> > nitin gupta wrote:
>> >
>> > for eg <div class="foo"></div> is selected by the following query.
>> > $class = "foo";
>> > $xpath->query("//*[@class = '".$class."']");
>> > But, this fails in case of multiple classes, i.e. does not select this :
>> > <div class="foo bar"> </div>
>> >
>> > The "contains" function would probably work, eg (untested):
>> > $xpath->query("//div[contains(@class, '" . $class . "')]");
>> >
>> > One more question: is this url valid : http://xyz.com/path with
>> space.html
>> >
>> > Nope. You can escape the url if you just HAVE to have spaces:
>> >
>> > http://n00b.com/path%20with%20space.html
>> >
>> > but why not use dashes?
>> >
>> > http://pro.com/path-with-properly-indexed-spaces.html
>> >
>> > -Dom
>>
>>
>>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.drupal.org/pipermail/development/attachments/20091003/6cb1bfbc/attachment.htm>


More information about the development mailing list