@URL: Maybe in hurry, I did not explain it properly. This was exactly what I was doing. look at the project, https://sourceforge.net/projects/absoluteurl/scheme: http
until I came across this image url on the google images page:
What would you consider here the query string or the path? Anything we do, the URL is going to break unless we do nothing. Any ideas?(may be just for encode the spaces).
--
Regards,
Nitin Kumar Gupta
http://publicmind.in/blog/
On Sat, Oct 3, 2009 at 2:22 AM, Domenic Santangelo <domenic@workhabit.com> wrote:
nitin gupta wrote:Try this example: http://westhoffswelt.de/blog/0036_xpath_to_select_html_by_class.htmlThanks for your help. but this query will probably select "abcfooxyz" as well when "foo" is supplied. (untested), although it will definitely select "foo" in "foo bar" (tested)
$xpath->query("//div[contains(@class, '" . $class . "')]")
How can we be more specific?"q=hello" is a query string, not strictly part of the path. I would strip the url to its component parts (parse_url) and encode the path, then re-append the query string.
@URL: Actually I am maintaining the module feedapi imagegrabber, which downloads images from external websites. Now sometimes the url I parse has spaces, so I am unable to decide whether or not to percentage encode the URL, because percentage encoding will make this URL valid but will break the following URL:
http://www.google.com/search?q=helloby converting it to
http://us2.php.net/manual/en/function.parse-url.php
http://www.faqs.org/rfcs/rfc1738.html
HTH,
-D
Looking forward.
--
Regards,
Nitin Kumar Gupta
http://publicmind.in/blog/
On Sat, Oct 3, 2009 at 1:30 AM, Domenic Santangelo <domenic@workhabit.com> wrote:
>
> nitin gupta wrote:
>
> for eg <div class="foo"></div> is selected by the following query.
> $class = "foo";
> $xpath->query("//*[@class = '".$class."']");
> But, this fails in case of multiple classes, i.e. does not select this :
> <div class="foo bar"> </div>
>
> The "contains" function would probably work, eg (untested):
> $xpath->query("//div[contains(@class, '" . $class . "')]");
>
> One more question: is this url valid : http://xyz.com/path with space.html
>
> Nope. You can escape the url if you just HAVE to have spaces:
>
> http://n00b.com/path%20with%20space.html
>
> but why not use dashes?
>
> http://pro.com/path-with-properly-indexed-spaces.html
>
> -Dom