On Tue, 1 Mar 2005, Steve Dondley wrote:
A properly set up Drupal site should handle any characters through UTF-8 and correctly convert everything into it. If you have old content, convert it to UTF-8 with iconv before importing. Oh, and note that stuff like "curly quotes" is actually not part of ISO-8859-1; it comes from Windows-1252.
Yes, I understand the distinction between ISO-8859-x and Windows-1252.
At any rate, the content is coming in from e-mails (which are often written by people using MS applications) handled via the mailhandler module. These are not converted to UTF-8, as best as I can tell.
Indeed they aren't.
I have already decided against changing the output format to ISO-8859-1 and changed it back to UTF-8. Instead, I'm in the middle of addressing this problem by inserting conversion functions into the mailhandler module. However, PHP needs to be compiled with "--with-iconv" in order for this to work. This can be a problem for some users.
But you say that Drupal already handles this conversion. I'm not so sure about that. I did a grep on 'iconv' and the only place I saw it used was in an XML parser function in common.inc for the feeds.
Right. It would help if that part of the function was in a separate function. Then mailhandler could call this function.
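As a rough sketch of what such a shared function might look like: the name example_convert_to_utf8 is hypothetical (it is not an existing Drupal function), and the mbstring fallback for PHP builds without iconv is my own assumption rather than anything common.inc does today.

```php
<?php
// Hypothetical helper, not part of Drupal: convert $data from the charset
// declared by the sender to UTF-8. Returns FALSE if no conversion is possible.
function example_convert_to_utf8($data, $charset) {
  $charset = strtolower(trim($charset));

  // Nothing to do if the text is already declared as UTF-8.
  if ($charset == 'utf-8' || $charset == 'utf8') {
    return $data;
  }

  // Preferred path: iconv, when PHP was built with it.
  if (function_exists('iconv')) {
    $out = @iconv($charset, 'UTF-8', $data);
    if ($out !== FALSE) {
      return $out;
    }
  }

  // Assumed fallback: mbstring, for installations without iconv.
  if (function_exists('mb_convert_encoding')) {
    try {
      return mb_convert_encoding($data, 'UTF-8', $charset);
    }
    catch (\Throwable $e) {
      // Unknown charset name; fall through and report failure.
    }
  }

  return FALSE;
}

// Usage: Windows-1252 curly quotes (bytes 0x93 and 0x94) from a mail body.
$utf8 = example_convert_to_utf8("\x93quoted\x94", 'Windows-1252');
```

Both the feed parser and mailhandler could then call the same function, and a site without iconv would get FALSE back instead of a fatal error.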
Cheers, Gerhard