[drupal-support] Charset problem

Gerhard Killesreiter killesreiter at physik.uni-freiburg.de
Wed Mar 2 01:03:07 UTC 2005



On Tue, 1 Mar 2005, Steve Dondley wrote:

>
> > A properly set up drupal site should handle any characters through
> > UTF-8 and correctly convert everything into it. If you have old
> > content, convert it to UTF-8 with iconv before importing. Oh and note
> > that stuff like "curly quotes" is actually not ISO-8859-1, it's
> > Windows-1252.
> >
> Yes, I understand the distinction between ISO-8859-x and Windows-1252.
>
> At any rate, the content is coming in from e-mails handled via the
> mailhandler modules (which are often written by people using MS
> applications).  These are not converted to standard UTF-8 format as best
> as I can tell.

Indeed they aren't.

> I have already decided against changing the output
> format to ISO-8859-1 and changed it back into UTF-8.  Instead, I'm in
> the middle of addressing this problem by inserting conversion functions
> into the mailhandler module.  However, PHP needs to be compiled
> "with-iconv" in order for this to work.  This can be a problem for some
> users.
>
> But you say that Drupal already handles this conversion.  I'm not so
> sure about that.  I did a grep on 'iconv' and only place I saw it used
> was in an xml parser function in common.inc for the feeds.

Right. It would help if that part of the function was in a separate
function. Then mailhandler could call this function.

Cheers,
	Gerhard



More information about the drupal-support mailing list