[pmwiki-users] Pagenames with accented characters

Petko Yotov 5ko at 5ko.fr
Sat Jul 18 16:45:53 CDT 2009


On Saturday 18 July 2009 21:50:12 DaveG wrote:
> Petko Yotov wrote:
> > Make sure the config.php file is saved in the same encoding in which the
> > wiki is. If your wiki is in the default Latin-1 (a.k.a. iso8859-1), the
> > config.php encoding should be the same, and not Unicode/UTF-8.
...
> Is this documented somewhere? If so I couldn't find it. If not, where is
> a good place, and I'll write it up? I'm not familiar enough with i18n
> stuff to know which page is appropriate.

Probably on the pages that cover config.php, farmconfig.php and Group/Page 
customizations, and in the Cookbook, on the page of the recipe itself.

> It seems a little obscure that 
> saving config.php is what determines if the i18n cookbook works right --
> is that a 'normal' thing for i18n stuff.

If we want PmWiki to catch and convert some characters, they should be encoded 
the same way in config.php and in the wiki page, or it cannot find them.
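
To make the mismatch concrete, here is a minimal sketch in Python (not 
PmWiki's own PHP code). The function name pattern_found is hypothetical and 
only illustrates a byte-level search; it assumes the pattern is stored as 
saved in config.php and the page text as saved by the wiki.

```python
# Hypothetical illustration, not PmWiki code: a byte-level search only
# succeeds when the pattern and the page text use the same encoding.

def pattern_found(pattern: str, config_encoding: str,
                  page_text: str, page_encoding: str) -> bool:
    """Look for the pattern's bytes (as saved in config.php) in the page's bytes."""
    pattern_bytes = pattern.encode(config_encoding)
    page_bytes = page_text.encode(page_encoding)
    return pattern_bytes in page_bytes

page = "Café"

# The same letter is one byte in Latin-1 and two bytes in UTF-8.
print("é".encode("latin-1"))  # b'\xe9'
print("é".encode("utf-8"))    # b'\xc3\xa9'

# Same encoding on both sides: the character is found.
print(pattern_found("é", "latin-1", page, "latin-1"))  # True
# config.php saved as UTF-8, wiki in Latin-1: the bytes never match.
print(pattern_found("é", "utf-8", page, "latin-1"))    # False
```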

Newer operating systems like GNU/Linux, FreeBSD and Mac OS X generally default 
to saving text files in Unicode/UTF-8; in most versions of Windows, as far as 
I know, the default is CP1252, which is almost the same as Latin-1.
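
How close CP1252 and Latin-1 are can be checked with Python's built-in 
codecs. This is only a sketch; the helper name differing_bytes is 
hypothetical, and bytes that Python's cp1252 codec leaves undefined are 
counted as differing.

```python
def differing_bytes() -> list:
    """Return the byte values that CP1252 and Latin-1 decode differently."""
    differing = []
    for value in range(256):
        raw = bytes([value])
        latin1_char = raw.decode("latin-1")  # defined for all 256 byte values
        try:
            cp1252_char = raw.decode("cp1252")
        except UnicodeDecodeError:
            cp1252_char = None  # byte left undefined by Python's cp1252 codec
        if cp1252_char != latin1_char:
            differing.append(value)
    return differing

diff = differing_bytes()
# All differences sit in the 0x80-0x9F block (curly quotes, euro sign, ...).
print(hex(min(diff)), hex(max(diff)))  # 0x80 0x9f

# Accented Western European letters are encoded identically in both.
print("é".encode("cp1252") == "é".encode("latin-1"))  # True
```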

PmWiki's default is Latin-1, which works fine for English and most West-EU 
languages, but works neither for Central-EU languages (Czech, Polish) nor for 
other alphabets (Cyrillic, Greek, Arabic, Hebrew, Chinese, Korean...).
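
A quick way to see which text fits into Latin-1 is to try encoding it. The 
sketch below uses Python; the helper name fits_in_latin1 and the sample 
words are assumptions chosen for illustration.

```python
def fits_in_latin1(text: str) -> bool:
    """Return True if every character of text has a Latin-1 byte value."""
    try:
        text.encode("latin-1")
        return True
    except UnicodeEncodeError:
        return False

# West-EU, Central-EU (Polish), Cyrillic and Greek samples.
for word in ["Café", "Łódź", "Привет", "Ελληνικά"]:
    status = "fits" if fits_in_latin1(word) else "does not fit"
    # UTF-8 can encode every sample, at the cost of more bytes per character.
    print(word, status, "in Latin-1;", len(word.encode("utf-8")), "bytes in UTF-8")
```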

We should work towards defaulting PmWiki to Unicode/UTF-8, which supports all 
alphabets and languages, but converting an existing site is not trivial.

> (I don't think I've ever had to 
> even think about what file encoding type I use before.)

You have been lucky :-) If you primarily write in a language that uses the 
Latin (Roman) alphabet with no accented letters, any of these encodings 
should be fine.
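
The reason is that unaccented Latin letters are plain ASCII, and ASCII bytes 
are the same in Latin-1, CP1252 and UTF-8. A small Python sketch, with the 
hypothetical helper same_bytes_everywhere:

```python
def same_bytes_everywhere(text: str,
                          encodings=("latin-1", "cp1252", "utf-8")) -> bool:
    """Return True if text encodes to identical bytes in all given encodings."""
    encoded = {text.encode(name) for name in encodings}
    return len(encoded) == 1

print(same_bytes_everywhere("Main.HomePage"))  # True
print(same_bytes_everywhere("Main.Café"))      # False
```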

Petko


