[pmwiki-users] Re: Translation [was pmwiki-users] i18n and iso-8859-13
Algis Kabaila
akabaila at pcug.org.au
Thu Apr 7 23:41:39 CDT 2005
On Friday 08 April 2005 14:25, Patrick R. Michaud wrote:
> On Thu, Apr 07, 2005 at 06:53:12PM +1000, Algis Kabaila wrote:
> > Actually, I would like to add (in
> > Lithuanian) simple instructions of making a site accept the Lithuanian
> > charactes and operate with utf-8 encoding, if there are no objections.
>
> If you're asking about this for the PmWikiLt.* group on pmwiki.org,
> there aren't any objections. My general rule is that whoever is doing
> the work of building/maintaining the language translations gets to
> decide the contents and structure of those translations. :-)
OK, I assume that I am the "chief translator" and will put the comment
about installation of XLPageLt into a site RSN. (My RSN is a bit slower
these days than it used to be).
>
> > Translation of some words, viz. "by" is difficult because its meaning is
> > dependent on context. I thought that the best was not to attempt
> > translation of "by" at all, in order to avoid the "hydraulic hammer"
> > becoming "water sheep". I see that a note on the XLPage tells to delete
> > items that are not translated. Can we just comment it out with # or will
> > that not work?
>
> It can be commented out with #, left empty, or even removed altogether.
>
> > Also, would you mind if I ask a 'non-wiki' question - how to
> > correctly specify utf-8 encoding in a web page? (I have used meta
> > tags under the wrong impression that this was a standard way
> > of doing it. Currently my **wrong** header looks like this:
> >
> > <meta content="text/html; charset=utf-8" http-equiv="content-type"> ).
>
> Well, meta tags are just one way of doing it. The HTML 4.01 standard
> identifies three mechanisms, highest priority to lowest:
>
> 1. An HTTP charset parameter in a Content-Type field
> 2. A <meta> declaration with "http-equiv" set to "Content-Type"
> 3. The charset attribute set on an element specifying an external
resource
>
> So, the best and recommended way to specify the encoding is in the
> webserver configuration itself, not in the individual web pages.
> Unfortunately in your case, Apache was (mis)configured to always
> specifying a charset parameter, which overrides any <meta> tag
> the HTML document might have in it.
>
> Anyway, to answer your question, the standard, W3C-recommended way to
> specify utf-8 of documents is to configure Apache to do it. There's
> several ways to do this. If all of the documents in a directory are
> known to be utf-8, then the .htaccess file can contain
>
> AddDefaultCharset UTF-8
>
> If Apache has been configured with content negotiation and the standard
> charsets, then one can also specify utf-8 for a document by adding a
> ".utf8" extension (no hyphen) onto its filename, such as "mydoc.html.utf8".
> Other encodings might use ".iso8859-1", ".cp-1251", etc.
>
> However, if it's not feasible to configure Apache for the charset
> encodings, then one can tell Apache not to do send any charset
> parameter with
>
> AddDefaultCharset Off
>
> and then the browser will use any encoding specified by the document's
> <meta http-equiv="Content-Type" ... /> tag.
>
> Hope this helps,
>
> Pm
Thank you for your comprehensive explanation of headers. I can not change the
Apache config on our ISP (PCUG-TIP), but I certainly can do it on my own
HAN. Basically, I am quite happy with the way the two pmwikis that I manage
work and reasonably satisfied with my home pages, too. I just can not stop
being curious about the rights and the wrongs of this world...
Gratefully,
Al.
>
--
Algis Kabaila
http://www.pcug.org.au/~akabaila
More information about the pmwiki-users
mailing list