* setup2.sgml (setup-locale-ov): Align description of working modifiers

to latest changes.
This commit is contained in:
Corinna Vinschen 2010-02-06 21:41:05 +00:00
parent 28d3c4fa38
commit 573df59cd4
2 changed files with 40 additions and 21 deletions

View File

@ -1,3 +1,8 @@
2010-02-06 Corinna Vinschen <corinna@vinschen.de>
* setup2.sgml (setup-locale-ov): Align description of working modifiers
to latest changes.
2010-02-06 Corinna Vinschen <corinna@vinschen.de> 2010-02-06 Corinna Vinschen <corinna@vinschen.de>
* new-features.sgml (ov-new1.7.2): Add support for new charsets. * new-features.sgml (ov-new1.7.2): Add support for new charsets.

View File

@ -272,33 +272,47 @@ ignored for now.</para>
<itemizedlist mark="bullet"> <itemizedlist mark="bullet">
<listitem><para>For languages which default to one of the ISO-8859 character <listitem><para>For languages which default to the ISO-8859-1 character
sets, the modifier "@euro" can be added to enforce usage of the ISO-8859-15 set, the modifier "@euro" can be added to enforce usage of the ISO-8859-15
character set, which includes a character for the "Euro" currency sign .</para> character set, which includes a character for the "Euro" currency sign.
</listitem> Beware, that also works for non-european locales.
<listitem><para>
The default charset of the "uz_UZ" locale is ISO-8859-1. With the "@cyrillic"
modifier it's UTF-8.
</para></listitem> </para></listitem>
<listitem><para> <listitem><para>
The default charset of the "tt_RU" locale is ISO-8859-5. With the "@iqtelif" The default script used for all Serbian language locales (sr_BA, sr_ME, sr_RS,
modifier it's UTF-8. and the deprecated sr_CS and sr_SP) is cyrillic. With the "@latin" modifier
it gets switched to the latin script with the respective collation behaviour.
</para></listitem> </para></listitem>
<listitem><para>There's a class of characters in the Unicode character set, <listitem><para>
called the "CJK Ambiguous Width Character set". For these characters the width The default charset of the "be_BY" locale (Belarusian/Belarus) is CP1251.
With the "@latin" modifier it's UTF-8.
</para></listitem>
<listitem><para>
The default charset of the "tt_RU" locale (Tatar/Russia) is ISO-8859-5.
With the "@iqtelif" modifier it's UTF-8.
</para></listitem>
<listitem><para>
The default charset of the "uz_UZ" locale (Uzbek/Uzbekistan) is ISO-8859-1.
With the "@cyrillic" modifier it's UTF-8.
</para></listitem>
<listitem><para>
There's a class of characters in the Unicode character set, called the
"CJK Ambiguous Width Character set". For these characters the width
returned by the wcwidth/wcswidth function is usually 1. This is often a returned by the wcwidth/wcswidth function is usually 1. This is often a
problem in East-Asian languages, which historically use character sets in problem in East-Asian languages, which historically use character sets
which these characters have a width of 2. By default, the wcwidth/wcswidth in which these characters have a width of 2. By default, the
functions return 1 as the width of these characters, except if the language is wcwidth/wcswidth functions return 1 as the width of these characters,
specifed as "ja" (Japanese), "ko" (Korean), or "zh" (Chinese). In these except if the language is specifed as "ja" (Japanese), "ko" (Korean), or
languages wcwidth and wcswidth return 2 for these characters. This is not "zh" (Chinese). In these languages wcwidth and wcswidth return 2 for
correct in all circumstances, so the user of one of these languages can specify these characters. This is not correct in all circumstances, so the user
the modifier "@cjknarrow", which modifies the behaviour of wcwidth/wcswidth to of one of these languages can specify the modifier "@cjknarrow", which
return 1 for the ambiguous width characters.</para> modifies the behaviour of wcwidth/wcswidth to return 1 for the ambiguous
</listitem> width characters.
</para></listitem>
</itemizedlist> </itemizedlist>