* cygwinenv.sgml: Move "codepage:xxx" to the removed options section.
Change text accordingly. * new-features.sgml: Try to explain new way to define character sets.
This commit is contained in:
parent
161211d186
commit
1c6743b74d
@ -1,3 +1,9 @@
|
|||||||
|
2009-03-24 Corinna Vinschen <corinna@vinschen.de>
|
||||||
|
|
||||||
|
* cygwinenv.sgml: Move "codepage:xxx" to the removed options section.
|
||||||
|
Change text accordingly.
|
||||||
|
* new-features.sgml: Try to explain new way to define character sets.
|
||||||
|
|
||||||
2009-03-18 Corinna Vinschen <corinna@vinschen.de>
|
2009-03-18 Corinna Vinschen <corinna@vinschen.de>
|
||||||
|
|
||||||
* cygwin-ug-net.in.sgml: Update date.
|
* cygwin-ug-net.in.sgml: Update date.
|
||||||
|
@ -11,29 +11,6 @@ by prefixing with <literal>no</literal>.</para>
|
|||||||
|
|
||||||
<itemizedlist mark="bullet">
|
<itemizedlist mark="bullet">
|
||||||
|
|
||||||
<listitem>
|
|
||||||
<para><envar>codepage:[ansi|oem|utf8]</envar> - This option controls
|
|
||||||
which single- or multibyte character set is used for file and console
|
|
||||||
operations. Windows is using UTF-16 characters internally and this
|
|
||||||
option specifies how 8-byte character sets are converted to UTF-16 and
|
|
||||||
vice versa. The default setting is <envar>ansi</envar> which means,
|
|
||||||
conversion is based on the current ANSI codepage, typically 1252 in
|
|
||||||
many Western language versions of Windows. The name originates from the
|
|
||||||
ANSI Latin1 (ISO 8859-1) standard, used in Windows 1.0, though the
|
|
||||||
character sets have since diverged from any standard. The second
|
|
||||||
setting selects an older, DOS-based character set, containing various
|
|
||||||
line drawing and special characters. It is called <envar>oem</envar>
|
|
||||||
since it was originally encoded in the firmware of IBM PCs by original
|
|
||||||
equipment manufacturers (OEMs).</para>
|
|
||||||
<para>If you find that some characters (especially non-US or 'graphical' ones)
|
|
||||||
do not display correctly in Cygwin, you can use this option to select an
|
|
||||||
appropriate codepage. Finally, <envar>utf8</envar> treats all file names
|
|
||||||
and console characters as UTF-8 chars. Please note that, for correct
|
|
||||||
operation, you have to set the environment variable LANG or LC_ALL to
|
|
||||||
somthing like "en_US.UTF-8", otherwise many applications will not be
|
|
||||||
able to recognize UTF-8 strings correctly.</para>
|
|
||||||
</listitem>
|
|
||||||
|
|
||||||
<listitem>
|
<listitem>
|
||||||
<para><envar>(no)dosfilewarning</envar> - If set, Cygwin will warn the
|
<para><envar>(no)dosfilewarning</envar> - If set, Cygwin will warn the
|
||||||
first time a user uses an "MS-DOS" style path name rather than a POSIX-style
|
first time a user uses an "MS-DOS" style path name rather than a POSIX-style
|
||||||
@ -194,6 +171,16 @@ information, read the documentation in <xref linkend="mount-table"></xref> and
|
|||||||
<xref linkend="pathnames-casesensitive"></xref>.</para>
|
<xref linkend="pathnames-casesensitive"></xref>.</para>
|
||||||
</listitem>
|
</listitem>
|
||||||
|
|
||||||
|
<listitem>
|
||||||
|
<para><envar>codepage:[ansi|oem]</envar> - This option controled
|
||||||
|
which character set is used for file and console operations. Since Cygwin
|
||||||
|
is now doing all character conversion by itself, depending on the
|
||||||
|
application call to the <function>setlocale()</function> function, and in
|
||||||
|
turn by the setting of the environment variables <envar>$LANG</envar>,
|
||||||
|
<envar>$LC_ALL</envar>, or <envar>$LC_CTYPE</envar>, this setting
|
||||||
|
got useless.</para>
|
||||||
|
</listitem>
|
||||||
|
|
||||||
<listitem>
|
<listitem>
|
||||||
<para><envar>(no)ntea</envar> - This option has been removed since it
|
<para><envar>(no)ntea</envar> - This option has been removed since it
|
||||||
only fakes security which is considered dangerous and useless. It also
|
only fakes security which is considered dangerous and useless. It also
|
||||||
|
@ -17,13 +17,18 @@
|
|||||||
are only local to the current session and disappear when the last
|
are only local to the current session and disappear when the last
|
||||||
Cygwin process in the session exits.
|
Cygwin process in the session exits.
|
||||||
|
|
||||||
|
- If a filename cannot be represented in the current character set,
|
||||||
|
the character will be converted to a sequence Ctrl-N + UTF-8 representation
|
||||||
|
of the character. This allows to access all files, even those not
|
||||||
|
having a valid representation of their filename in the current character
|
||||||
|
set (codepage). To have always a valid string, use the UTF-8 charset
|
||||||
|
by setting the environment variable $LANG, $LC_ALL, or $LC_CTYPE to a
|
||||||
|
valid POSIX value, for instance in Cygwin.bat like this:
|
||||||
|
|
||||||
|
set LC_CTYPE=en_US.UTF-8
|
||||||
|
|
||||||
- PATH_MAX is now 4096. Internally, path names can be as long as the
|
- PATH_MAX is now 4096. Internally, path names can be as long as the
|
||||||
underlying OS can handle (32K).
|
underlying OS can handle (32K).
|
||||||
|
|
||||||
- UTF-8 filenames are supported now. So far, this requires to set
|
|
||||||
the environment variable CYGWIN to contain "codepage:utf8". but this
|
|
||||||
will likely disappear at one point. The setting of $LANG or $LC_CTYPE
|
|
||||||
will be used instead.
|
|
||||||
|
|
||||||
- struct dirent now supports d_type, filled out with DT_REG or DT_DIR.
|
- struct dirent now supports d_type, filled out with DT_REG or DT_DIR.
|
||||||
All other file types return as DT_UNKNOWN for performance reasons.
|
All other file types return as DT_UNKNOWN for performance reasons.
|
||||||
@ -176,6 +181,19 @@
|
|||||||
<sect2 id="ov-new1.7-posix"><title>Other POSIX related changes</title>
|
<sect2 id="ov-new1.7-posix"><title>Other POSIX related changes</title>
|
||||||
|
|
||||||
<screen>
|
<screen>
|
||||||
|
- A lot of character sets are supported now via a call to setlocale().
|
||||||
|
The setting of the environment variables $LANG, $LC_ALL or $LC_CTYPE will
|
||||||
|
be used. For instance, setting $LANG to "de_DE.ISO-8859-15" before
|
||||||
|
starting a Cygwin session will use the ISO-8859-15 character set in
|
||||||
|
the entire session. UTF-8 is supported as well, as in "en_US.UTF-8".
|
||||||
|
|
||||||
|
The full list of supported character sets: "ASCII", "ISO-8859-x" with x
|
||||||
|
in 1-16, except 12, "UTF-8", Windows codepages "CPxxx", with xxx in
|
||||||
|
(437, 720, 737, 775, 850, 852, 855, 857, 858, 862, 866, 874, 1125,
|
||||||
|
1250, 1251, 1252, 1253, 1254, 1255, 1256, 1257, 1258), "JIS", "SJIS",
|
||||||
|
"eucJP", "Big5". The leading language and territory part (en_US) is not
|
||||||
|
used by Cygwin yet, but is required for POSIX compatibility.
|
||||||
|
|
||||||
- Allow multiple concurrent read locks per thread for pthread_rwlock_t.
|
- Allow multiple concurrent read locks per thread for pthread_rwlock_t.
|
||||||
|
|
||||||
- Implement pthread_kill(thread, 0) as per POSIX.
|
- Implement pthread_kill(thread, 0) as per POSIX.
|
||||||
|
Loading…
x
Reference in New Issue
Block a user