newlib

Author	SHA1	Message	Date
Corinna Vinschen	4b97244d12	Cygwin: define pthread_tryjoin_np/pthread_timedjoin_np _GNU_VISIBLE These functions are GNU extensions. Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-06-27 18:20:47 +02:00
Corinna Vinschen	c9d6787e76	Cygwin: doc: add pthread_tryjoin_np, pthread_timedjoin_np Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-06-27 18:20:11 +02:00
Corinna Vinschen	cb3ddf9e2a	Cygwin: pthread_timedjoin_np: return ETIMEDOUT, not EBUSY pthread_timedjoin_np returns ETIMEDOUT if a thread is still running, not EBUSY as pthread_tryjoin_np. Also, clean up initializing timeout in pthread_tryjoin_np. Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-06-27 18:19:18 +02:00
Corinna Vinschen	732e0b395d	Cygwin: Implement pthread_tryjoin_np and pthread_timedjoin_np - Move pthread_join to thread.cc to have all `join' calls in the same file (pthread_timedjoin_np needs pthread_convert_abstime which is static inline in thread.cc) - Bump API version Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-06-27 17:56:59 +02:00
Corinna Vinschen	006520ca2b	newlib: enable new math functions on Cygwin Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-06-27 15:53:51 +02:00
Szabolcs Nagy	b99d49e506	New pow implementation The new implementation is provided under !__OBSOLETE_MATH, it uses ISO C99 code. With default settings the worst case error in nearest rounding mode is 0.54 ULP with inlined fma and fma contraction. It uses a 4 KB lookup table in addition to the table in exp_data.c, on aarch64 .text+.rodata size of libm.a is increased by 2295 bytes. Improvements on Cortex-A72: latency: 3.3x thruput: 4.9x	2018-06-27 15:40:49 +02:00
Szabolcs Nagy	07e2c32828	New log2 implementation The new implementation is provided under !__OBSOLETE_MATH, it uses ISO C99 code. With default settings the worst case error in nearest rounding mode is 0.547 ULP with inlined fma and fma contraction. It uses a 1 KB lookup table, on aarch64 .text+.rodata size of libm.a is increased by 1584 bytes. Note that the math.h header defines log2(x) to be log(x)/Ln2, this is not changed, so the new code is only used if that macro is suppressed. Improvements on Cortex-A72: latency: 2.0x thruput: 2.2x	2018-06-27 15:40:49 +02:00
Szabolcs Nagy	e5791079c6	New log implementation The new implementations are provided under !__OBSOLETE_MATH, it uses ISO C99 code. With default settings the worst case error in nearest rounding mode is 0.519 ULP with inlined fma and fma contraction. It uses a 2 KB lookup table, on aarch64 .text+.rodata size of libm.a is increased by 1703 bytes. The w_log.c wrapper is disabled since error handling is inline in the new code. New __HAVE_FAST_FMA and __HAVE_FAST_FMA_DEFAULT feature macros were added to enable selecting between the code path that uses fma and the one that does not. Targets supposed to set __HAVE_FAST_FMA_DEFAULT if they have single instruction fma and the compiler can actually inline it (gcc has __FP_FAST_FMA macro but that does not guarantee inlining with -fno-builtin-fma). Improvements on Cortex-A72: latency: 1.9x thruput: 2.3x	2018-06-27 15:40:49 +02:00
Szabolcs Nagy	fb929067db	New exp and exp2 implementations The new implementations are provided under !__OBSOLETE_MATH, they use ISO C99 code. There are several settings, with the default one the worst case error in nearest rounding mode is 0.509 ULP for exp and 0.507 ULP for exp2 when a multiply and add is contracted into an fma. They use a shared 2 KB lookup table, on aarch64 .text+.rodata size of libm.a is increased by 1868 bytes. The w_*.c wrappers are disabled for the new code as it takes care of error handling inline. The old exp2(x) code used to be just pow(2,x) so the speedup there is more significant. The file name has no special prefix to avoid any name collision with existing files. Improvements on Cortex-A72: exp latency: 3.2x exp thruput: 4.1x exp2 latency: 7.8x exp2 thruput: 18.8x	2018-06-27 15:40:49 +02:00
Szabolcs Nagy	cfbcbd1c95	Use uint32_t sign argument to math error functions This change is equivalent to the commit `c65db17340` and only affects code that is from the Arm optimized-routines project. It does not affect the observable behaviour, but the code generation can be different on 64bit targets. The intention is to make the portable semantics of the code obvious by using a fixed size type.	2018-06-27 15:40:49 +02:00
Corinna Vinschen	6497fdfaf4	Cygwin: fix bumptious GCC 7 warnings Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-06-26 17:20:48 +02:00
Corinna Vinschen	6c55be9dbb	Cygwin: Allow to build without experimental AF_UNIX code by default Introduce __WITH_AF_UNIX preprocessor flag to enable the new code Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-06-26 16:31:17 +02:00
Corinna Vinschen	17918cc6a6	Cygwin: add Unicode patch to release notes Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-06-26 10:21:18 +02:00
Takashi Yano	048490485a	Fix Unicode table. * (mkcategories): Fix a bug that outputs incorrect Unicode category table for code point ranges. * (categories.t): Rebuild it using the bug-fixed mkcategories. This fixes the problem reported in the following post. https://cygwin.com/ml/cygwin/2018-06/msg00248.html	2018-06-26 10:19:12 +02:00
Corinna Vinschen	b14daac482	Revert "Remove -fno-builtin to allow gcc to inline functions such as fabs, floor, creal, imag." This reverts commit `c077b9de99`. Yet another accidental commit...	2018-06-26 10:17:04 +02:00
Corinna Vinschen	dbe905c140	Cygwin: exceptions: fix FPE exception flags The FPE flags for divisions by zero were not implemented Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-06-26 10:12:19 +02:00
Corinna Vinschen	3dc89bbafe	Cygwin: signal.h: improve exception flags definition - add numbers for readability - add a preprocessor macro for each flag Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-06-26 10:12:10 +02:00
Jon Beniston	c077b9de99	Remove -fno-builtin to allow gcc to inline functions such as fabs, floor, creal, imag.	2018-06-25 13:31:51 +02:00
Takashi Yano	9c84bfd479	Fix the handling of out-of-band (OOB) data in a socket. * fhandler.h (class fhandler_socket_inet): Add variable bool oobinline. * fhandler_socket_inet.cc (fhandler_socket_inet::fhandler_socket_inet): Initialize variable oobinline. (fhandler_socket_inet::recv_internal): Make the handling of OOB data as consistent with POSIX as possible. Add simulation of inline mode for OOB data as a workaround for broken winsock behavior. (fhandler_socket_inet::setsockopt): Ditto. (fhandler_socket_inet::getsockopt): Ditto. (fhandler_socket_wsock::ioctl): Fix return value of SIOCATMARK command. The return value of SIOCATMARK of winsock is almost opposite to expectation. * fhandler_socket_local.cc (fhandler_socket_local::recv_internal): Remove the handling of OOB data from AF_LOCAL domain socket. Operation related to OOB data will result in an error like Linux does. (fhandler_socket_local::sendto): Ditto. (fhandler_socket_local::sendmsg): Ditto. This fixes the issue reported in following post. https://cygwin.com/ml/cygwin/2018-06/msg00143.html	2018-06-22 10:20:08 +02:00
Wilco Dijkstra	3baadb9912	Improve performance of sinf/cosf/sincosf Here is the correct patch with both filenames and int cast fixed: This patch is a complete rewrite of sinf, cosf and sincosf. The new version is significantly faster, as well as simple and accurate. The worst-case ULP is 0.56072, maximum relative error is 0.5303p-23 over all 4 billion inputs. In non-nearest rounding modes the error is 1ULP. The algorithm uses 3 main cases: small inputs which don't need argument reduction, small inputs which need a simple range reduction and large inputs requiring complex range reduction. The code uses approximate integer comparisons to quickly decide between these cases - on some targets this may be slow, so this can be configured to use floating point comparisons. The small range reducer uses a single reduction step to handle values up to 120.0. It is fastest on targets which support inlined round instructions. The large range reducer uses integer arithmetic for simplicity. It does a 32x96 bit multiply to compute a 64-bit modulo result. This is more than accurate enough to handle the worst-case cancellation for values close to an integer multiple of PI/4. It could be further optimized, however it is already much faster than necessary. Simple benchmark showing speedup factor on AArch64 for various ranges: range 0.7853982 sinf 1.7 cosf 2.2 sincosf 2.8 range 1.570796 sinf 1.9 cosf 1.9 sincosf 2.7 range 3.141593 sinf 2.0 cosf 2.0 sincosf 3.5 range 6.283185 sinf 2.3 cosf 2.3 sincosf 4.2 range 125.6637 sinf 2.9 cosf 3.0 sincosf 5.1 range 1.1259e15 sinf 26.8 cosf 26.8 sincosf 45.2 ChangeLog: 2018-05-18 Wilco Dijkstra <wdijkstr@arm.com> * newlib/libm/common/Makefile.in: Regenerated. * newlib/libm/common/Makefile.am: Add sinf.c, cosf.c, sincosf.c sincosf.h, sincosf_data.c. Add -fbuiltin -fno-math-errno to CFLAGS. * newlib/libm/common/math_config.h: Add HAVE_FAST_ROUND, HAVE_FAST_LROUND, roundtoint, converttoint, force_eval_float, force_eval_double, eval_as_float, eval_as_double, likely, unlikely. * newlib/libm/common/cosf.c: New file. * newlib/libm/common/sinf.c: Likewise. * newlib/libm/common/sincosf.h: Likewise. * newlib/libm/common/sincosf.c: Likewise. * newlib/libm/common/sincosf_data.c: Likewise. * newlib/libm/math/sf_cos.c: Add #if to build conditionally. * newlib/libm/math/sf_sin.c: Likewise. * newlib/libm/math/wf_sincos.c: Likewise. --	2018-06-21 09:37:04 +02:00
Corinna Vinschen	cfe8c6c504	Revert "Improve performance of sinf/cosf/sincosf" This reverts commit `fca80a9d1b`. Accidentally pushed a preliminary version	2018-06-21 09:36:39 +02:00
Jon Beniston	b7d9d27b0e	libm/common/s_round.c (round): Add cast for 16-bit CPUs	2018-06-21 09:31:13 +02:00
Wilco Dijkstra	fca80a9d1b	Improve performance of sinf/cosf/sincosf This patch is a complete rewrite of sinf, cosf and sincosf. The new version is significantly faster, as well as simple and accurate. The worst-case ULP is 0.56072, maximum relative error is 0.5303p-23 over all 4 billion inputs. In non-nearest rounding modes the error is 1ULP. The algorithm uses 3 main cases: small inputs which don't need argument reduction, small inputs which need a simple range reduction and large inputs requiring complex range reduction. The code uses approximate integer comparisons to quickly decide between these cases - on some targets this may be slow, so this can be configured to use floating point comparisons. The small range reducer uses a single reduction step to handle values up to 120.0. It is fastest on targets which support inlined round instructions. The large range reducer uses integer arithmetic for simplicity. It does a 32x96 bit multiply to compute a 64-bit modulo result. This is more than accurate enough to handle the worst-case cancellation for values close to an integer multiple of PI/4. It could be further optimized, however it is already much faster than necessary. Simple benchmark showing speedup factor on AArch64 for various ranges: range 0.7853982 sinf 1.7 cosf 2.2 sincosf 2.8 range 1.570796 sinf 1.9 cosf 1.9 sincosf 2.7 range 3.141593 sinf 2.0 cosf 2.0 sincosf 3.5 range 6.283185 sinf 2.3 cosf 2.3 sincosf 4.2 range 125.6637 sinf 2.9 cosf 3.0 sincosf 5.1 range 1.1259e15 sinf 26.8 cosf 26.8 sincosf 45.2 ChangeLog: 2018-06-18 Wilco Dijkstra <wdijkstr@arm.com> * newlib/libm/common/Makefile.in: Regenerated. * newlib/libm/common/Makefile.am: Add sinf.c, cosf.c, sincosf.c sincosf.h, sincosf_data.c. Add -fbuiltin -fno-math-errno to CFLAGS. * newlib/libm/common/math_config.h: Add HAVE_FAST_ROUND, HAVE_FAST_LROUND, roundtoint, converttoint, force_eval_float, force_eval_double, eval_as_float, eval_as_double, likely, unlikely. * newlib/libm/common/cosf.c: New file. * newlib/libm/common/sinf.c: Likewise. * newlib/libm/common/sincosf.h: Likewise. * newlib/libm/common/sincosf.c: Likewise. * newlib/libm/common/sincosf_data.c: Likewise. * newlib/libm/math/sf_cos.c: Add #if to build conditionally. * newlib/libm/math/sf_sin.c: Likewise. * newlib/libm/math/wf_sincos.c: Likewise. --	2018-06-19 09:44:28 +02:00
Thomas Kindler	9dd3c3b0ad	newlib: getopt now permutes multi-flag options correctly Previously, "test 1 2 3 -a -b -c" was permuted to "test -a -b -c 1 2 3", but "test 1 2 3 -abc" was left as "test 1 2 3 -abc". Signed-off-by: Thomas Kindler <mail+newlib@t-kindler.de>	2018-06-18 18:45:44 +02:00
Ken Brown	ebc9171ede	Bump Cygwin DLL version to 2.11.0	2018-06-07 09:42:36 +02:00
Ken Brown	2ea436b433	Cygwin: Document clearenv and bump API minor Also add earlier "What changed" items to new-features.xml.	2018-06-07 09:42:36 +02:00
Ken Brown	3a049236db	Cygwin: Remove workaround in environ.cc Commit `ebd645e` on 2001-10-03 made environ.cc:_addenv() add unneeded space at the end of the environment block to "work around problems with some buggy applications." This clutters the code and is presumably no longer needed.	2018-06-07 09:42:36 +02:00
Ken Brown	defaa2ca31	Cygwin: Implement the GNU extension clearenv	2018-06-07 09:42:36 +02:00
Ken Brown	9234545e3d	Cygwin: Allow the environment pointer to be NULL Following glibc, interpret this as meaning the environment is empty.	2018-06-07 09:42:36 +02:00
Ken Brown	1ecbb8d7b7	Cygwin: Clarify some code in environ.cc	2018-06-07 09:42:36 +02:00
Ken Brown	a7c23d109f	Cygwin: Add pthread_rwlock_* fix to release notes	2018-06-01 21:59:42 +02:00
Ken Brown	59847b5d73	Declare the pthread_rwlock_* functions if __cplusplus >= 201402L Some of these functions are used in the <shared_mutex> C++ header.	2018-06-01 12:09:12 +02:00
Corinna Vinschen	8ac6b15487	Cygwin: Add stack alignment crash after fork fix to release notes Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-05-29 18:43:20 +02:00
Sergejs Lukanihins	06797545b3	Cygwin: Fixing the math behind rounding down ch.stacklimit to page size.	2018-05-29 18:37:33 +02:00
Corinna Vinschen	53960db861	Cygwin: Add Sergejs Lukanihins to contributors Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-05-29 18:34:54 +02:00
Corinna Vinschen	efade43bd5	Cygwin: Add buffer underrun fix to release notes Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-05-29 18:31:07 +02:00
Corinna Vinschen	35998fc2fa	Cygwin: normalize_win32_path: Avoid buffer underruns Thanks to Ken Harris <Ken.Harris@mathworks.com> for the diagnosis. When backing up tail to handle a "..", the code only checked that it didn't underrun the destination buffer while removing path components. It did not take into account that the first backslash in the path had to be kept intact. Example path to trigger the problem: "C:\A..\..\..\B' Fix this by moving the dst pointer to the first backslash so subsequent tests cannot underrun this position. Also make sure that we always have a backslash. Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-05-29 18:23:14 +02:00
Corinna Vinschen	7d00a5e320	Cygwin: TEST only: Add a buffer underrun assertion to symlink_info::check Thanks to Ken Harris <Ken.Harris@mathworks.com> for the diagnosis which led to a buffer underrun in this loop. Revert before release. Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-05-29 18:23:14 +02:00
Jeff Johnston	4a3d0a5a5d	Fix issue with malloc_extend_top - when calculating a correction to align next brk to page boundary, ensure that the correction is less than a page size - if allocating the correction fails, ensure that the top size is set to brk + sbrk_size (minus any front alignment made) Signed-off-by: Jeff Johnston <jjohnstn@redhat.com>	2018-05-29 10:16:48 -04:00
Matthias Kannwischer	fcfea0ae2d	fix llrint and lrint for 52 <= exponent <= 62	2018-05-29 15:59:48 +02:00
Freddie Chopin	3305f35570	Fix 32-bit overflow in mktime() when time_t is 64-bits long When converting number of days since epoch (32-bits) to seconds, calculations using 32-bit `long` overflow for years above 2038. Solve this by casting number of days to `time_t` just before final multiplication. Signed-off-by: Freddie Chopin <freddie.chopin@gmail.com>	2018-05-29 15:27:03 +02:00
Jeff Johnston	e928275566	Use _LDBL_EQ_DBL in nexttowardf.c 2018-05-07 Tom de Vries <tom@codesourcery.com> * libm/common/nexttowardf.c: Use _LDBL_EQ_DBL instead of _LDBL_EQ_DOUBLE.	2018-05-07 12:22:12 -04:00
Ben Levinsky	28627a5a03	libgloss: microblaze: adjust handlers to be weak. Previously, hw exception handler stub and interrupt handler stub for microbaze were unable to be overwritten. Change to weak to fix this. Signed-off-by: Ben Levinsky <ben.levinsky@xilinx.com>	2018-05-03 15:16:13 -04:00
Yaakov Selkowitz	67609efeb0	Cygwin: fix build with GCC 7 GCC 7 is able to see straight through this trick, so use a more formal method to avoid the warning. Signed-off-by: Yaakov Selkowitz <yselkowi@redhat.com>	2018-04-16 22:46:11 -05:00
Jeff Johnston	cd31fbb2ae	Add nvptx port. - From: Cesar Philippidis <cesar@codesourcery.com> Date: Tue, 10 Apr 2018 14:43:42 -0700 Subject: [PATCH] nvptx port This port adds support for Nvidia GPU's, which are primarily used as offload accelerators in OpenACC and OpenMP.	2018-04-13 15:42:37 -04:00
Corinna Vinschen	e206c39bb6	Cygwin: fix guard checking for current user's AuthZ context Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-04-12 09:43:12 +02:00
Corinna Vinschen	5d99256613	Cygwin: add cuinof changes to release text Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-04-11 12:46:18 +02:00
Corinna Vinschen	cef1070bcb	Cygwin: cpuinfo: Use active CPU count per group There are systems with a MaximumProcessorCount not reflecting the actually available CPUs. The ActiveProcessorCount is correct though. So we use ActiveProcessorCount rather than MaximumProcessorCount per group to set group affinity correctly. Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-04-11 12:45:57 +02:00
Corinna Vinschen	92f4e0500b	Cygwin: wincap: expose more SYSTEM_INFO members and use as appropriate Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-04-11 11:59:35 +02:00
Corinna Vinschen	402d68af1a	Cygwin: cpuinfo: report L3 cache on Intel CPUs Signed-off-by: Corinna Vinschen <corinna@vinschen.de>	2018-04-11 10:06:25 +02:00

1 2 3 4 5 ...

18237 Commits