You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.

CONTRIBUTORS.md 6.6 kB

8 years ago
8 years ago
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170
  1. # Contributions to the OpenBLAS project
  2. ## Creator & Maintainer
  3. * Zhang Xianyi <traits.zhang@gmail.com>
  4. ## Active Developers
  5. * Wang Qian <traz0824@gmail.com>
  6. * Optimize BLAS3 on ICT Loongson 3A.
  7. * Optimize BLAS3 on Intel Sandy Bridge.
  8. * Werner Saar <wernsaar@googlemail.com>
  9. * [2013-03-04] Optimize AVX and FMA4 DGEMM on AMD Bulldozer
  10. * [2013-04-27] Optimize AVX and FMA4 TRSM on AMD Bulldozer
  11. * [2013-06-09] Optimize AVX and FMA4 SGEMM on AMD Bulldozer
  12. * [2013-06-11] Optimize AVX and FMA4 ZGEMM on AMD Bulldozer
  13. * [2013-06-12] Optimize AVX and FMA4 CGEMM on AMD Bulldozer
  14. * [2013-06-16] Optimize dgemv_n kernel on AMD Bulldozer
  15. * [2013-06-20] Optimize ddot, daxpy kernel on AMD Bulldozer
  16. * [2013-06-21] Optimize dcopy kernel on AMD Bulldozer
  17. * Porting and Optimization on ARM Cortex-A9
  18. * Optimization on AMD Piledriver
  19. * Optimization on Intel Haswell
  20. ## Previous Developers
  21. * Zaheer Chothia <zaheer.chothia@gmail.com>
  22. * Improve the compatibility about complex number
  23. * Build LAPACKE: C interface to LAPACK
  24. * Improve the windows build.
  25. * Chen Shaohu <huhumartinwar@gmail.com>
  26. * Optimize GEMV on the Loongson 3A processor.
  27. * Luo Wen
  28. * Intern. Test Level-2 BLAS.
  29. ## Contributors
  30. In chronological order:
  31. * pipping <http://page.mi.fu-berlin.de/pipping>
  32. * [2011-06-11] Make USE_OPENMP=0 disable openmp.
  33. * Stefan Karpinski <stefan@karpinski.org>
  34. * [2011-12-28] Fix a bug about SystemStubs on Mac OS X.
  35. * Alexander Eberspächer <https://github.com/aeberspaecher>
  36. * [2012-05-02] Add note on patch for segfaults on Linux kernel 2.6.32.
  37. * Mike Nolta <mike@nolta.net>
  38. * [2012-05-19] Fix building bug on FreeBSD and NetBSD.
  39. * Sylvestre Ledru <https://github.com/sylvestre>
  40. * [2012-07-01] Improve the detection of sparc. Fix building bug under
  41. Hurd and kfreebsd.
  42. * Jameson Nash <https://github.com/vtjnash>
  43. * [2012-08-20] Provide support for passing CFLAGS, FFLAGS, PFLAGS, FPFLAGS to
  44. make on the command line.
  45. * Alexander Nasonov <alnsn@yandex.ru>
  46. * [2012-11-10] Fix NetBSD build.
  47. * Sébastien Villemot <sebastien@debian.org>
  48. * [2012-11-14] Fix compilation with TARGET=GENERIC. Patch applied to Debian package.
  49. * [2013-08-28] Avoid failure on qemu guests declaring an Athlon CPU without 3dnow!
  50. * Kang-Che Sung <Explorer09@gmail.com>
  51. * [2013-05-17] Fix typo in the document. Re-order the architecture list in getarch.c.
  52. * Kenneth Hoste <kenneth.hoste@gmail.com>
  53. * [2013-05-22] Adjust Makefile about downloading LAPACK source files.
  54. * Lei WANG <https://github.com/wlbksy>
  55. * [2013-05-22] Fix a bug about wget.
  56. * Dan Luu <http://www.linkedin.com/in/danluu>
  57. * [2013-06-30] Add Intel Haswell support (using sandybridge optimizations).
  58. * grisuthedragon <https://github.com/grisuthedragon>
  59. * [2013-07-11] create openblas_get_parallel to retrieve information which parallelization
  60. model is used by OpenBLAS.
  61. * Elliot Saba <staticfloat@gmail.com>
  62. * [2013-07-22] Add in return value for `interface/trtri.c`
  63. * Sébastien Fabbro <bicatali@gentoo.org>
  64. * [2013-07-24] Modify makefile to respect user's LDFLAGS
  65. * [2013-07-24] Add stack markings for GNU as arch-independent for assembler files
  66. * Viral B. Shah <viral@mayin.org>
  67. * [2013-08-21] Patch LAPACK XLASD4.f as discussed in JuliaLang/julia#2340
  68. * Lars Buitinck <https://github.com/larsmans>
  69. * [2013-08-28] get rid of the generated cblas_noconst.h file
  70. * [2013-08-28] Missing threshold in gemm.c
  71. * [2013-08-28] fix default prefix handling in makefiles
  72. * yieldthought <https://github.com/yieldthought>
  73. * [2013-10-08] Remove -Wl,--retain-symbols-file from dynamic link line to fix tool support
  74. * Keno Fischer <https://github.com/loladiro>
  75. * [2013-10-23] Use FC instead of CC to link the dynamic library on OS X
  76. * Christopher Meng <cickumqt@gmail.com>
  77. * [2013-12-09] Add DESTDIR support for easier building on RPM based distros.
  78. Use install command instead of cp to install files with permissions control.
  79. * Lucas Beyer <lucasb.eyer.be@gmail.com>
  80. * [2013-12-10] Added support for NO_SHARED in make install.
  81. * carlkl <https://github.com/carlkl>
  82. * [2013-12-13] Fixed LAPACKE building bug on Windows
  83. * Isaac Dunham <https://github.com/idunham>
  84. * [2014-08-03] Fixed link error on Linux/musl
  85. * Dave Nuechterlein
  86. * [2014-10-10] trmm and sgemm kernels (optimized for APM's X-Gene 1).
  87. ARMv8 support.
  88. * Jerome Robert <jeromerobert@gmx.com>
  89. * [2015-01-01] Speed-up small `ger` and `gemv` using stack allocation (bug #478)
  90. * [2015-12-23] `stack_check` in `gemv.c` (bug #722)
  91. * [2015-12-28] Allow to force the number of parallel make job
  92. * [2015-12-28] Fix detection of AMD E2-3200 detection
  93. * [2015-12-31] Let `make MAX_STACK_ALLOC=0` do what expected
  94. * [2016-01-19] Disable multi-threading in `ger` and `swap` for small matrices (bug #731)
  95. * [2016-01-24] Use `GEMM_MULTITHREAD_THRESHOLD` as a number of ops (bug #742)
  96. * [2016-01-26] Let `openblas_get_num_threads` return the number of active threads (bug #760)
  97. * [2016-01-30] Speed-up small `zger`, `zgemv`, `ztrmv` using stack allocation (bug #727)
  98. * Dan Kortschak
  99. * [2015-01-07] Added test for drotmg bug #484.
  100. * Ton van den Heuvel <https://github.com/ton>
  101. * [2015-03-18] Fix race condition during shutdown causing a crash in gotoblas_set_affinity().
  102. * Martin Koehler <https://github.com/grisuthedragon/>
  103. * [2015-09-07] Improved imatcopy
  104. * Ashwin Sekhar T K <https://github.com/ashwinyes/>
  105. * [2015-11-09] Assembly kernels for Cortex-A57 (ARMv8)
  106. * [2015-11-20] lapack-test fixes for Cortex-A57
  107. * [2016-03-14] Additional functional Assembly Kernels for Cortex-A57
  108. * [2016-03-14] Optimize Dgemm 4x4 for Cortex-A57
  109. * theoractice <https://github.com/theoractice/>
  110. * [2016-03-20] Fix compiler error in VisualStudio with CMake
  111. * [2016-03-22] Fix access violation on Windows while static linking
  112. * Paul Mustière <https://github.com/buffer51/>
  113. * [2016-02-04] Fix Android build on ARMV7
  114. * [2016-04-26] Android build with LAPACK for ARMV7 & ARMV8
  115. * Shivraj Patil <https://github.com/sva-img/>
  116. * [2016-05-03] DGEMM optimization for MIPS P5600 and I6400 using MSA
  117. * Kaustubh Raste <https://github.com/ksraste/>
  118. * [2016-05-09] DTRSM optimization for MIPS P5600 and I6400 using MSA
  119. * [2016-05-20] STRSM optimization for MIPS P5600 and I6400 using MSA
  120. * Abdelrauf <https://github.com/quickwritereader>
  121. * [2017-01-01] dgemm and dtrmm kernels for IBM z13
  122. * [2017-02-26] ztrmm kernel for IBM z13
  123. * [2017-03-13] strmm and ctrmm kernel for IBM z13
  124. * [2017-09-01] initial Blas Level-1,2 (double precision) for IBM z13