2023 Commits (2d0b2334259d41c2003b51a07580dbd25cfe267c)

Author SHA1 Message Date
  wernsaar 099853fff6 added dtrsm_kernel_RN_8x2_bulldozer.S 12 years ago
  wernsaar 44d23881b5 dtrsm_kernel_LT_8x2_bulldozer.S performance optimization 12 years ago
  Zhang Xianyi 32fb6b9bb2 Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop 12 years ago
  wernsaar aaeb8eaecd modified dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  wernsaar 8aeec32ea0 modified dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  wernsaar 87fc9de572 added dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  wernsaar 564aa60fec removed dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  wernsaar f645665dd6 fixed bug in dgemv_t_bulldozer.S 12 years ago
  wernsaar e45a347cd2 repaired trmm bug in sgemm_kernel_16x2_bulldozer.S 12 years ago
  wernsaar 99727ac013 repaired trmm bug in cgemm_kernel_4x2_bulldozer.S 12 years ago
  wernsaar 6e0a2fbc0c repaired trmm bug in zgemm_kernel_2x2_bulldozer.S 12 years ago
  wernsaar 0a22f99c58 repaired trmm bug in dgemm_kernel_8x2_bulldozer.S 12 years ago
  wernsaar cff70a666d added generic trmm kernels and modified Makefile.L3 12 years ago
  wernsaar 84bd0aabaa added dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  Zhang Xianyi 72b1edaf1b Merge branch 'develop' into bulldozer 12 years ago
  wangqian 1b3b9e841d Fixed a computational error in zgemm_kernel_4x4_sandy.S file. 12 years ago
  Zhang Xianyi 2ed0f6ab60 Fixed the typo. 12 years ago
  Zhang Xianyi 886cbaf4e4 Support AMD Piledriver by bulldozer kernels. 12 years ago
  Zhang Xianyi 57944538b6 Use ALIGN_5 instead of .align 32 in assembly kernel. Added ALIGN_5 for 32-bit OSX. 12 years ago
  Zhang Xianyi fa916a0fac Fixed #238 bug in lsame on x86. 12 years ago
  Zhang Xianyi fb298b34ae Merge pull request #235 from wernsaar/develop 12 years ago
  wernsaar 16012767f4 added dcopy_bulldozer.S 12 years ago
  wernsaar bcbac31b47 added ddot_bulldozer.S 12 years ago
  wernsaar 8dc0c72583 added daxpy_bulldozer.S 12 years ago
  wernsaar 89405a1a0b cleanup of dgemm_ncopy_8_bulldozer.S 12 years ago
  wernsaar 4f2b12b8a8 added dgemv_t_bulldozer.S 12 years ago
  Zhang Xianyi 646e168d26 Merge pull request #233 from wernsaar/develop 12 years ago
  wernsaar 93dbbe1fb8 added dgemm_ncopy_8_bulldozer.S 12 years ago
  wernsaar a135f5d9ed added gemm_tcopy_2_bulldozer.S 12 years ago
  wernsaar d0b6299b13 added dgemm_tcopy_8_bulldozer.S 12 years ago
  wernsaar 9e58dd509e added gemm_ncopy_2_bulldozer.S 12 years ago
  wernsaar 7c8227101b cleanup of dgemv_n_bulldozer.S and optimization of inner loop 12 years ago
  wernsaar f67fa62851 added dgemv_n_bulldozer.S 12 years ago
  Zhang Xianyi cd1d473ba0 Merge pull request #230 from wernsaar/develop 12 years ago
  wernsaar 0ded1fcc1c performance optimizations in sgemm_kernel_16x2_bulldozer.S 12 years ago
  wernsaar a789b588cd added cgemm_kernel_4x2_bulldozer.S 12 years ago
  wernsaar 8eaa04acbb added zgemm_kernel_2x2_bulldozer.S 12 years ago
  wernsaar d854b30ae6 Added UNROLL values for 3M to getarch_2nd.c, Makefile.system and Makefile.L3 12 years ago
  wernsaar d65bbec99b added new sgemm kernel for BULLDOZER 12 years ago
  wernsaar e4c39c7c26 changed stack touching 12 years ago
  wernsaar 25491e42f9 New dgemm kernel for BULLDOZER: dgemm_kernel_8x2_bulldozer.S 12 years ago
  Zhang Xianyi 9f59f384d8 Refs #223. Fixed s/dgemv bug on windows. 12 years ago
  wangqian 23965f164c Fixed overflow internal buffer bug of (s/d/c/z)gemv on x86_64. 12 years ago
  wangqian 6a72840945 Fixed overflow internal buffer bug of (s/d/c/z)gemv on x86. 12 years ago
  wernsaar 69aa6c8fb1 bad performance with some data 12 years ago
  wernsaar 60b263f3d2 removed trsm_kernel_RT_4x4_bulldozer.S. wrong results 12 years ago
  wernsaar 7ac306e0da added trsm_kernel_RT_4x4_bulldozer.S 12 years ago
  wernsaar 4cb454cdf2 added trsm_kernel_LT_4x4_bulldozer.S 12 years ago
  wernsaar 19ad2fb128 prefetch improved. Defined 2 different kernels for inner loop 12 years ago
  wernsaar 6821677489 minor improvements and code cleanup 12 years ago