188 Commits (02bc36ac79b06142b97e2c7a93632f668f3e6b4e)

Author SHA1 Message Date
  wernsaar 02bc36ac79 added sgemm_ncopy routine and made some improvements on cgemm_kernel for ARMV7 12 years ago
  wernsaar 85484a42df added kernels for cgemm, ctrmm, zgemm and ztrmm 12 years ago
  wernsaar 3983011f0b added sgemm- and strmm_kernel 12 years ago
  wernsaar 2a1515c9dd added dgemm_ncopy_4_vfpv3.S 12 years ago
  wernsaar 31f51e78bc minor optimizations on dgemm_kernel 12 years ago
  wernsaar e0b968c3a7 Changed kernels for dgemm and dtrmm 12 years ago
  wernsaar 1c63180bb6 updated dgemm_kernel_8x2_vfpv3.S 12 years ago
  wernsaar 4a474ea7dc changed dgemm_kernel to use fused multiply add 12 years ago
  wernsaar 69ce737cc5 modified Makefile.L3 for ARM 12 years ago
  wernsaar 70411af888 initial checkin of kernel/arm 12 years ago
  wernsaar 067e8417fd removed unnecessary instructions from zgemm_kernel_2x2_bulldozer.S 12 years ago
  wernsaar a82da3d069 removed unnecessary instructions 12 years ago
  Zhang Xianyi 1569bf14f8 Refs #282. Fixed zgemv_n typo bug on Win64. 12 years ago
  Zhang Xianyi c0159d44a3 Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop 12 years ago
  wernsaar c17a850c1c modified KERNEL.BULLDOZER 12 years ago
  wernsaar 099853fff6 added dtrsm_kernel_RN_8x2_bulldozer.S 12 years ago
  wernsaar 44d23881b5 dtrsm_kernel_LT_8x2_bulldozer.S performance optimization 12 years ago
  Zhang Xianyi 32fb6b9bb2 Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop 12 years ago
  wernsaar aaeb8eaecd modified dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  wernsaar 8aeec32ea0 modified dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  wernsaar 87fc9de572 added dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  wernsaar 564aa60fec removed dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  wernsaar f645665dd6 fixed bug in dgemv_t_bulldozer.S 12 years ago
  wernsaar e45a347cd2 repaired trmm bug in sgemm_kernel_16x2_bulldozer.S 12 years ago
  wernsaar 99727ac013 repaired trmm bug in cgemm_kernel_4x2_bulldozer.S 12 years ago
  wernsaar 6e0a2fbc0c repaired trmm bug in zgemm_kernel_2x2_bulldozer.S 12 years ago
  wernsaar 0a22f99c58 repaired trmm bug in dgemm_kernel_8x2_bulldozer.S 12 years ago
  wernsaar cff70a666d added generic trmm kernels and modified Makefile.L3 12 years ago
  wernsaar 84bd0aabaa added dtrsm_kernel_LT_8x2_bulldozer.S 12 years ago
  Zhang Xianyi 72b1edaf1b Merge branch 'develop' into bulldozer 12 years ago
  wangqian 1b3b9e841d Fixed a computational error in zgemm_kernel_4x4_sandy.S file. 12 years ago
  Zhang Xianyi 2ed0f6ab60 Fixed the typo. 12 years ago
  Zhang Xianyi 886cbaf4e4 Support AMD Piledriver by bulldozer kernels. 12 years ago
  Zhang Xianyi 57944538b6 Use ALIGN_5 instead of .align 32 in assembly kernel. Added ALIGN_5 for 32-bit OSX. 12 years ago
  Zhang Xianyi fa916a0fac Fixed #238 bug in lsame on x86. 12 years ago
  Zhang Xianyi fb298b34ae Merge pull request #235 from wernsaar/develop 12 years ago
  wernsaar 16012767f4 added dcopy_bulldozer.S 12 years ago
  wernsaar bcbac31b47 added ddot_bulldozer.S 12 years ago
  wernsaar 8dc0c72583 added daxpy_bulldozer.S 12 years ago
  wernsaar 89405a1a0b cleanup of dgemm_ncopy_8_bulldozer.S 12 years ago
  wernsaar 4f2b12b8a8 added dgemv_t_bulldozer.S 12 years ago
  Zhang Xianyi 646e168d26 Merge pull request #233 from wernsaar/develop 12 years ago
  wernsaar 93dbbe1fb8 added dgemm_ncopy_8_bulldozer.S 12 years ago
  wernsaar a135f5d9ed added gemm_tcopy_2_bulldozer.S 12 years ago
  wernsaar d0b6299b13 added dgemm_tcopy_8_bulldozer.S 12 years ago
  wernsaar 9e58dd509e added gemm_ncopy_2_bulldozer.S 12 years ago
  wernsaar 7c8227101b cleanup of dgemv_n_bulldozer.S and optimization of inner loop 12 years ago
  wernsaar f67fa62851 added dgemv_n_bulldozer.S 12 years ago
  Zhang Xianyi cd1d473ba0 Merge pull request #230 from wernsaar/develop 12 years ago
  wernsaar 0ded1fcc1c performance optimizations in sgemm_kernel_16x2_bulldozer.S 12 years ago