14 Commits (optimized_for_deeplearning)

Author SHA1 Message Date
  Ashwin Sekhar T K 45f78963ac Optimized cgemm kernel for CORTEXA57 10 years ago
  Martin Koehler 711ca33bc6 Improved Ximatcopy when lda==ldb. 10 years ago
  Zhang Xianyi 1cf2b10224 Use pure C generic target on x86 and x86_64. 10 years ago
  Werner Saar 9bd962f655 modified haswell parameter dgemm_unroll_n 10 years ago
  Zhang Xianyi ea7f9dacf4 Refs #509. Fixed geadd building bug with DYNAMIC_ARCH=1. 10 years ago
  Zhang Xianyi 2fb02626da Update organization info. 11 years ago
  Benedikt Huber 58c90d5937 # The first commit's message is: 11 years ago
  wernsaar b079df9ef4 added optimized sdot- and dsdot-kernel, written in C 11 years ago
  Timothy Gu 6c2ead30f0 Remove all trailing whitespace except lapack-netlib 11 years ago
  Zhang Xianyi 6c4a7d0828 Import AMD Piledriver DGEMM kernel generated by AUGEM. 12 years ago
  wernsaar cff70a666d added generic trmm kernels and modified Makefile.L3 12 years ago
  wangqian f76f952547 Refs #83 #53. Adding Intel Sandy Bridge (AVX supported) kernel codes for BLAS level 3 functions. 13 years ago
  Wang Qian 8e53b57bb2 Appending gemmkernel and trmmkernel C code in kernel/generic, this code can be used to execute on a new platform which does not have optimized assembly kernel. 13 years ago
  Xianyi Zhang 342bbc3871 Import GotoBLAS2 1.13 BSD version codes. 14 years ago