45 Commits (optimized_for_deeplearning)

Author SHA1 Message Date
  Ralph Campbell fbc21266e6 Minor C code fixes in driver/ 10 years ago
  Zhang Xianyi d8392c1245 Fixe cmake config bugs. 10 years ago
  Zhang Xianyi f874465bb8 Use cmake to build OpenBLAS GENERIC Target on MSVC x86 64-bit. 10 years ago
  Hank Anderson 9eaea02f33 Added additional gemm defines for complex types. 10 years ago
  Hank Anderson 0d8e227ea7 Changed strategy for setting preprocessor definitions. 10 years ago
  Hank Anderson 371071d461 Added CONJ defines for trmm/trsm. 10 years ago
  Hank Anderson 8a143516e3 Added alternate_name to a couple of the name mangling schemes. 10 years ago
  Hank Anderson e5897ecb9b Added zherk_kernel.c objects to driver/level3. 10 years ago
  Hank Anderson 4662a0b13a Changed generate functions to iterate through a list of float types. 10 years ago
  Hank Anderson e74462a3f5 Moved declarations to start of functions to satisfy MSVC C89 implementation. 10 years ago
  Hank Anderson 056ba26755 Changed a number of inline calls to use __inline. 10 years ago
  Hank Anderson e8c39138c6 Removed return value from GenerateNamedObjects. 10 years ago
  Hank Anderson 627d5e7401 Added SMP objects to driver/level3. 10 years ago
  Hank Anderson 943fa2fb58 Fixed object names in level2. 10 years ago
  Hank Anderson 461e691127 Codes when define is absent are now a parameter to AllCombinations. 10 years ago
  Hank Anderson cfaf1c678f Added option to append define codes with an underscore. 10 years ago
  Hank Anderson 0d7bad1f35 Changed GenerateObjects to append combination codes (e.g. dtrmm_TU). 10 years ago
  Hank Anderson d11bde60d0 DOUBLE define for DBLAS objects is now set in main CMakeLists.txt. 10 years ago
  Hank Anderson 5057a4b4df Added openblas add_library call that uses DBLAS_OBJS ojbects. 10 years ago
  Hank Anderson d3dcdddf75 Moved functions into util cmake file. 10 years ago
  Hank Anderson e5e7595bf9 Added paramater to GenerateObjects for defines that affect all sources. 10 years ago
  Hank Anderson 7693887d61 Added empty set to the combinations generated by AllCombinations. 10 years ago
  Hank Anderson 8d9b196e0d Moved loop over define combos into a function. 10 years ago
  Hank Anderson a6cf8aafc0 Updated level3/CMakeLists with correct defines using all combos. 10 years ago
  Hank Anderson dbdca7bf0c Added first pass at driver/level3 Makefile conversion. 10 years ago
  wernsaar 7aae4a62e7 enabled use of GEMM3M functions 11 years ago
  wernsaar 1d33547222 optimized zgemm kernel for haswell 11 years ago
  wernsaar 3ea4dadd30 optimizations for trsm 11 years ago
  wernsaar 1b10ff129a optimizations for trmm 11 years ago
  wernsaar 125610d23b allow to set custom value for ?GEMM_DEFAULT_UNROLL_MN, optimizations for syrk 11 years ago
  wernsaar be94db096c disabled *3M functions for x86_64 platforms 11 years ago
  Timothy Gu 6c2ead30f0 Remove all trailing whitespace except lapack-netlib 11 years ago
  wernsaar c947ab85dc changed level3.c 12 years ago
  wernsaar 2840d56aeb added dgemm_kernel for Piledriver 12 years ago
  Zhang Xianyi 77b572fa0b Merge branch 'loongson3a' into develop 12 years ago
  Zhang Xianyi 32d2ca3035 Refs #214, #221, #246. Fixed the getrf overflow bug on Windows. 12 years ago
  wernsaar 6f008abcef replaced defined(DOUBLE) by !defined(XDOUBLE) 12 years ago
  Zhang Xianyi 5d3312142a Refs #221 #246. Fixed the overflowing stack bug in mutlithreading BLAS3. 12 years ago
  wernsaar 25491e42f9 New dgemm kernel for BULLDOZER: dgemm_kernel_8x2_bulldozer.S 12 years ago
  Xianyi Zhang 6b01d58712 Disable the optimization of muli-threading gemm on the Loongson3A. 12 years ago
  Wang Qian 8163ab7e55 Change the block size on Loongson 3B. 14 years ago
  traz 9fe3049de6 Adding conditional compilation(#if defined(LOONGSON3A)) to avoid affecting the performance of other platforms. 14 years ago
  traz 831858b883 Modify aligned address of sa and sb to improve the performance of multi-threads. 14 years ago
  Xianyi Zhang 1b97ec1a7c Added DEBUG option in Makefile.rule. Fixed DEBUG typo mistakes. 14 years ago
  Xianyi Zhang 342bbc3871 Import GotoBLAS2 1.13 BSD version codes. 14 years ago