- 08e4bb3 Add vcpkg installation instructions (#211) by JonLiu1993 · 1 year, 8 months ago
- e844ffd Adding output_range_offset support for requantize transform. (#209) by Everton Constantino · 2 years, 5 months ago
- 09d81e0 Build static or shared libraries for all platforms (#208) by Andrew Marshall · 2 years, 6 months ago
- daf630d Support CMake install/find_package (#207) by Park DongHa · 2 years, 7 months ago
- f995960 Support aarch64-pc-windows-msvc (#206) by metarutaiga · 2 years, 11 months ago
- 8589f20 Fix -Wdeprecated-copy. (#205) by pkasting · 2 years, 11 months ago
- 13d5770 Port fixedpoint header to WebAssembly SIMD (#202) by Marat Dukhan · 3 years, 4 months ago
- fda83bd Large-scale change: Mark static const class/struct members as constexpr. This change fixes declarations that have initial values but are technically not definitions by marking them constexpr (which counts as a definition). This enables, among other things, the modified constants to be passed into functions and function templates that accept arguments by reference. Without this change, such functions would cause linker errors. by Danny van der Rijn · 4 years ago
- 023c190 Add 16-bit support for AVX2 by T.J. Alumbaugh · 4 years ago
- 2483d84 added template specialization for int8_t saturating add by Lamar · 4 years, 4 months ago
- fcdb1b3 fixed doc comments for Mul and Sub by Lamar · 4 years, 4 months ago
- 08c5dd3 fix recent regression by Benoit Jacob · 4 years, 5 months ago
- e25e353 StoreFinalOutputImpl::Run is reloading MatrixMap::data_ and by Clement Courbet · 4 years, 5 months ago
- 6a007c7 Add some extra documentation to the magic numbers used in by Nick Kreeger · 4 years, 5 months ago
- dc69acd fix avx header include by Benoit Jacob · 4 years, 9 months ago
- a227af1 Fix the type to avoid comparison of integers of different signs. The compile error is reported when -Wsign-compare is added. by Chao Mei · 4 years, 10 months ago
- 447abda Increase ProfilingStack::kMaxSize by Sean Silva · 5 years ago
- 12fed0c Argh, we can't update MultiThreadGemm to use the new by Benoit Jacob · 5 years ago
- 5495284 Another take on what a sane Execute method would look like, by Benoit Jacob · 5 years ago
- f0077a5 unused variable by Benoit Jacob · 5 years ago
- 948468b Rename Execute to LegacyExecuteAndDestroyTasks. Add better Execute overload. by Benoit Jacob · 5 years ago
- 892c70f Making scripts compatible with Python 3 by dibyaran · 5 years ago
- 36212ad Changed Microsoft-specific suffix for 'long long int' type to the C++ standard for the same type explained in the section '5.13 Literals'. by Arritmic · 5 years ago
- dec2b7d Fix pack_neon to use signed instruction. by Suharsh Sivakumar · 5 years ago
- 1bf3b9c Fix ShiftLeft int16x8_t impl. by Suharsh Sivakumar · 5 years ago
- 1ff8bdd fix the deps for the :fixedpoint target by Benoit Jacob · 5 years ago
- 82acdad Do not register worker threads for profiling. by Benoit Jacob · 5 years ago
- 1b76f37 make SSE 128bit specializations available also when AVX is available, as AVX per se only defines 256bit specializations, and user code may depend specifically on 128bit, as is the case in TFLite by Benoit Jacob · 5 years ago
- 664f14a Per-channel output rescale and int8 input support for NEON. by Suharsh Sivakumar · 5 years ago
- 2390b74 Fix documentation on OutputScalar in GemmWithOutputPipeline by Tetsuo Kiso · 5 years ago
- 58825b1 MIPS: Improve StoreFinalOutput for 8x4 and 8x8 blocks of uint8. by Alexey Frunze · 5 years ago
- 5e071e0 Add a stage truncating int32 to uint8. by Alexey Frunze · 5 years ago
- 1949d3c MIPS: Implement an int8/LhsAlwaysNonzero kernel by Alexey Frunze · 5 years ago
- 1762ea1 Fix markdown format in kernel.md by Tetsuo Kiso · 5 years ago
- 2233f5e fix warnings about unused parameters by Paul Idstein · 5 years ago
- 55680df Fix typo by Tetsuo Kiso · 5 years ago
- 7aff33d Fix typo by Tetsuo Kiso · 5 years ago
- 42c5318 use GEMMLOWP_NOINLINE to avoid windows build break. by linkerzhang · 5 years ago
- 6a692f0 Add --fix option to allow fixing up wrong encodings in place, useful when mutating already-encoded code by Benoit Jacob · 5 years ago
- a3df028 Fix typo in comments by Tetsuo Kiso · 5 years ago
- 1490d29 add an experiment program to study how to interpret ARM PMU cache event counters. by Benoit Jacob · 5 years ago
- 3fb5c17 MIPS: Implement ImplSaturatingRoundingMultiplyByPOT<,,-1> by Alexey Frunze · 5 years ago
- 18e4cc0 support encoding sdot as well as udot by Benoit Jacob · 5 years ago
- 75c19af MIPS: Implement RoundingDivideByPOT(). by Alexey Frunze · 6 years ago
- 6db6c2a MIPS: Improve saturation. by Alexey Frunze · 6 years ago
- 8416bab Fix #136 (test_blocking_counter) by Alexey Frunze · 6 years ago
- 7d6d123 Import the new kernel contributed by ARM using the new dot-product by Benoit Jacob · 6 years ago
- 3f87bab tiny fix to encode.py by Benoit Jacob · 6 years ago
- f183d88 further improvements on encode.py by Benoit Jacob · 6 years ago
- b1d3c06 Minor touchups on standalone programs by Benoit Jacob · 6 years ago
- f3c2b90 Add a tool to encode udot instructions in asm code so we compile by Benoit Jacob · 6 years ago
- 2c33e33 make udot kernels build on current toolchains by manually encoding the udot instruction, and add another udot kernel, "_narrow" by Benoit Jacob · 6 years ago
- 7c5f9be Fix a use-after-free issue with the TLS data in the profiler, by Benoit Jacob · 6 years ago
- 3aad165 Fix a regression of the profiler introduced in commit by Benoit Jacob · 6 years ago
- 719139c fix comment by Benoit Jacob · 6 years ago
- d877b64 Fix undefined behavior in the overflow case in ShiftLeft. by Benoit Jacob · 6 years ago
- 4f09871 lint by Benoit Jacob · 6 years ago
- 1080dba Fixing missing header. (Patch by Ben Vanik) by Benoit Jacob · 6 years ago
- fa27313 Make gemmlowp multithreading correct: by Benoit Jacob · 6 years ago
- dea287e Added math header to fixed point for round() declaration by Pete Warden · 6 years ago
- a47fd48 Separated platform detection into a new header, so fixed point routines can use the right architecture after previous removal of common.h header by Pete Warden · 6 years ago
- e88547b Removed fixed point dependency on wider gemmlowp headers, and added explicit casting for platforms where ints are 16-bits, to help with microcontroller portability by Pete Warden · 6 years ago
- 07eb865 fix warnings about unused parameters by Bin Li · 6 years ago
- 8724b85 [MSVC] Enable SSE4 when AVX is turned on by Loo Rong Jie · 6 years ago
- cd685fe [Bazel] Don't set -lpthread for Windows by Loo Rong Jie · 6 years ago
- 6c8f5d4 add missing inline keywords to fix linkage issues by bjacob · 6 years ago
- 697ceea Add 'cc' and 'memory' to the clobbers list of NEON_64bit_GEMM_Int7Operands_AccumEightWithin16Bits and NEON_64bit_GEMM_Int425Operands kernels since they are being modified. by Marie White · 6 years ago
- 38ebac7 Add malloc.h for Windows by Yong Tang · 6 years ago
- fc0ebb1 Fix compilation error in Windows by Yong Tang · 6 years ago
- 85179c4 add Richard Winterton to CONTRIBUTORS, clang-format and convert to unix line endings by Benoit Jacob · 6 years ago
- d74760e Avx2optimizations (#122) by rrwinterton · 6 years ago
- ad03972 Add profiling support to benchmark_all_sizes by Benoit Jacob · 6 years ago
- e08d0f6 Merge pull request #132 from jalexstark/fixedpoint-constexpr by bjacob · 6 years ago
- c033219 Adjust use of static const / static constexpr in fixedpoint. by Alex Stark · 6 years ago
- 87510e5 Merge pull request #131 from MIPS/msa_width_major by bjacob · 6 years ago
- c318e3f MIPS: Improve the kernel by switching to width-major. by Alexey Frunze · 6 years ago
- f59a96b Merge pull request #130 from ajtulloch/gemmlowp-wait-for-variable-change-spurious-wakeup by bjacob · 6 years ago
- 1a4582c multi_thread_gemm.h::WaitForVariableChange: Handle spurious wakeups correctly by Andrew Tulloch · 6 years ago
- d6fe01d Merge pull request #129 from ajtulloch/gemmlowp-fix-undefined-behaviour-in-left-shift by bjacob · 6 years ago
- ea3a4bf UBSAN: Fix fixedpoint.h::ShiftLeft to not invoke undefined behaviour by Andrew Tulloch · 6 years ago
- 5b40e38 Merge pull request #128 from mariecwhite/sevenbit by bjacob · 6 years ago
- 3463777 Add mariewhite@ to list of contributors by Marie White · 6 years ago
- abc0fdb Add 4-bit GEMM asm and intrinsics kernel by Marie White · 6 years ago
- 73184f4 Add 7-bit GEMM asm and intrinsics kernel by Marie White · 6 years ago
- d41349d Add kernel operand ranges to standalone benchmark by Marie White · 6 years ago
- ecae4d1 Merge pull request #125 from MIPS/mips_cast16 by bjacob · 6 years ago
- c9bd438 MIPS: Add output stage casting to int16 by Alexey Frunze · 6 years ago
- 7c7c744 Merge pull request #124 from bjacob/updates-for-fast-lstm by bjacob · 6 years ago
- 416992a add OutputStageScaleInt32ByFixedPointAndExponent by Benoit Jacob · 6 years ago
- 9d7ae82 Add output stage casting to int16 by Benoit Jacob · 6 years ago
- 15a66ad rename: s/OutputStageQuantizeDownInt32ToUint8ScaleByFixedPoint/OutputStageQuantizeDownInt32ByFixedPoint/g - and keep the old name supported for compatibility by Benoit Jacob · 6 years ago
- d4d1e29 compilation fixes by Benoit Jacob · 6 years ago
- c45f8f5 Merge pull request #123 from MIPS/mips_fixedpoint16bit by bjacob · 6 years ago
- 95af7df fix compilation by Benoit Jacob · 6 years ago
- 80c9ee5 MIPS: Add 16-bit support to gemmlowp/fixedpoint & math functions by Alexey Frunze · 6 years ago
- f4763b6 compilation fixes by Benoit Jacob · 6 years ago
- 304ac74 fix trailing whitespace in script by Benoit Jacob · 6 years ago
- d77359f Merge branch 'master' of github.com:google/gemmlowp by Benoit Jacob · 6 years ago
- 6da0669 clang-format by Benoit Jacob · 6 years ago
- 4c5cbde Merge pull request #113 from achille/master by bjacob · 6 years ago