Log - HEAD - third_party/github.com/google/gemmlowp

08e4bb3 Add vcpkg installation instructions (#211) by JonLiu1993 · 1 year, 8 months ago main
e844ffd Adding output_range_offset support for requantize transform. (#209) by Everton Constantino · 2 years, 5 months ago
09d81e0 Build static or shared libraries for all platforms (#208) by Andrew Marshall · 2 years, 6 months ago
daf630d Support CMake install/find_package (#207) by Park DongHa · 2 years, 7 months ago
f995960 Support aarch64-pc-windows-msvc (#206) by metarutaiga · 2 years, 11 months ago
8589f20 Fix -Wdeprecated-copy. (#205) by pkasting · 2 years, 11 months ago
13d5770 Port fixedpoint header to WebAssembly SIMD (#202) by Marat Dukhan · 3 years, 4 months ago
fda83bd Large-scale change: Mark static const class/struct members as constexpr. This change fixes declarations that have initial values but are technically not definitions by marking them constexpr (which counts as a definition). This enables, among other things, the modified constants to be passed into functions and function templates that accept arguments by reference. Without this change, such functions would cause linker errors. by Danny van der Rijn · 4 years ago
023c190 Add 16-bit support for AVX2 by T.J. Alumbaugh · 4 years ago
2483d84 added template specialization for int8_t saturating add by Lamar · 4 years, 4 months ago
fcdb1b3 fixed doc comments for Mul and Sub by Lamar · 4 years, 4 months ago
08c5dd3 fix recent regression by Benoit Jacob · 4 years, 5 months ago
e25e353 StoreFinalOutputImpl::Run is reloading MatrixMap::data_ and by Clement Courbet · 4 years, 5 months ago
6a007c7 Add some extra documentation to the magic numbers used in by Nick Kreeger · 4 years, 5 months ago
dc69acd fix avx header include by Benoit Jacob · 4 years, 9 months ago
a227af1 Fix the type to avoid comparison of integers of different signs. The compile error is reported when -Wsign-compare is added. by Chao Mei · 4 years, 10 months ago
447abda Increase ProfilingStack::kMaxSize by Sean Silva · 5 years ago
12fed0c Argh, we can't update MultiThreadGemm to use the new by Benoit Jacob · 5 years ago
5495284 Another take on what a sane Execute method would look like, by Benoit Jacob · 5 years ago
f0077a5 unused variable by Benoit Jacob · 5 years ago
948468b Rename Execute to LegacyExecuteAndDestroyTasks. Add better Execute overload. by Benoit Jacob · 5 years ago
892c70f Making scripts compatible with Python 3 by dibyaran · 5 years ago
36212ad Changed Microsoft-specific suffix for 'long long int' type to the C++ standard for the same type explained in the section '5.13 Literals'. by Arritmic · 5 years ago
dec2b7d Fix pack_neon to use signed instruction. by Suharsh Sivakumar · 5 years ago
1bf3b9c Fix ShiftLeft int16x8_t impl. by Suharsh Sivakumar · 5 years ago
1ff8bdd fix the deps for the :fixedpoint target by Benoit Jacob · 5 years ago
82acdad Do not register worker threads for profiling. by Benoit Jacob · 5 years ago
1b76f37 make SSE 128bit specializations available also when AVX is available, as AVX per se only defines 256bit specializations, and user code may depend specifically on 128bit, as is the case in TFLite by Benoit Jacob · 5 years ago
664f14a Per-channel output rescale and int8 input support for NEON. by Suharsh Sivakumar · 5 years ago
2390b74 Fix documentation on OutputScalar in GemmWithOutputPipeline by Tetsuo Kiso · 5 years ago
58825b1 MIPS: Improve StoreFinalOutput for 8x4 and 8x8 blocks of uint8. by Alexey Frunze · 5 years ago
5e071e0 Add a stage truncating int32 to uint8. by Alexey Frunze · 5 years ago
1949d3c MIPS: Implement an int8/LhsAlwaysNonzero kernel by Alexey Frunze · 5 years ago
1762ea1 Fix markdown format in kernel.md by Tetsuo Kiso · 5 years ago
2233f5e fix warnings about unused parameters by Paul Idstein · 5 years ago
55680df Fix typo by Tetsuo Kiso · 5 years ago
7aff33d Fix typo by Tetsuo Kiso · 5 years ago
42c5318 use GEMMLOWP_NOINLINE to avoid windows build break. by linkerzhang · 5 years ago
6a692f0 Add --fix option to allow fixing up wrong encodings in place, useful when mutating already-encoded code by Benoit Jacob · 5 years ago
a3df028 Fix typo in comments by Tetsuo Kiso · 5 years ago
1490d29 add an experiment program to study how to interpret ARM PMU cache event counters. by Benoit Jacob · 5 years ago
3fb5c17 MIPS: Implement ImplSaturatingRoundingMultiplyByPOT<,,-1> by Alexey Frunze · 5 years ago
18e4cc0 support encoding sdot as well as udot by Benoit Jacob · 5 years ago
75c19af MIPS: Implement RoundingDivideByPOT(). by Alexey Frunze · 6 years ago
6db6c2a MIPS: Improve saturation. by Alexey Frunze · 6 years ago
8416bab Fix #136 (test_blocking_counter) by Alexey Frunze · 6 years ago
7d6d123 Import the new kernel contributed by ARM using the new dot-product by Benoit Jacob · 6 years ago
3f87bab tiny fix to encode.py by Benoit Jacob · 6 years ago
f183d88 further improvements on encode.py by Benoit Jacob · 6 years ago
b1d3c06 Minor touchups on standalone programs by Benoit Jacob · 6 years ago
f3c2b90 Add a tool to encode udot instructions in asm code so we compile by Benoit Jacob · 6 years ago
2c33e33 make udot kernels build on current toolchains by manually encoding the udot instruction, and add another udot kernel, "_narrow" by Benoit Jacob · 6 years ago
7c5f9be Fix a use-after-free issue with the TLS data in the profiler, by Benoit Jacob · 6 years ago
3aad165 Fix a regression of the profiler introduced in commit by Benoit Jacob · 6 years ago
719139c fix comment by Benoit Jacob · 6 years ago
d877b64 Fix undefined behavior in the overflow case in ShiftLeft. by Benoit Jacob · 6 years ago
4f09871 lint by Benoit Jacob · 6 years ago
1080dba Fixing missing header. (Patch by Ben Vanik) by Benoit Jacob · 6 years ago
fa27313 Make gemmlowp multithreading correct: by Benoit Jacob · 6 years ago
dea287e Added math header to fixed point for round() declaration by Pete Warden · 6 years ago
a47fd48 Separated platform detection into a new header, so fixed point routines can use the right architecture after previous removal of common.h header by Pete Warden · 6 years ago
e88547b Removed fixed point dependency on wider gemmlowp headers, and added explicit casting for platforms where ints are 16-bits, to help with microcontroller portability by Pete Warden · 6 years ago
07eb865 fix warnings about unused parameters by Bin Li · 6 years ago
8724b85 [MSVC] Enable SSE4 when AVX is turned on by Loo Rong Jie · 6 years ago
cd685fe [Bazel] Don't set -lpthread for Windows by Loo Rong Jie · 6 years ago
6c8f5d4 add missing inline keywords to fix linkage issues by bjacob · 6 years ago
697ceea Add 'cc' and 'memory' to the clobbers list of NEON_64bit_GEMM_Int7Operands_AccumEightWithin16Bits and NEON_64bit_GEMM_Int425Operands kernels since they are being modified. by Marie White · 6 years ago
38ebac7 Add malloc.h for Windows by Yong Tang · 6 years ago
fc0ebb1 Fix compilation error in Windows by Yong Tang · 6 years ago
85179c4 add Richart Winterton to CONTRIBUTORS, clang-format and convert to unix line endings by Benoit Jacob · 6 years ago
d74760e Avx2optimizations (#122) by rrwinterton · 6 years ago
ad03972 Add profiling support to benchmark_all_sizes by Benoit Jacob · 6 years ago
e08d0f6 Merge pull request #132 from jalexstark/fixedpoint-constexpr by bjacob · 6 years ago
c033219 Adjust use of static const / static constexpr in fixedpoint. by Alex Stark · 6 years ago
87510e5 Merge pull request #131 from MIPS/msa_width_major by bjacob · 6 years ago
c318e3f MIPS: Improve the kernel by switching to width-major. by Alexey Frunze · 6 years ago
f59a96b Merge pull request #130 from ajtulloch/gemmlowp-wait-for-variable-change-spurious-wakeup by bjacob · 6 years ago
1a4582c multi_thread_gemm.h::WaitForVariableChange: Handle spurious wakeups correctly by Andrew Tulloch · 6 years ago
d6fe01d Merge pull request #129 from ajtulloch/gemmlowp-fix-undefined-behaviour-in-left-shift by bjacob · 6 years ago
ea3a4bf UBSAN: Fix fixedpoint.h::ShiftLeft to not invoke undefined behaviour by Andrew Tulloch · 6 years ago
5b40e38 Merge pull request #128 from mariecwhite/sevenbit by bjacob · 6 years ago
3463777 Add mariewhite@ to list of contributors by Marie White · 6 years ago
abc0fdb Add 4-bit GEMM asm and intrinsics kernel by Marie White · 6 years ago
73184f4 Add 7-bit GEMM asm and intrinsics kernel by Marie White · 6 years ago
d41349d Add kernel operand ranges to standalone benchmark by Marie White · 6 years ago
ecae4d1 Merge pull request #125 from MIPS/mips_cast16 by bjacob · 6 years ago
c9bd438 MIPS: Add output stage casting to int16 by Alexey Frunze · 6 years ago
7c7c744 Merge pull request #124 from bjacob/updates-for-fast-lstm by bjacob · 6 years ago
416992a add OutputStageScaleInt32ByFixedPointAndExponent by Benoit Jacob · 6 years ago
9d7ae82 Add output stage casting to int16 by Benoit Jacob · 6 years ago
15a66ad rename: s/OutputStageQuantizeDownInt32ToUint8ScaleByFixedPoint/OutputStageQuantizeDownInt32ByFixedPoint/g - and keep the old name supported for compatibility by Benoit Jacob · 6 years ago
d4d1e29 compilation fixes by Benoit Jacob · 6 years ago
c45f8f5 Merge pull request #123 from MIPS/mips_fixedpoint16bit by bjacob · 6 years ago
95af7df fix compilation by Benoit Jacob · 6 years ago
80c9ee5 MIPS: Add 16-bit support to gemmlowp/fixedpoint & math functions by Alexey Frunze · 6 years ago
f4763b6 compilation fixes by Benoit Jacob · 6 years ago
304ac74 fix trailing whitespace in script by Benoit Jacob · 6 years ago
d77359f Merge branch 'master' of github.com:google/gemmlowp by Benoit Jacob · 6 years ago
6da0669 clang-format by Benoit Jacob · 6 years ago
4c5cbde Merge pull request #113 from achille/master by bjacob · 6 years ago