Pull requests: google/XNNPACK
Pull requests list
Use VNNI for m=1 when using AMX microkernel. Create matching VNNI microkernels with m=1.
#6518 opened May 31, 2024 by copybara-service bot
Generate the build identifier using the XNNPack sources that are used in :prod_microkernels.
#6516 opened May 31, 2024 by copybara-service bot
Add QS8_QC8W GEMM/IGEMM microkernels for Wasm Relaxed Unsigned and Signed …
#6505 opened May 30, 2024 by fanchenkong1
- Adds a new type of xnn_profile_info to return node_ids associated with the profiling information: xnn_profile_info_operator_id.
#6498 opened May 29, 2024 by copybara-service bot
SSE41 qs8 rdsum accumulating microkernels
#6492 opened May 28, 2024 by copybara-service bot
Add qd8-qb4w-f16/f32-gemm scalar kernels, qb4w tests + benchmarking
#6491 opened May 28, 2024 by GregoryComer
Avoid benchmark link errors in Bazel for platforms that don't have specializations for ~all kernels. Benchmarks should prefer to use the test/bench microkernels -- *not* the prod microkernels -- therefore they should only depend on test_mode dependencies for those that exist. Unfortunately, the Bazel build rules for benchmarks didn't really follow these guidelines, so new appropriate targets were added and depended on as appropriate.
#6490 opened May 28, 2024 by copybara-service bot
Extend the convert operator for the new qp8 packed per-row dynamic quantization.
#6479 opened May 27, 2024 by copybara-service bot
Clean up the xnn_f32_abs_params and xnn_f32_neg_params that are no longer needed.
#6475 opened May 24, 2024 by copybara-service bot
Rewrite the f32 tanh kernels using the new SIMD intrinsics.
#6457 opened May 22, 2024 by copybara-service bot
Introduce SIMD headers which provide arch-specific wrappers for some common operations.
#6456 opened May 22, 2024 by copybara-service bot
AVX2 qs8 rsum use vpmovsxbd to read bytes as ints
#6444 opened May 21, 2024 by copybara-service bot
Prototype to integrate SW optimizations for Arm® CPUs
#6436 opened May 17, 2024 by gmiodice
[blockwise] Minor fixes for qb4w goi packing routine
#6434 opened May 17, 2024 by digantdesai
Use a better error bound for fp16 tests of the rsum microkernel.
#6431 opened May 16, 2024 by copybara-service bot
F32-RMINMAXSUM - add reduction sum to f32-rminmax
#6427 opened May 16, 2024 by copybara-service bot
Add a new x8-packq microkernel that packs and per-row dynamically quantizes fp32 to qp8.
#6424 opened May 15, 2024 by copybara-service bot
Add dependencies to the KleidiAI library to both the BUILD and CMakeLists.txt files.
#6417 opened May 14, 2024 by copybara-service bot