11 results for “topic:sse4”
Fast UTF-8 validation with range algorithm (NEON+SSE4+AVX2)
Fast C functions for the computing the positional popcount (pospopcnt).
Efficient C functions to compute the summary statistics (flagstats) for sequencing read sets.
Fast algorithms for computing XX^T for binary matrices
A fresh (experimental) look at Scilab 6.x
LLaMeSIMD - The Ultimate SIMD Intrinsic & Function Translation Benchmarking Suite
X86-64 bilateral instruction tokenizer implemented in C. Supports the following processor extensions: AES, AVX, AVX2, AVX512, FMA, MMX, SSE, SSE2, SSE3, SSE4, x87(FPU), VMX. In order to ease testing, a diassembler which transforms tokens into compilable assembly (for NASM compiler) has been implemented.
Software for Burstcoin cryptocurrencies. Support for CPU and GPU. More information at cosburst@gmail.com
A cross platform Redis Module Example that warns and uses the optimized functions based on instruction set extensions available and or microarchitecture
No description provided.
Sleef inline headers