https://github.com/libxsmm/libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
@stdlib/blas-ext-base-gcusumkbn2
Calculate the cumulative sum of strided array elements using a second-order iterative Kahan–Babuška algorithm.
@stdlib/blas-ext-base-gsumpw
Calculate the sum of strided array elements using pairwise summation.
@stdlib/blas-ext-base-gapxsumpw
Adds a constant to each strided array element and computes the sum using pairwise summation.
@stdlib/blas-ext-base-dsumpw
Calculate the sum of double-precision floating-point strided array elements using pairwise summation.
@stdlib/blas-ext-base-gsumkbn
Calculate the sum of strided array elements using an improved Kahan–Babuška algorithm.
@stdlib/blas-ext-base-dnansumkbn
Calculate the sum of double-precision floating-point strided array elements, ignoring NaN values and using an improved Kahan–Babuška algorithm.
@stdlib/blas-ext-base-ssumpw
Calculate the sum of single-precision floating-point strided array elements using pairwise summation.
@stdlib/blas-ext-base-dapxsumkbn
Add a constant to each double-precision floating-point strided array element and compute the sum using an improved Kahan–Babuška algorithm.
@stdlib/blas-ext-base-dnannsumpw
Calculate the sum of double-precision floating-point strided array elements, ignoring NaN values and using pairwise summation.
@stdlib/blas-ext-base-sapxsumpw
Adds a constant to each single-precision floating-point strided array element and computes the sum using pairwise summation.
@stdlib/blas-ext-base-dssumpw
Calculate the sum of single-precision floating-point strided array elements using pairwise summation with extended accumulation and returning an extended precision result.
@stdlib/blas-ext-base-dapxsumkbn2
Add a constant to each double-precision floating-point strided array element and compute the sum using a second-order iterative Kahan–Babuška algorithm.
@stdlib/blas-ext-base-sapxsumkbn
Add a constant to each single-precision floating-point strided array element and compute the sum using an improved Kahan–Babuška algorithm.
@stdlib/blas-ext-base-ssumkbn2
Calculate the sum of single-precision floating-point strided array elements using a second-order iterative Kahan–Babuška algorithm.
@stdlib/blas-ext-base-dsumkbn
Calculate the sum of double-precision floating-point strided array elements using an improved Kahan–Babuška algorithm.
@stdlib/blas-ext-base-scusumpw
Calculate the cumulative sum of single-precision floating-point strided array elements using pairwise summation.
@stdlib/blas-ext-base-sdssumpw
Calculate the sum of single-precision floating-point strided array elements using pairwise summation with extended accumulation.
@stdlib/blas-ext-base-gsumkbn2
Calculate the sum of strided array elements using a second-order iterative Kahan–Babuška algorithm.
@stdlib/blas-ext-base-dcusumkbn
Calculate the cumulative sum of double-precision floating-point strided array elements using an improved Kahan–Babuška algorithm.
@stdlib/blas-ext-base-ssumkbn
Calculate the sum of single-precision floating-point strided array elements using an improved Kahan–Babuška algorithm.
@stdlib/blas-ext-base-gapxsumkbn
Add a constant to each strided array element and compute the sum using an improved Kahan–Babuška algorithm.
@stdlib/blas-ext-base-gcusumkbn
Calculate the cumulative sum of strided array elements using an improved Kahan–Babuška algorithm.
@stdlib/blas-ext-base-scusumkbn
Calculate the cumulative sum of single-precision floating-point strided array elements using an improved Kahan–Babuška algorithm.
blas-ext-base-dcusumkbn2
Calculate the cumulative sum of double-precision floating-point strided array elements using a second-order iterative Kahan–Babuška algorithm.
@stdlib/blas-ext-base-dsumkbn2
Calculate the sum of double-precision floating-point strided array elements using a second-order iterative Kahan–Babuška algorithm.
@stdlib/blas-ext-base-dnannsumkbn2
Calculate the sum of double-precision floating-point strided array elements, ignoring NaN values and using a second-order iterative Kahan–Babuška algorithm.
@stdlib/blas-ext-base-dnannsumkbn
Calculate the sum of double-precision floating-point strided array elements, ignoring NaN values and using an improved Kahan–Babuška algorithm.
@stdlib/blas-ext-base-dnansumpw
Calculate the sum of double-precision floating-point strided array elements, ignoring NaN values and using pairwise summation.
@stdlib/blas-ext-base-dsapxsumpw
Adds a constant to each single-precision floating-point strided array element and computes the sum using pairwise summation with extended accumulation and returning an extended precision result.
@stdlib/blas-ext-base-gnansumkbn
Calculate the sum of strided array elements, ignoring NaN values and using an improved Kahan–Babuška algorithm.
@stdlib/blas-ext-base-dsnansumpw
Calculate the sum of single-precision floating-point strided array elements, ignoring NaN values, using pairwise summation with extended accumulation, and returning an extended precision result.
@stdlib/blas-ext-base-snansumpw
Calculate the sum of single-precision floating-point strided array elements, ignoring NaN values and using pairwise summation.
@stdlib/blas-ext-base-sdsnansumpw
Calculate the sum of single-precision floating-point strided array elements, ignoring NaN values and using pairwise summation with extended accumulation.
@stdlib/blas-ext-base-ssort2hp
Simultaneously sort two single-precision floating-point strided arrays based on the sort order of the first array using heapsort.
@stdlib/blas-ext-base-sdsapxsumpw
Adds a constant to each single-precision floating-point strided array element and computes the sum using pairwise summation with extended accumulation.
@stdlib/blas-ext-base-dsort2hp
Simultaneously sort two double-precision floating-point strided arrays based on the sort order of the first array using heapsort.
@stdlib/blas-ext-base-gnannsumkbn
Calculate the sum of strided array elements, ignoring NaN values and using an improved Kahan–Babuška algorithm.
@stdlib/blas-ext-base-ssort2sh
Simultaneously sort two single-precision floating-point strided arrays based on the sort order of the first array using Shellsort.
@stdlib/blas-ext-base-dnansumkbn2
Calculate the sum of double-precision floating-point strided array elements, ignoring NaN values and using a second-order iterative Kahan–Babuška algorithm.
@stdlib/blas-ext-base-scusumkbn2
Calculate the cumulative sum of single-precision floating-point strided array elements using a second-order iterative Kahan–Babuška algorithm.
@stdlib/blas-ext-base-gnansumkbn2
Calculate the sum of strided array elements, ignoring NaN values and using a second-order iterative Kahan–Babuška algorithm.
@stdlib/blas-ext-base-sasumpw
Calculate the sum of absolute values (L1 norm) of single-precision floating-point strided array elements using pairwise summation.
@stdlib/blas-ext-base-gapxsumkbn2
Add a constant to each strided array element and compute the sum using a second-order iterative Kahan–Babuška algorithm.
@stdlib/blas-ext-base-dsort2sh
Simultaneously sort two double-precision floating-point strided arrays based on the sort order of the first array using Shellsort.
@stdlib/blas-ext-base-sapxsumkbn2
Add a constant to each single-precision floating-point strided array element and compute the sum using a second-order iterative Kahan–Babuška algorithm.
@stdlib/blas-ext-base-snansumkbn2
Calculate the sum of single-precision floating-point strided array elements, ignoring NaN values and using a second-order iterative Kahan–Babuška algorithm.
@stdlib/blas-ext-base-gcusumpw
Calculate the cumulative sum of strided array elements using pairwise summation.
@stdlib/blas-ext-base-snansumkbn
Calculate the sum of single-precision floating-point strided array elements, ignoring NaN values and using an improved Kahan–Babuška algorithm.
blas-base-sdsdot
Calculate the dot product of two single-precision floating-point vectors with extended accumulation.
@stdlib/blas-ext-base-dapx
Add a constant to each element in a double-precision floating-point strided array.
@stdlib/blas-base-ddot
Calculate the dot product of two double-precision floating-point vectors.
https://github.com/ashvardanian/simsimd
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐
@stdlib/blas-base-sdot
Calculate the dot product of two single-precision floating-point vectors.
@stdlib/blas-ext-base-dssum
Calculate the sum of single-precision floating-point strided array elements using extended accumulation and returning an extended precision result.
@stdlib/blas-base-cswap
Interchanges two complex single-precision floating-point vectors.
@stdlib/blas-base-dswap
Interchange two double-precision floating-point vectors.
@stdlib/blas-ext-base-gsumors
Calculate the sum of strided array elements using ordinary recursive summation.
@stdlib/blas-ext-base-ssum
Calculate the sum of single-precision floating-point strided array elements.
@stdlib/blas-ext-base-dsumors
Calculate the sum of double-precision floating-point strided array elements using ordinary recursive summation.
@stdlib/blas-ext-base-dapxsum
Adds a constant to each double-precision floating-point strided array element and computes the sum.
@stdlib/blas-base-ccopy
Copy values from one complex single-precision floating-point vector to another complex single-precision floating-point vector.
@stdlib/blas-ext-base-ssumors
Calculate the sum of single-precision floating-point strided array elements using ordinary recursive summation.
@stdlib/blas-ext-base-gcusum
Calculate the cumulative sum of strided array elements.
@stdlib/blas-ext-base-dnanasumors
Calculate the sum of absolute values (L1 norm) of double-precision floating-point strided array elements, ignoring NaN values and using ordinary recursive summation.
@stdlib/blas-base-sswap
Interchange two single-precision floating-point vectors.
@stdlib/blas-ext-base-dsum
Calculate the sum of double-precision floating-point strided array elements.
@stdlib/blas-ext-base-sdsapxsum
Adds a constant to each single-precision floating-point strided array element and computes the sum using extended accumulation.