-
Notifications
You must be signed in to change notification settings - Fork 117
Expand file tree
/
Copy pathCITATION.cff
More file actions
81 lines (79 loc) · 2.96 KB
/
CITATION.cff
File metadata and controls
81 lines (79 loc) · 2.96 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
cff-version: 1.2.0
title: "NumKong: 2000 Mixed Precision Kernels For All"
message: "If you use this software, please cite it as below."
type: software
license: Apache-2.0
version: 7.6.0
repository-code: https://github.com/ashvardanian/NumKong
url: https://ashvardanian.com/posts/numkong/
abstract: >-
Portable mixed-precision math, linear-algebra, & retrieval library
with 2'000+ SIMD kernels for x86, Arm, RISC-V, LoongArch, Power, & WebAssembly,
leveraging rare algebraic transforms with both 1D & 2D registers like AMX & SME,
covering 15+ numeric types from 4-bit integers & 6-bit floats to 128-bit complex numbers,
validated against 118-bit extended-precision baselines
with saturation, casting, & rounding edge-case coverage,
in a 5-100x smaller binary than other BLAS-like alternatives,
co-designed with Tensor abstractions in C++, Python, Rust, JavaScript, GoLang, & Swift.
authors:
- family-names: Vardanian
given-names: Ash
alias: ashvardanian
orcid: https://orcid.org/0000-0002-4882-1815
references:
- type: article
title: "SciPy Distances up to 200x Faster with AVX-512 & SVE"
url: https://ashvardanian.com/posts/simsimd-faster-scipy/
date-published: 2023-10-07
authors:
- family-names: Vardanian
given-names: Ash
- type: article
title: "GCC Compiler vs Human - 119x Faster Assembly"
url: https://ashvardanian.com/posts/gcc-12-vs-avx512fp16/
date-published: 2023-10-23
authors:
- family-names: Vardanian
given-names: Ash
- type: article
title: "Accelerating JavaScript Arrays by 10x for Vector Search"
url: https://ashvardanian.com/posts/javascript-ai-vector-search/
date-published: 2023-10-21
authors:
- family-names: Vardanian
given-names: Ash
- type: article
title: "Python, C, Assembly - 2'500x Faster Cosine Similarity"
url: https://ashvardanian.com/posts/python-c-assembly-comparison/
date-published: 2023-10-30
authors:
- family-names: Vardanian
given-names: Ash
- type: article
title: "NumPy vs BLAS: Losing 90% of Throughput"
url: https://ashvardanian.com/posts/numpy-vs-blas-costs/
date-published: 2024-03-12
authors:
- family-names: Vardanian
given-names: Ash
- type: article
title: "5x Faster Set Intersections: SVE2, AVX-512, & NEON"
url: https://ashvardanian.com/posts/simd-set-intersections-sve2-avx512/
date-published: 2024-09-16
authors:
- family-names: Vardanian
given-names: Ash
- type: article
title: "Understanding SIMD: Infinite Complexity of Trivial Problems"
url: https://ashvardanian.com/posts/understanding-simd-complexity/
date-published: 2024-11-25
authors:
- family-names: Vardanian
given-names: Ash
- type: article
title: "Hiding x86 Port Latency for 330 GB/s/core Reductions"
url: https://ashvardanian.com/posts/cpu-ports/
date-published: 2025-01-19
authors:
- family-names: Vardanian
given-names: Ash