index
:
prim-benchmarks
main
Processing-in-Memory Benchmarks
derf
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
VA
Age
Commit message (
Expand
)
Author
Lines
2025-05-14
VA: Change default to exp=1 (strong scaling)
Birte Kristina Friesel
-2
/
+2
2025-05-14
VA: Add benchmark script
Birte Kristina Friesel
-0
/
+24
2025-05-14
Use benchmark-specific input descriptions rather than opaque "#elements"
Birte Kristina Friesel
-8
/
+10
2025-05-13
TRNS, VA: global params are not required
Birte Kristina Friesel
-5
/
+4
2025-05-13
VA: Aspect Header: Use aspect-provided input_size
Birte Kristina Friesel
-2
/
+2
2025-05-13
tracing: provide input_size and element_size in derived aspect
Birte Kristina Friesel
-2
/
+12
2025-05-13
VA: Use common timer and timing aspect headers
Birte Kristina Friesel
-199
/
+29
2025-05-13
VA: AspectC++ support
Birte Kristina Friesel
-46
/
+206
2025-01-16
VA: dos2unix; indent -linux
Birte Kristina Friesel
-642
/
+767
2025-01-13
store input size as long
Birte Kristina Friesel
-8
/
+8
2025-01-13
VA: Add valgrind-ws benchmark script and baseline adjustments
Birte Kristina Friesel
-7
/
+49
2025-01-07
BS, GEMV, TRNS, TS, VA: Add basic perf stats
Birte Kristina Friesel
-0
/
+6
2024-07-26
VA hbm: reduce number of repetitions
Birte Kristina Friesel
-1
/
+1
2024-07-25
VA nmc: explicitly source SDK
Birte Kristina Friesel
-0
/
+2
2024-07-23
VA: remove single-node with in/out on same node
Birte Kristina Friesel
-13
/
+3
2024-07-22
VA baseline: configurable memcpy NUMA binding
Birte Kristina Friesel
-18
/
+50
2024-07-22
VA dimes-hetsim-nmc: variable input node
Birte Kristina Friesel
-2
/
+2
2024-07-17
VA: do not count move_pages towards timer
Birte Kristina Friesel
-1
/
+1
2024-07-17
VA hetsim nmc: nits
Birte Kristina Friesel
-5
/
+5
2024-07-17
VA memcpy: HBM support
Birte Kristina Friesel
-25
/
+63
2024-07-17
VA dimes benchmarks: re-introduce --resume; add memcpy baseline
Birte Kristina Friesel
-25
/
+31
2024-07-17
VA: Remove legacy run scripts
Birte Kristina Friesel
-136
/
+0
2024-07-17
VA: Add baseline variant with memcpy overhead (always use local input data)
Birte Kristina Friesel
-5
/
+81
2024-07-15
Revert "VA CPU version: log latency"
Birte Kristina Friesel
-2
/
+2
2024-07-15
VA, GEMV: Add roofline benchmarks
Birte Kristina Friesel
-0
/
+25
2024-07-12
VA CPU version: log latency
Birte Kristina Friesel
-2
/
+2
2024-07-10
VA: nits
Birte Kristina Friesel
-13
/
+13
2024-07-05
VA: log NUMA node of allocated rank(s)
Birte Kristina Friesel
-32
/
+52
2024-07-05
VA baseline: use strong scaling by default
Birte Kristina Friesel
-1
/
+1
2024-07-04
VA: write directly to output; no tee
Birte Kristina Friesel
-3
/
+3
2024-07-04
VA: add separate benchmark scripts for NMC and HBM
Birte Kristina Friesel
-28
/
+100
2024-07-04
VA: Remove legacy SDK_SINGLETHREADED macro
Birte Kristina Friesel
-12
/
+5
2024-07-04
VA: Add NUMA baseline variant
Birte Kristina Friesel
-24
/
+145
2024-03-11
VA: handle alloc/load/free overhead as parameters; vary them for fgbs24a
Birte Kristina Friesel
-18
/
+11
2024-02-28
some fgbs24a scripts
Birte Kristina Friesel
-0
/
+28
2024-02-28
VA: n_elements_per_dpu
Birte Kristina Friesel
-2
/
+2
2023-11-27
VA: revert prim benchmarks to original configurations; host-specific logs
Birte Kristina Friesel
-8
/
+61
2023-11-27
VA: add dpuinfo and singlethreaded options
Birte Kristina Friesel
-6
/
+29
2023-11-27
VA: determine number of ranks
Birte Kristina Friesel
-0
/
+3
2023-11-21
VA CPU baseline: rename misleading file_size to input_size
Birte Kristina Friesel
-6
/
+6
2023-11-21
VA basilen: … wat
Birte Kristina Friesel
-1
/
+0
2023-11-21
UNI, VA: fix compilation without alloc/load/free benchmarks
Birte Kristina Friesel
-3
/
+3
2023-11-20
VA: more detail output
Birte Kristina Friesel
-7
/
+33
2023-11-20
VA: optionally include dpu_{alloc,load,free} overhead in benchmark output
Birte Kristina Friesel
-42
/
+96
2023-09-21
VA: remove a stray newline
Birte Kristina Friesel
-1
/
+1
2023-07-26
update config space
Birte Friesel
-5
/
+6
2023-06-02
VA cpu: limit to DPU type
Daniel Friesel
-5
/
+9
2023-05-31
VA: Correctly calculate PIM kernel throughput
Daniel Friesel
-2
/
+2
2023-05-31
VA: cleanup
Daniel Friesel
-7
/
+10
2023-05-31
VA: run.sh: up to 88 threads; strong scaling
Daniel Friesel
-2
/
+9
[next]