summaryrefslogtreecommitdiff
path: root/VA
AgeCommit message (Collapse)AuthorLines
5 daysstore input size as longBirte Kristina Friesel-8/+8
5 daysVA: Add valgrind-ws benchmark script and baseline adjustmentsBirte Kristina Friesel-7/+49
11 daysBS, GEMV, TRNS, TS, VA: Add basic perf statsBirte Kristina Friesel-0/+6
To Do: Right now, the stats include benchmark setup. They should not.
2024-07-26VA hbm: reduce number of repetitionsBirte Kristina Friesel-1/+1
2024-07-25VA nmc: explicitly source SDKBirte Kristina Friesel-0/+2
2024-07-23VA: remove single-node with in/out on same nodeBirte Kristina Friesel-13/+3
2024-07-22VA baseline: configurable memcpy NUMA bindingBirte Kristina Friesel-18/+50
2024-07-22VA dimes-hetsim-nmc: variable input nodeBirte Kristina Friesel-2/+2
2024-07-17VA: do not count move_pages towards timerBirte Kristina Friesel-1/+1
2024-07-17VA hetsim nmc: nitsBirte Kristina Friesel-5/+5
2024-07-17VA memcpy: HBM supportBirte Kristina Friesel-25/+63
2024-07-17VA dimes benchmarks: re-introduce --resume; add memcpy baselineBirte Kristina Friesel-25/+31
2024-07-17VA: Remove legacy run scriptsBirte Kristina Friesel-136/+0
2024-07-17VA: Add baseline variant with memcpy overhead (always use local input data)Birte Kristina Friesel-5/+81
2024-07-15Revert "VA CPU version: log latency"Birte Kristina Friesel-2/+2
This reverts commit 92b065344bfec4a529c821380ddbce65f92eee78.
2024-07-15VA, GEMV: Add roofline benchmarksBirte Kristina Friesel-0/+25
2024-07-12VA CPU version: log latencyBirte Kristina Friesel-2/+2
2024-07-10VA: nitsBirte Kristina Friesel-13/+13
2024-07-05VA: log NUMA node of allocated rank(s)Birte Kristina Friesel-32/+52
2024-07-05VA baseline: use strong scaling by defaultBirte Kristina Friesel-1/+1
2024-07-04VA: write directly to output; no teeBirte Kristina Friesel-3/+3
2024-07-04VA: add separate benchmark scripts for NMC and HBMBirte Kristina Friesel-28/+100
2024-07-04VA: Remove legacy SDK_SINGLETHREADED macroBirte Kristina Friesel-12/+5
2024-07-04VA: Add NUMA baseline variantBirte Kristina Friesel-24/+145
2024-03-11VA: handle alloc/load/free overhead as parameters; vary them for fgbs24aBirte Kristina Friesel-18/+11
2024-02-28some fgbs24a scriptsBirte Kristina Friesel-0/+28
2024-02-28VA: n_elements_per_dpuBirte Kristina Friesel-2/+2
2023-11-27VA: revert prim benchmarks to original configurations; host-specific logsBirte Kristina Friesel-8/+61
2023-11-27VA: add dpuinfo and singlethreaded optionsBirte Kristina Friesel-6/+29
singlethreading does not seem to work at all. Funny SDK.
2023-11-27VA: determine number of ranksBirte Kristina Friesel-0/+3
2023-11-21VA CPU baseline: rename misleading file_size to input_sizeBirte Kristina Friesel-6/+6
2023-11-21VA basilen: … watBirte Kristina Friesel-1/+0
Fix MOpps output
2023-11-21UNI, VA: fix compilation without alloc/load/free benchmarksBirte Kristina Friesel-3/+3
2023-11-20VA: more detail outputBirte Kristina Friesel-7/+33
2023-11-20VA: optionally include dpu_{alloc,load,free} overhead in benchmark outputBirte Kristina Friesel-42/+96
2023-09-21VA: remove a stray newlineBirte Kristina Friesel-1/+1
2023-07-26update config spaceBirte Friesel-5/+6
2023-06-02VA cpu: limit to DPU typeDaniel Friesel-5/+9
2023-05-31VA: Correctly calculate PIM kernel throughputDaniel Friesel-2/+2
2023-05-31VA: cleanupDaniel Friesel-7/+10
2023-05-31VA: run.sh: up to 88 threads; strong scalingDaniel Friesel-2/+9
2023-05-30VA: extend config spaceDaniel Friesel-2/+4
2023-05-30VA: new dfatool format; -O and reproduction benchmarksDaniel Friesel-15/+65
2023-05-25VA: re-add expDaniel Friesel-3/+7
2023-05-25re-add (and, in some cases, fix) -x supportDaniel Friesel-2/+6
2023-05-25port VA to dfatoolDaniel Friesel-58/+88
2023-05-24VA: port to dfatoolDaniel Friesel-21/+97
2021-06-16PrIM -- first commitJuan Gomez Luna-0/+784