path: root/VA
Age | Commit message | Author | Lines
2024-07-17 | VA: do not count move_pages towards timer | Birte Kristina Friesel | -1/+1
2024-07-17 | VA hetsim nmc: nits | Birte Kristina Friesel | -5/+5
2024-07-17 | VA memcpy: HBM support | Birte Kristina Friesel | -25/+63
2024-07-17 | VA dimes benchmarks: re-introduce --resume; add memcpy baseline | Birte Kristina Friesel | -25/+31
2024-07-17 | VA: Remove legacy run scripts | Birte Kristina Friesel | -136/+0
2024-07-17 | VA: Add baseline variant with memcpy overhead (always use local input data) | Birte Kristina Friesel | -5/+81
2024-07-15 | Revert "VA CPU version: log latency" | Birte Kristina Friesel | -2/+2
2024-07-15 | VA, GEMV: Add roofline benchmarks | Birte Kristina Friesel | -0/+25
2024-07-12 | VA CPU version: log latency | Birte Kristina Friesel | -2/+2
2024-07-10 | VA: nits | Birte Kristina Friesel | -13/+13
2024-07-05 | VA: log NUMA node of allocated rank(s) | Birte Kristina Friesel | -32/+52
2024-07-05 | VA baseline: use strong scaling by default | Birte Kristina Friesel | -1/+1
2024-07-04 | VA: write directly to output; no tee | Birte Kristina Friesel | -3/+3
2024-07-04 | VA: add separate benchmark scripts for NMC and HBM | Birte Kristina Friesel | -28/+100
2024-07-04 | VA: Remove legacy SDK_SINGLETHREADED macro | Birte Kristina Friesel | -12/+5
2024-07-04 | VA: Add NUMA baseline variant | Birte Kristina Friesel | -24/+145
2024-03-11 | VA: handle alloc/load/free overhead as parameters; vary them for fgbs24a | Birte Kristina Friesel | -18/+11
2024-02-28 | some fgbs24a scripts | Birte Kristina Friesel | -0/+28
2024-02-28 | VA: n_elements_per_dpu | Birte Kristina Friesel | -2/+2
2023-11-27 | VA: revert prim benchmarks to original configurations; host-specific logs | Birte Kristina Friesel | -8/+61
2023-11-27 | VA: add dpuinfo and singlethreaded options | Birte Kristina Friesel | -6/+29
2023-11-27 | VA: determine number of ranks | Birte Kristina Friesel | -0/+3
2023-11-21 | VA CPU baseline: rename misleading file_size to input_size | Birte Kristina Friesel | -6/+6
2023-11-21 | VA basilen: … wat | Birte Kristina Friesel | -1/+0
2023-11-21 | UNI, VA: fix compilation without alloc/load/free benchmarks | Birte Kristina Friesel | -3/+3
2023-11-20 | VA: more detail output | Birte Kristina Friesel | -7/+33
2023-11-20 | VA: optionally include dpu_{alloc,load,free} overhead in benchmark output | Birte Kristina Friesel | -42/+96
2023-09-21 | VA: remove a stray newline | Birte Kristina Friesel | -1/+1
2023-07-26 | update config space | Birte Friesel | -5/+6
2023-06-02 | VA cpu: limit to DPU type | Daniel Friesel | -5/+9
2023-05-31 | VA: Correctly calculate PIM kernel throughput | Daniel Friesel | -2/+2
2023-05-31 | VA: cleanup | Daniel Friesel | -7/+10
2023-05-31 | VA: run.sh: up to 88 threads; strong scaling | Daniel Friesel | -2/+9
2023-05-30 | VA: extend config space | Daniel Friesel | -2/+4
2023-05-30 | VA: new dfatool format; -O and reproduction benchmarks | Daniel Friesel | -15/+65
2023-05-25 | VA: re-add exp | Daniel Friesel | -3/+7
2023-05-25 | re-add (and, in some cases, fix) -x support | Daniel Friesel | -2/+6
2023-05-25 | port VA to dfatool | Daniel Friesel | -58/+88
2023-05-24 | VA: port to dfatool | Daniel Friesel | -21/+97
2021-06-16 | PrIM -- first commit | Juan Gomez Luna | -0/+784