Age | Commit message (Collapse) | Author | Lines | |
---|---|---|---|---|
2025-05-14 | VA: Change default to exp=1 (strong scaling) | Birte Kristina Friesel | -2/+2 | |
2025-05-14 | VA: Add benchmark script | Birte Kristina Friesel | -0/+24 | |
2025-05-14 | Use benchmark-specific input descriptions rather than opaque "#elements" | Birte Kristina Friesel | -8/+10 | |
2025-05-13 | TRNS, VA: global params are not required | Birte Kristina Friesel | -5/+4 | |
2025-05-13 | VA: Aspect Header: Use aspect-provided input_size | Birte Kristina Friesel | -2/+2 | |
2025-05-13 | tracing: provide input_size and element_size in derived aspect | Birte Kristina Friesel | -2/+12 | |
2025-05-13 | VA: Use common timer and timing aspect headers | Birte Kristina Friesel | -199/+29 | |
2025-05-13 | VA: AspectC++ support | Birte Kristina Friesel | -46/+206 | |
2025-01-16 | VA: dos2unix; indent -linux | Birte Kristina Friesel | -642/+767 | |
2025-01-13 | store input size as long | Birte Kristina Friesel | -8/+8 | |
2025-01-13 | VA: Add valgrind-ws benchmark script and baseline adjustments | Birte Kristina Friesel | -7/+49 | |
2025-01-07 | BS, GEMV, TRNS, TS, VA: Add basic perf stats | Birte Kristina Friesel | -0/+6 | |
To Do: Right now, the stats include benchmark setup. They should not. | ||||
2024-07-26 | VA hbm: reduce number of repetitions | Birte Kristina Friesel | -1/+1 | |
2024-07-25 | VA nmc: explicitly source SDK | Birte Kristina Friesel | -0/+2 | |
2024-07-23 | VA: remove single-node with in/out on same node | Birte Kristina Friesel | -13/+3 | |
2024-07-22 | VA baseline: configurable memcpy NUMA binding | Birte Kristina Friesel | -18/+50 | |
2024-07-22 | VA dimes-hetsim-nmc: variable input node | Birte Kristina Friesel | -2/+2 | |
2024-07-17 | VA: do not count move_pages towards timer | Birte Kristina Friesel | -1/+1 | |
2024-07-17 | VA hetsim nmc: nits | Birte Kristina Friesel | -5/+5 | |
2024-07-17 | VA memcpy: HBM support | Birte Kristina Friesel | -25/+63 | |
2024-07-17 | VA dimes benchmarks: re-introduce --resume; add memcpy baseline | Birte Kristina Friesel | -25/+31 | |
2024-07-17 | VA: Remove legacy run scripts | Birte Kristina Friesel | -136/+0 | |
2024-07-17 | VA: Add baseline variant with memcpy overhead (always use local input data) | Birte Kristina Friesel | -5/+81 | |
2024-07-15 | Revert "VA CPU version: log latency" | Birte Kristina Friesel | -2/+2 | |
This reverts commit 92b065344bfec4a529c821380ddbce65f92eee78. | ||||
2024-07-15 | VA, GEMV: Add roofline benchmarks | Birte Kristina Friesel | -0/+25 | |
2024-07-12 | VA CPU version: log latency | Birte Kristina Friesel | -2/+2 | |
2024-07-10 | VA: nits | Birte Kristina Friesel | -13/+13 | |
2024-07-05 | VA: log NUMA node of allocated rank(s) | Birte Kristina Friesel | -32/+52 | |
2024-07-05 | VA baseline: use strong scaling by default | Birte Kristina Friesel | -1/+1 | |
2024-07-04 | VA: write directly to output; no tee | Birte Kristina Friesel | -3/+3 | |
2024-07-04 | VA: add separate benchmark scripts for NMC and HBM | Birte Kristina Friesel | -28/+100 | |
2024-07-04 | VA: Remove legacy SDK_SINGLETHREADED macro | Birte Kristina Friesel | -12/+5 | |
2024-07-04 | VA: Add NUMA baseline variant | Birte Kristina Friesel | -24/+145 | |
2024-03-11 | VA: handle alloc/load/free overhead as parameters; vary them for fgbs24a | Birte Kristina Friesel | -18/+11 | |
2024-02-28 | some fgbs24a scripts | Birte Kristina Friesel | -0/+28 | |
2024-02-28 | VA: n_elements_per_dpu | Birte Kristina Friesel | -2/+2 | |
2023-11-27 | VA: revert prim benchmarks to original configurations; host-specific logs | Birte Kristina Friesel | -8/+61 | |
2023-11-27 | VA: add dpuinfo and singlethreaded options | Birte Kristina Friesel | -6/+29 | |
singlethreading does not seem to work at all. Funny SDK. | ||||
2023-11-27 | VA: determine number of ranks | Birte Kristina Friesel | -0/+3 | |
2023-11-21 | VA CPU baseline: rename misleading file_size to input_size | Birte Kristina Friesel | -6/+6 | |
2023-11-21 | VA basilen: … wat | Birte Kristina Friesel | -1/+0 | |
Fix MOpps output | ||||
2023-11-21 | UNI, VA: fix compilation without alloc/load/free benchmarks | Birte Kristina Friesel | -3/+3 | |
2023-11-20 | VA: more detail output | Birte Kristina Friesel | -7/+33 | |
2023-11-20 | VA: optionally include dpu_{alloc,load,free} overhead in benchmark output | Birte Kristina Friesel | -42/+96 | |
2023-09-21 | VA: remove a stray newline | Birte Kristina Friesel | -1/+1 | |
2023-07-26 | update config space | Birte Friesel | -5/+6 | |
2023-06-02 | VA cpu: limit to DPU type | Daniel Friesel | -5/+9 | |
2023-05-31 | VA: Correctly calculate PIM kernel throughput | Daniel Friesel | -2/+2 | |
2023-05-31 | VA: cleanup | Daniel Friesel | -7/+10 | |
2023-05-31 | VA: run.sh: up to 88 threads; strong scaling | Daniel Friesel | -2/+9 | |