summaryrefslogtreecommitdiff
AgeCommit message (Collapse)AuthorLines
2025-05-09TRNS: Add CCMCC25 benchmark scriptBirte Kristina Friesel-0/+75
2025-05-09TRNS: Add tracing and timing aspects for dfatool behaviour modelsBirte Kristina Friesel-22/+156
2025-05-09TRNS: Guard dfatool output behind a compile-time flagBirte Kristina Friesel-15/+29
2025-05-09TRNS: Remove unused WITH_{ALLOC,LOAD,FREE}_OVERHEAD Makefile optionsBirte Kristina Friesel-4/+1
2025-05-09TRNS: timer.h: dos2unixBirte Kristina Friesel-66/+74
2025-01-17SpMV: indent -linuxBirte Kristina Friesel-541/+681
2025-01-17SpMV host: Add missing \nBirte Kristina Friesel-1/+1
2025-01-16TRNS baseline: Add valgrind-ws supportBirte Kristina Friesel-6/+21
2025-01-16TS: dos2unix; indent -linuxBirte Kristina Friesel-308/+412
2025-01-16VA: dos2unix; indent -linuxBirte Kristina Friesel-642/+767
2025-01-16MLP: dos2unix; indent -linuxBirte Kristina Friesel-228/+314
2025-01-16HST-S: indent -linuxBirte Kristina Friesel-794/+924
2025-01-16GEMV: indent -linuxBirte Kristina Friesel-496/+614
2025-01-16COUNT: indent -linuxBirte Kristina Friesel-548/+664
2025-01-16COUNT: valgrind-ws support (WiP)Birte Kristina Friesel-23/+60
2025-01-16BS: dos2unix; indent -linuxBirte Kristina Friesel-539/+632
2025-01-16BS baseline: Add valgrind-ws supportBirte Kristina Friesel-2/+31
2025-01-16BFS: indent -linuxBirte Kristina Friesel-629/+792
2025-01-16BFS: Port NMC version (WiP)Birte Kristina Friesel-25/+69
2025-01-16BFS: port baseline (wip)Birte Kristina Friesel-6/+61
2025-01-16MLP baseline: indent -linuxBirte Kristina Friesel-143/+157
2025-01-16MLP baseline: Add NUMA and ws support; actually use parametersBirte Kristina Friesel-49/+144
2025-01-13store input size as longBirte Kristina Friesel-8/+8
2025-01-13VA: Add valgrind-ws benchmark script and baseline adjustmentsBirte Kristina Friesel-7/+49
2025-01-10SpMV baseline: Add optional native=0 flag; switch to -O3Birte Kristina Friesel-1/+15
2025-01-10GEMV baseline: Add optional native=0 flagBirte Kristina Friesel-7/+14
2025-01-10TRNS: Support compilation without -march=nativeBirte Kristina Friesel-13/+19
2025-01-10BS: Randomize elements and queriesBirte Kristina Friesel-3/+5
2025-01-10BS: Support CPU baseline compilation without -march=nativeBirte Kristina Friesel-10/+21
2025-01-07BS, GEMV, TRNS, TS, VA: Add basic perf statsBirte Kristina Friesel-0/+76
To Do: Right now, the stats include benchmark setup. They should not.
2024-12-16CPU-DPU: Add NoDMC'25 benchmark scriptsBirte Kristina Friesel-0/+71
2024-12-16COUNT: Add VaMos'25 benchmark scriptBirte Kristina Friesel-0/+43
2024-12-06BFS, MLP, SpMV, UNI: compile baselines with -march=nativeBirte Kristina Friesel-5/+8
2024-11-15STREAM: Re-enable -fltoBirte Kristina Friesel-1/+1
SDK 2024.1.0 shipped a compiler bug. With 2024.2.0, it is working again.
2024-11-15CPU-DPU microbenchmark: use default number of threads per poolBirte Kristina Friesel-6/+2
2024-11-14README: Document changes made to benchmarksBirte Kristina Friesel-0/+32
2024-10-30README: dimes24: Add PDF and DOI linksBirte Kristina Friesel-1/+2
2024-10-30README: Add citation informationBirte Kristina Friesel-1/+3
2024-10-28README: Add Git repo linksBirte Kristina Friesel-0/+4
2024-10-28COUNT: Add baselineBirte Kristina Friesel-0/+246
2024-10-24Update READMEBirte Kristina Friesel-2/+6
2024-10-24Add COUNT benchmark (based on SEL)Birte Kristina Friesel-0/+645
2024-10-11README: fix typoBirte Kristina Friesel-1/+1
2024-09-27README: -v; add referencesBirte Kristina Friesel-2/+49
2024-08-23Make STREAM Microbenchmark work with SDK 2024.1.0Birte Kristina Friesel-1/+1
For whatever reason, -flto causes a "Heap Full" error even in an otherwise empty program: * thread #1, name = 'DPUthread0', stop reason = fault 1 (Heap Full) * frame #0: 0x80000350 dpu_code`mem_alloc_nolock(size=64) at alloc.c:32:5 frame #1: 0x80000170 dpu_code`main_kernel1 [inlined] mem_alloc(size=64) at alloc.c:52:21 frame #2: 0x80000168 dpu_code`main_kernel1 at add.c:73 frame #3: 0x80000308 dpu_code`main at add.c:42:12 frame #4: 0x80000050 dpu_code`__bootstrap at crt0.c:36:5 I do not know why this is the case.
2024-08-19HST-S: Add memcpy variant and update HBM eval scriptBirte Kristina Friesel-69/+139
2024-08-15GEMV: measure each write (dpu_push_xfer call) separatelyBirte Kristina Friesel-17/+37
2024-07-28TRNS DPU: Correctly initalize task so that it's actually repeatableBirte Kristina Friesel-0/+1
2024-07-28TRNS host: set active_dpus_before at proper locationBirte Kristina Friesel-1/+1
2024-07-26TRNS: decrease number of repetitionsBirte Kristina Friesel-5/+5