Age | Commit message | Author | Lines
41 hours | TS: dos2unix; indent -linux (HEAD, main) | Birte Kristina Friesel | -308/+412
41 hours | VA: dos2unix; indent -linux | Birte Kristina Friesel | -642/+767
41 hours | MLP: dos2unix; indent -linux | Birte Kristina Friesel | -228/+314
41 hours | HST-S: indent -linux | Birte Kristina Friesel | -794/+924
41 hours | GEMV: indent -linux | Birte Kristina Friesel | -496/+614
41 hours | COUNT: indent -linux | Birte Kristina Friesel | -548/+664
41 hours | COUNT: valgrind-ws support (WiP) | Birte Kristina Friesel | -23/+60
41 hours | BS: dos2unix; indent -linux | Birte Kristina Friesel | -539/+632
41 hours | BS baseline: Add valgrind-ws support | Birte Kristina Friesel | -2/+31
41 hours | BFS: indent -linux | Birte Kristina Friesel | -629/+792
41 hours | BFS: Port NMC version (WiP) | Birte Kristina Friesel | -25/+69
41 hours | BFS: port baseline (wip) | Birte Kristina Friesel | -6/+61
41 hours | MLP baseline: indent -linux | Birte Kristina Friesel | -143/+157
41 hours | MLP baseline: Add NUMA and ws support; actually use parameters | Birte Kristina Friesel | -49/+144
4 days | store input size as long | Birte Kristina Friesel | -8/+8
4 days | VA: Add valgrind-ws benchmark script and baseline adjustments | Birte Kristina Friesel | -7/+49
7 days | SpMV baseline: Add optional native=0 flag; switch to -O3 | Birte Kristina Friesel | -1/+15
7 days | GEMV baseline: Add optional native=0 flag | Birte Kristina Friesel | -7/+14
8 days | TRNS: Support compilation without -march=native | Birte Kristina Friesel | -13/+19
8 days | BS: Randomize elements and queries | Birte Kristina Friesel | -3/+5
8 days | BS: Support CPU baseline compilation without -march=native | Birte Kristina Friesel | -10/+21
10 days | BS, GEMV, TRNS, TS, VA: Add basic perf stats | Birte Kristina Friesel | -0/+76
To Do: Right now, the stats include benchmark setup. They should not.
2024-12-16 | CPU-DPU: Add NoDMC'25 benchmark scripts | Birte Kristina Friesel | -0/+71
2024-12-16 | COUNT: Add VaMos'25 benchmark script | Birte Kristina Friesel | -0/+43
2024-12-06 | BFS, MLP, SpMV, UNI: compile baselines with -march=native | Birte Kristina Friesel | -5/+8
2024-11-15 | STREAM: Re-enable -flto | Birte Kristina Friesel | -1/+1
SDK 2024.1.0 shipped a compiler bug. With 2024.2.0, it is working again.
2024-11-15 | CPU-DPU microbenchmark: use default number of threads per pool | Birte Kristina Friesel | -6/+2
2024-11-14 | README: Document changes made to benchmarks | Birte Kristina Friesel | -0/+32
2024-10-30 | README: dimes24: Add PDF and DOI links | Birte Kristina Friesel | -1/+2
2024-10-30 | README: Add citation information | Birte Kristina Friesel | -1/+3
2024-10-28 | README: Add Git repo links | Birte Kristina Friesel | -0/+4
2024-10-28 | COUNT: Add baseline | Birte Kristina Friesel | -0/+246
2024-10-24 | Update README | Birte Kristina Friesel | -2/+6
2024-10-24 | Add COUNT benchmark (based on SEL) | Birte Kristina Friesel | -0/+645
2024-10-11 | README: fix typo | Birte Kristina Friesel | -1/+1
2024-09-27 | README: -v; add references | Birte Kristina Friesel | -2/+49
2024-08-23 | Make STREAM Microbenchmark work with SDK 2024.1.0 | Birte Kristina Friesel | -1/+1
For whatever reason, -flto causes a "Heap Full" error even in an otherwise empty program:

    * thread #1, name = 'DPUthread0', stop reason = fault 1 (Heap Full)
      * frame #0: 0x80000350 dpu_code`mem_alloc_nolock(size=64) at alloc.c:32:5
        frame #1: 0x80000170 dpu_code`main_kernel1 [inlined] mem_alloc(size=64) at alloc.c:52:21
        frame #2: 0x80000168 dpu_code`main_kernel1 at add.c:73
        frame #3: 0x80000308 dpu_code`main at add.c:42:12
        frame #4: 0x80000050 dpu_code`__bootstrap at crt0.c:36:5

I do not know why this is the case.
2024-08-19 | HST-S: Add memcpy variant and update HBM eval script | Birte Kristina Friesel | -69/+139
2024-08-15 | GEMV: measure each write (dpu_push_xfer call) separately | Birte Kristina Friesel | -17/+37
2024-07-28 | TRNS DPU: Correctly initialize task so that it's actually repeatable | Birte Kristina Friesel | -0/+1
2024-07-28 | TRNS host: set active_dpus_before at proper location | Birte Kristina Friesel | -1/+1
2024-07-26 | TRNS: decrease number of repetitions | Birte Kristina Friesel | -5/+5
2024-07-26 | GEMV: specify SDK version | Birte Kristina Friesel | -0/+2
2024-07-26 | CPU-DPU: specify SDK version | Birte Kristina Friesel | -1/+5
2024-07-26 | TS: re-add --resume | Birte Kristina Friesel | -12/+11
2024-07-26 | TRNS host: do not needlessly re-allocate DPUs | Birte Kristina Friesel | -0/+1
2024-07-26 | VA hbm: reduce number of repetitions | Birte Kristina Friesel | -1/+1
2024-07-25 | VA nmc: explicitly source SDK | Birte Kristina Friesel | -0/+2
2024-07-25 | TRNS: fix write1 documentation | Birte Kristina Friesel | -4/+4
2024-07-25 | TRNS: specify memcpy node | Birte Kristina Friesel | -10/+32