In-place matrix transposition (TRNS)
Compilation instructions
make
Execution instructions
./trns -w 0 -r 1 -m 16 -n 8 -o 4096 -p 2556 -i 64
For more options
./trns -h
Read more
J. Gomez-Luna et al., “In-place Matrix Transposition on GPUs,” IEEE TPDS, 2016.
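As background, the basic idea of in-place transposition can be sketched on the CPU with the classic cycle-following approach. This is not the GPU algorithm from the paper above and is not taken from the trns source; the function name `transpose_in_place`, the row-major layout, and the use of a visited bitmap are all assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical helper, not part of trns: transposes a rows x cols
// row-major matrix stored in `a`. Afterwards `a` holds the
// cols x rows transpose, also in row-major order.
template <typename T>
void transpose_in_place(std::vector<T>& a, std::size_t rows, std::size_t cols) {
    assert(a.size() == rows * cols);
    const std::size_t total = rows * cols;
    if (total < 2) return;

    // One flag per element marks positions that already hold their
    // final value. This costs total bits of extra storage.
    std::vector<bool> moved(total, false);

    // The first and last elements never move, so skip them.
    for (std::size_t start = 1; start + 1 < total; ++start) {
        if (moved[start]) continue;

        T carried = a[start];
        std::size_t i = start;
        do {
            // The element at (r, c), index r*cols + c, belongs at
            // index c*rows + r, which equals (i * rows) mod (total - 1).
            // Note: i * rows can overflow for very large matrices.
            const std::size_t dest = (i * rows) % (total - 1);
            std::swap(carried, a[dest]);
            moved[dest] = true;
            i = dest;
        } while (i != start);
    }
}
```

Calling `transpose_in_place(m, 2, 3)` on a vector holding 1 2 3 4 5 6 leaves it holding 1 4 2 5 3 6. The bitmap keeps the sketch short; it trades a small amount of auxiliary memory for not having to detect cycle leaders arithmetically.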