Transtec.de
===============

CUDA Device #0
Major revision number         =3
Minor revision number         =5
Name                          =Tesla K20m
Total global memory           =5032706048
Total shared memory per block =49152
Total registers per block     =65536
Warp size                     =32
Maximum memory pitch          =2147483647
Maximum threads per block     =1024
Maximum dimension 0 of block =1024
Maximum dimension 1 of block =1024
Maximum dimension 2 of block =64
Maximum dimension 0 of grid =2147483647
Maximum dimension 1 of grid =65535
Maximum dimension 2 of grid =65535
Clock rate                    =705500
Total constant memory         =65536
Texture alignment             =512
Concurrent copy and execution =Yes
Number of multiprocessors     =13
Kernel execution timeout      =No
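
The listing above gives raw values without units. As a reading aid, the short sketch below converts two of them, assuming the usual CUDA device-property conventions (global memory reported in bytes, clock rate in kHz). The variable names are illustrative and do not come from the original tool.

```python
# Raw values copied from the device listing above.
total_global_mem_bytes = 5032706048   # assumed unit: bytes
clock_rate_khz = 705500               # assumed unit: kHz

# Convert to more readable units.
global_mem_gib = total_global_mem_bytes / 1024**3
clock_rate_mhz = clock_rate_khz / 1000

print(f"Global memory: {global_mem_gib:.2f} GiB")
print(f"Clock rate:    {clock_rate_mhz:.1f} MHz")
```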

128-byte alignment has no significant influence with a data size of 8 bits (the two runs differ by about 1.6%)
without alignment: 1  : 296.826 s
with alignment:    128: 292.207 s
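
To put the two timings in proportion, the relative difference can be computed directly. This is a minimal sketch using only the two figures reported above; it assumes both are in seconds (the second figure is given without a unit), and the variable names are illustrative.

```python
# Timings reported above (assumed to be seconds for both runs).
t_unaligned = 296.826   # alignment of 1 byte
t_aligned = 292.207     # alignment of 128 bytes

# Absolute and relative gain of the aligned run over the unaligned one.
gain_s = t_unaligned - t_aligned
gain_pct = 100.0 * gain_s / t_unaligned

print(f"Gain: {gain_s:.3f} s ({gain_pct:.2f} %)")
```

A gap this small comes from a single run of each variant, so it may lie within run-to-run variation; repeated measurements would be needed to tell.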
