Tanguy Pruvot
|
def9888bd5
|
xmr: prefer 32bit uint4 and smaller offsets in core
also prefer ulong2 shared load to be closer to the ptx
|
8 years ago |
Tanguy Pruvot
|
588c7ba361
|
xmr: dont use shared mem hack, windows dont like
|
8 years ago |
Tanguy Pruvot
|
bd030db5d1
|
xmr: vectors rewrite, now the phase2 is using only 40 regs
no more constant memory used for aes.
tested only on linux cuda 8 for now... wip
|
8 years ago |
Tanguy Pruvot
|
e231343060
|
xmr: make it more smooth on windows with defaults
also improve a bit the 750 ti on linux...
|
8 years ago |
Tanguy Pruvot
|
93adb56c8e
|
handle cryptonight light variant
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
|
8 years ago |
Tanguy Pruvot
|
2479ffaaa2
|
xmr: fix decimal diff + aes cleanup
change default launch config to -l 32x16 to handle the 750 Ti better
not definitive, doing tests..
|
8 years ago |
Tanguy Pruvot
|
c1f1ad9280
|
xmr: stabilize the final kernel
|
8 years ago |
Tanguy Pruvot
|
066a569357
|
import xmr, to finish
todo: fix jh cuda and wrong decimal diff (0xffff problem ?)
|
8 years ago |