Tanguy Pruvot
2f541065fb
cuda_helper: rename correctly hiword/loword functions
10 years ago
Tanguy Pruvot
e6112e878d
cleanup: use unsigned throughput parameters
...
Yes, it's a big commit; I was waiting for 1.6 to do that...
Sorry for any possible merge issues ;)
10 years ago
Tanguy Pruvot
cafd4477d7
Handle a maximum of 16 gpus (vs 8 before)
...
Some cards have 2 gpus on board...
10 years ago
Tanguy Pruvot
1b65cd05cc
heavy: add error checks, fix strict aliasing and linux
...
The core problem was the CUDA hefty threads-per-block value being set too high,
but it took me several hours to find that...
btw... +25% in heavy 12500 with 256 threads per block... vs 128 & 512
if max reg count is set to 80...
10 years ago
Tanguy Pruvot
a747e4ca0f
blake512: use a new SWAPDWORDS asm func (0.05ms)
...
small improvement, do it on pentablake and heavy variants too
based on sp commit (but SWAP32 is already used for 32bit ints)
10 years ago
Tanguy Pruvot
d9ea5f72ce
Remove duplicated defines present in cuda_helper.h
...
also add cudaDeviceReset() on Ctrl+C for nvprof
10 years ago
Christian Buchner
3b21069504
bump to revision V1.1 with Killer Groestl
11 years ago
Christian Buchner
9ff175f8e6
small fixes for compilation and help
11 years ago
Christian Buchner
6c8eff98c0
bump to revision v0.8
11 years ago
Christian Buchner
0659d697c1
Revision 0.6 with myriad-groestl and jackpot coin
11 years ago
Christian Buchner
2ca6ede92b
update to version v0.5 (2014-03-27)
11 years ago
Christian Buchner
b93669a99f
committing changes to v0.3 release - added Groestlcoin.
11 years ago
Christian Buchner
6aac3fbba5
commit initial version 0.1
11 years ago