Tanguy Pruvot
61231bc66c
fix various memory leaks on algo switch
8 years ago
Tanguy Pruvot
73f6720121
whirlpool: restore old source code for SM 3.0
SM 3.0 implementation needs a manual define in whirlpool.cu...
alexis variant is 2x slower on SM 3.0 (GT 740)
8 years ago
Tanguy Pruvot
18637503f2
whirlpool: upgrade with alexis variant (2x)
to check on older cards... may break some
8 years ago
Tanguy Pruvot
76a22479b1
whirlpool midstate and debug/trace defines
+ new cuda_debug.cuh include to trace gpu data
Happy new year!
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
9 years ago
Tanguy Pruvot
394d50987c
x15/whirlpool: improve hash rate and reduce cpu usage
also do some cleanup of the defines
9 years ago
Tanguy Pruvot
9dfa757dc7
warn on cuda errors + various small changes
The full benchmark can now be launched with "ccminer --benchmark"
add a new helper function which logs a warning with the last CUDA error
(not shown with the quiet option) : CUDA_LOG_ERROR();
it can be used where miner.h is included (.c/.cpp/.cu)
fix x14 (in ccminer.cpp), a break was missing in switch..case
9 years ago
Tanguy Pruvot
c2214091ae
benchmark: free last memory leaks on algo switch
my original lyra2 implementation remains to be fixed... (cuda_lyra2.cu)
I guess some kind of memory overflow forces the driver to allocate
memory... but I was unable to free it without a device reset.
9 years ago
Tanguy Pruvot
2f541065fb
cuda_helper: correctly rename hiword/loword functions
10 years ago
Tanguy Pruvot
e6112e878d
cleanup: use unsigned throughput parameters
Yes, it's a big commit, was waiting for 1.6 to do that...
Sorry for your possible merge issues ;)
10 years ago
Tanguy Pruvot
cafd4477d7
Handle a maximum of 16 gpus (vs 8 before)
Some cards have 2 gpus on board...
10 years ago
Tanguy Pruvot
6ae28162db
various extern cleanup + api history uids and gpu SM
uids could be useful to create graphs from history data
Note: please do a clean build after this commit (changes in miner.h)
10 years ago
sp-hash
26b9fe3586
faster x15, +23KH or 4ms on whirlpool (30ms vs 34ms)
tpruvot: I didn't pick the asm replace_hiword, slower on Linux
10 years ago
Tanguy Pruvot
10314d844f
whirlpool: remove dead code, win 2ms in final hash
10 years ago
Tanguy Pruvot
64e8cd3f98
add x17 algo, cleaned djm34 commit
todo: visual studio...
10 years ago
Tanguy Pruvot
5cf024cc5d
whirl: add missing merkle for some pools
10 years ago
Tanguy Pruvot
7d430edc25
x15: optimize by 2ms (39ms to 37)
and clean whirlpool ifdefs, it costs too much to keep both methods
10 years ago
Tanguy Pruvot
4bc23048b5
x15: use djm34 code with asm xor64 + my rot64
some optimizations could be done later, after whirlcoin integration
10 years ago
Tanguy Pruvot
d9ea5f72ce
Remove duplicated defines present in cuda_helper.h
also add cudaDeviceReset() on Ctrl+C for nvprof
10 years ago
Tanguy Pruvot
e1aa028b9f
x15: optimize, adds some kH/s
10 years ago
Tanguy Pruvot
35c0eb5512
x15: optimize the algo
10 years ago
Tanguy Pruvot
81d7e0421d
x15 now works
10 years ago
Tanguy Pruvot
06763c20b1
Implement x14 (cuda + cpu functions)
Project was updated for VS2013 and CUDA SDK 6.5
also add a --cputest function to dump CPU hash results
TODO: x15 is not fully functional, but first loop seems ok
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
10 years ago