Tanguy Pruvot
d2f836119a
vstudio: remove remains, move sha in tree
...
+ small code fixes
8 years ago
Tanguy Pruvot
61231bc66c
fix various memory leaks on algo switch
8 years ago
Tanguy Pruvot
73f6720121
whirlpool: restore old source code for SM 3.0
...
SM 3.0 implementation need a manual define in whirlpool.cu...
alexis variant is 2x slower on SM3.0 (GT 740)
8 years ago
Tanguy Pruvot
18637503f2
whirlpool: upgrade with alexis variant (2x)
...
to check on older cards... maybe breaks some
8 years ago
Tanguy Pruvot
c66e8622b3
api: report per thread cpu hash checks (ACC/REJ)
...
+ update all algos for that...
8 years ago
Tanguy Pruvot
0ff75791e5
migrate 2nd nonce storage of most algos
...
This allow to keep pdata[19] as cursor between scans, and later, to sort them..
remains... heavy, scrypt, sia...
8 years ago
Tanguy Pruvot
9eead77027
diff: show by default, rework shares diff storage
...
This will allow later more gpu candidates.
Note: This is an unfinished work, we keep the previous behavior for now
To finish this, all algos solutions should be migrated and submitted nonces attributes stored.
Its required to handle the different share diff per nonce and fix the possible solved count error (if 1/2 nonces is solved).
8 years ago
Tanguy Pruvot
34e97bf3e6
Show intensity on init for all algos
8 years ago
Tanguy Pruvot
0a0fd33cac
attempt to reduce shared mem errors
8 years ago
Tanguy Pruvot
a237601747
1.7.1 release
...
set schedule flags to reduce linux cpu usage without MyStreamSynchronize()
9 years ago
Tanguy Pruvot
76a22479b1
whirlpool midstate and debug/trace defines
...
+ new cuda_debug.cuh include to trace gpu data
Happy new year!
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
9 years ago
Tanguy Pruvot
394d50987c
x15/whirlpool: improve hash rate and reduce cpu usage
...
also do some define's clean up
9 years ago
Tanguy Pruvot
61ff92b5b4
never interrupt global benchmark with found nonces
...
fix some algo weird hashrates (like blake)
and reset device between algos, for better accuracy
but this reset doesnt seems enough to bench all algos correctly...
to test on linux, could be a driver issue...
heavy: fix first alloc and indent with tabs...
9 years ago
Tanguy Pruvot
8d4d4d65ce
cuda: header for common kernel functions (quark/x11)
...
Was thinking about doing that since months ;) lets go
9 years ago
Tanguy Pruvot
d43dc9a021
use blake512 sp kernels on SM 5+ (80+64)
...
import and keep my code for older archs, like skein 64
reduce the gap between our versions...
+150kH x11 GTX 960 / +30kH 750Ti
+900kH quark GTX 960 / +230kH 750Ti
9 years ago
Tanguy Pruvot
355b835ae0
benchmark: enhance the mem leak detection
...
reduce "false" warnings, and ignore unrelated/small ones <= 1 MB
On windows the gpu memory can be allocated by other processes
+ some cleanup in algos... (free/gpulog)
9 years ago
Tanguy Pruvot
5bf1f98200
various fixes for SM 2.1 and the benchmark
...
X11+ algos and quark are not compatible for the moment
but these ones are :
Benchmark results for Gigabyte GTX 460 (SM 2.1 / 1 GB):
blakecoin : 159090.5 kH/s, 1 MB, 1048576 thr.
blake : 70208.9 kH/s, 1 MB, 1048576 thr.
bmw : 122802.6 kH/s, 65 MB, 2097152 thr.
deep : 3533.6 kH/s, 33 MB, 524288 thr.
fugue256 : 43177.9 kH/s, 17 MB, 524288 thr.
heavy : 4118.2 kH/s, 147 MB, 524032 thr.
keccak : 18673.1 kH/s, 129 MB, 2097152 thr.
luffa : 28816.0 kH/s, 257 MB, 4194304 thr.
lyra2 : 213.7 kH/s, 570 MB, 65536 thr.
mjollnir : 3895.6 kH/s, 147 MB, 524032 thr.
nist5 : 1101.4 kH/s, 67 MB, 1048576 thr.
penta : 501.6 kH/s, 21 MB, 327680 thr.
skein : 5432.4 kH/s, 65 MB, 1048576 thr.
skein2 : 6788.9 kH/s, 33 MB, 524288 thr.
whirlpool : 688.5 kH/s, 33 MB, 524288 thr.
zr5 : 122.5 kH/s, 86 MB, 262144 thr.
9 years ago
Tanguy Pruvot
9dfa757dc7
warn on cuda errors + various small changes
...
The full benchmark can now be launched with "ccminer --benchmark"
add a new helper function which log a warning with last cuda error
(not shown with the quiet option) : CUDA_LOG_ERROR();
it can be used where miner.h is included (.c/.cpp/.cu)
fix x14 (in ccminer.cpp), a break was missing in switch..case
9 years ago
Tanguy Pruvot
d195f2e8a2
intensity: do not reduce throughput before init
...
Else the memory allocated could be less than required later
btw, use the new "cuda" function to apply intensity/throughput
9 years ago
Tanguy Pruvot
ab5cc7162e
refactor: create bench.cpp and algos.h
...
Also enhance multi-thread benchmark synchro. with pthread barriers
9 years ago
Tanguy Pruvot
c2214091ae
benchmark: free last memory leaks on algo switch
...
remains my original lyra2 implementation to fix... (cuda_lyra2.cu)
I guess some kind of memory overflow force the driver to allocate
memory... but was unable to free it without device reset.
9 years ago
Tanguy Pruvot
4e1e03b891
benchmark: store all algos results + cuda fixes
...
Note: lyra2, lyra2v2 and script seems to have problems
to coexist with other algos... to run after some of them...
moved lyra2 first and skip scrypt/jane for the moment...
Only stored in memory for now.. to display a table after the bench
ccminer -a auto --benchmark
Results may be exported later to a json file...
9 years ago
Tanguy Pruvot
922c2a5cd7
algos: free allocated mem for algo switch
...
All can be freed propertly now, except script (reset) and lyra2 (leak)
9 years ago
Tanguy Pruvot
ee93927fac
diff: use the new function in all algos
9 years ago
Tanguy Pruvot
5f12943de5
whirlpool: add algo free function + vstudio
9 years ago
Tanguy Pruvot
b641bfdf8b
diff: rename functions like cpuminer-multi
...
more proper, intuitive...
9 years ago
Tanguy Pruvot
3f589cc4db
restore the whirlpool algo
9 years ago
Tanguy Pruvot
e1c4b3042c
algos: add functions to free allocated resources
...
Will be used later for algo switching
not really tested yet...
9 years ago
Tanguy Pruvot
5308898d1c
start v1.7, apply new prototypes to all algos
9 years ago
Tanguy Pruvot
3788fef9fd
Restore whirlpoolx algo
9 years ago
Tanguy Pruvot
42bcb91ca0
x11: update sp luffa/cube to get closer x11 speeds..
...
i had to clean it... lot of unused defines...
10 years ago
Tanguy Pruvot
298573b45d
Drop Whirlpool and whirpoolx algos, no more used...
...
Vanillacoin is now PoS only
10 years ago
Tanguy Pruvot
2f541065fb
cuda_helper: rename correctly hiword/loword functions
10 years ago
Tanguy Pruvot
2113be6eec
blake80: some changes and launch bounds, no perf changes
10 years ago
Tanguy Pruvot
3d3f2e2cb5
warnings: use the right device id (device_map[thr_id])
10 years ago
Tanguy Pruvot
5988e945ef
some cleanup and longpoll changes
10 years ago
KlausT
ae8e863591
remove uint32_t cast
10 years ago
Tanguy Pruvot
d6a421ae10
whirlpoolx: windows stay fluent with intensity 20
10 years ago
Tanguy Pruvot
ebd23bcc66
whirlpoolx: real fix for multi gpus
...
Main problem was the arrays allocations which should be made per cpu
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
10 years ago
Tanguy Pruvot
10d4f4a96a
whirlpoolx: try to fix linux hashrates
...
bug reported by PVMining with a lot of cards (not seen here)
10 years ago
Tanguy Pruvot
f8f46bfe5b
whirlpoolx: fix for multi-gpu support
...
add also cputest entry and do some cleanup...
10 years ago
Tanguy Pruvot
0019936328
Import Whirlpoolx algo from alexis78 repo
...
remove trailing spaces and move to x15 folder
10 years ago
Tanguy Pruvot
e6112e878d
cleanup: use unsigned throughput parameters
...
Yes, its a big commit, was waiting 1.6 to do that...
Sorry for your possible merge issues ;)
10 years ago
Tanguy Pruvot
26b51a557b
Allow different intensity per device
...
and clean the old variables, no more required
10 years ago
Tanguy Pruvot
2a5233f56e
api: report throughput when default
10 years ago
Tanguy Pruvot
cafd4477d7
Handle a maximum of 16 gpus (vs 8 before)
...
Some cards have 2 gpus on board...
10 years ago
Tanguy Pruvot
9b5d3bf528
Fix whirl algo
10 years ago
Tanguy Pruvot
c3bdb623e8
Check and submit multiple nonces in one loop
...
Added to most algos, checkhash function scans a big range
and can find multiple nonces at once if the difficulty is low.
Stop ignoring them, submit second one if found...
Clean the draft code for rc=2 implemented for blake and pentablake
btw... fix the reduced displayed hashrate when a nonce is found...
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
10 years ago
Tanguy Pruvot
118a6be361
checkhash: simplify the common function
...
use klaus trivial function, the old code has always been a bit weird..
split cuda_check_cpu_hash_64 in two functions, keep old for branched stuff
10 years ago
KlausT
cc9df2d361
use ROTL32 in x14_shabal
...
no impacts on perf but easier to read
10 years ago