Tanguy Pruvot
2f541065fb
cuda_helper: rename correctly hiword/loword functions
10 years ago
Tanguy Pruvot
2113be6eec
blake80: some changes and launch bounds, no perf changes
10 years ago
Tanguy Pruvot
3d3f2e2cb5
warnings: use the right device id (device_map[thr_id])
10 years ago
Tanguy Pruvot
5988e945ef
some cleanup and longpoll changes
10 years ago
KlausT
ae8e863591
remove uint32_t cast
10 years ago
Tanguy Pruvot
d6a421ae10
whirlpoolx: windows stay fluent with intensity 20
10 years ago
Tanguy Pruvot
ebd23bcc66
whirlpoolx: real fix for multi gpus
...
Main problem was the arrays allocations which should be made per cpu
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
10 years ago
Tanguy Pruvot
10d4f4a96a
whirlpoolx: try to fix linux hashrates
...
bug reported by PVMining with a lot of cards (not seen here)
10 years ago
Tanguy Pruvot
f8f46bfe5b
whirlpoolx: fix for multi-gpu support
...
add also cputest entry and do some cleanup...
10 years ago
Tanguy Pruvot
0019936328
Import Whirlpoolx algo from alexis78 repo
...
remove trailing spaces and move to x15 folder
10 years ago
Tanguy Pruvot
e6112e878d
cleanup: use unsigned throughput parameters
...
Yes, it's a big commit, was waiting for 1.6 to do that...
Sorry for your possible merge issues ;)
10 years ago
Tanguy Pruvot
26b51a557b
Allow different intensity per device
...
and clean the old variables, no longer required
10 years ago
Tanguy Pruvot
2a5233f56e
api: report throughput when default
10 years ago
Tanguy Pruvot
cafd4477d7
Handle a maximum of 16 gpus (vs 8 before)
...
Some cards have 2 gpus on board...
10 years ago
Tanguy Pruvot
9b5d3bf528
Fix whirl algo
10 years ago
Tanguy Pruvot
c3bdb623e8
Check and submit multiple nonces in one loop
...
Added to most algos, checkhash function scans a big range
and can find multiple nonces at once if the difficulty is low.
Stop ignoring them, submit second one if found...
Clean the draft code for rc=2 implemented for blake and pentablake
btw... fix the reduced displayed hashrate when a nonce is found...
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
10 years ago
Tanguy Pruvot
118a6be361
checkhash: simplify the common function
...
use klaus trivial function, the old code has always been a bit weird..
split cuda_check_cpu_hash_64 in two functions, keep old for branched stuff
10 years ago
KlausT
cc9df2d361
use ROTL32 in x14_shabal
...
no impacts on perf but easier to read
10 years ago
Tanguy Pruvot
8ad180cc70
various small changes
...
heavy: reduce by 256 threads default intensity to all -i 20
cuda: put static thread init bools outside the code (made once)
api: fix nvml header to build without
10 years ago
Tanguy Pruvot
6ae28162db
various extern cleanup + api history uids and gpu SM
...
uids could be useful to create graphs from history data
Note: please do a clean build after this commit (changes in miner.h)
10 years ago
sp-hash
f0d91ab8a6
Luffa and simd merged to one kernel.
...
Small echo rewrite. +10KHASH on the 650(compute 3.0)
tpruvot: add Linux Makefile - Force to 80 registers (else -30KH/s)
Note : the hashrate seems more constant with this change
10 years ago
sp-hash
26b9fe3586
faster x15, +23KH or 4ms on whirlpool (30ms vs 34ms)
...
tpruvot: I didn't pick the asm replace_hiword, slower on linux
10 years ago
Tanguy Pruvot
73f22b237a
Prepare trap of hardware/mem failures
10 years ago
Tanguy Pruvot
fe4ad36b73
intensity: sign warnings fixes min(i,u)
10 years ago
Tanguy Pruvot
c859041993
quark/blake512 opt. pointed by sp without asm
...
indeed, the pragma unroll doesn't always make things faster
asm part... to check later
10 years ago
Tanguy Pruvot
438308b3a2
Rework benchmark mode and min/max range
...
Was maybe my fault, but the benchmark mode was
always recomputing from nonce 0.
Also fix blake if -d 1 is used (one thread but second gpu)
stats: do not use thread id as key, prefer gpu id...
10 years ago
Tanguy Pruvot
b128312efb
cuda: store device SM in a global var
...
sample usage made for blake and fugue (higher intensity for SM5.2)
add these to cuda_helper and clean unused code
10 years ago
Tanguy Pruvot
11c5ec810d
Handle intensity param in all algos
...
and add a check related to start/max nonce params
10 years ago
Tanguy Pruvot
7cc5222394
Move common check_cpu functions to root
10 years ago
Tanguy Pruvot
95ac1d0f19
x11: adapt some blake 256 opts to 512 one
...
blake512: for the moment 6.2ms vs 7.12 before (+10%)
10 years ago
Tanguy Pruvot
10314d844f
whirlpool: remove dead code, win 2ms in final hash
10 years ago
Tanguy Pruvot
64e8cd3f98
add x17 algo, cleaned djm34 commit
...
todo: visual studio...
10 years ago
Tanguy Pruvot
5cf024cc5d
whirl: add missing merkle for some pools
10 years ago
Tanguy Pruvot
b4e690b486
sources: switch to UTF-8
10 years ago
Tanguy Pruvot
7d430edc25
x15: optimize by 2ms (39ms to 37)
...
and clean whirlpool ifdefs, cost too much to keep both methods
10 years ago
Tanguy Pruvot
1fbcbbacc4
Add whirlcoin and optimize x11 luffa (maxrregcount)
10 years ago
Tanguy Pruvot
4bc23048b5
x15: use djm34 code with asm xor64 + my rot64
...
some optimizations could be done later, after whirlcoin integration
10 years ago
Tanguy Pruvot
194fda87c1
x11: restore simd host2dev memcpytosymbol to reduce used cmem
...
Remove define attempts for SM 2.1 devices, fermi is not compatible
10 years ago
Tanguy Pruvot
d9ea5f72ce
Remove duplicated defines present in cuda_helper.h
...
also add cudaDeviceReset() on Ctrl+C for nvprof
10 years ago
Tanguy Pruvot
e1aa028b9f
x15: optimize, adds some kH/s
10 years ago
Tanguy Pruvot
a9a3ad8afc
cuda: check for errors on cuda mem alloc
10 years ago
Tanguy Pruvot
35c0eb5512
x15: optimize the algo
10 years ago
Tanguy Pruvot
81d7e0421d
x15 now works
10 years ago
Tanguy Pruvot
06763c20b1
Implement x14 (cuda + cpu functions)
...
Project was updated for VS2013 and CUDA SDK 6.5
add also a --cputest function to dump cpu hash results
TODO: x15 is not fully functional, but first loop seems ok
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
10 years ago