Tanguy Pruvot
3d3f2e2cb5
warnings: use the right device id (device_map[thr_id])
2015-04-23 09:41:56 +02:00
Tanguy Pruvot
5988e945ef
some cleanup and longpoll changes
2015-03-27 08:03:26 +01:00
KlausT
ae8e863591
remove uint32_t cast
2015-03-12 01:01:47 +01:00
Tanguy Pruvot
d6a421ae10
whirlpoolx: windows stay fluent with intensity 20
2015-03-08 23:00:59 +01:00
Tanguy Pruvot
ebd23bcc66
whirlpoolx: real fix for multi gpus
...
Main problem was the arrays allocations which should be made per cpu
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
2015-03-08 22:56:04 +01:00
Tanguy Pruvot
10d4f4a96a
whirlpoolx: try to fix linux hashrates
...
bug reported by PVMining with a lot of cards (not seen here)
2015-03-08 14:21:39 +01:00
Tanguy Pruvot
f8f46bfe5b
whirlpoolx: fix for multi-gpu support
...
add also cputest entry and do some cleanup...
2015-03-07 16:33:04 +01:00
Tanguy Pruvot
0019936328
Import Whirlpoolx algo from alexis78 repo
...
remove trailing spaces and move to x15 folder
2015-03-07 14:06:33 +01:00
Tanguy Pruvot
e6112e878d
cleanup: use unsigned throughput parameters
...
Yes, its a big commit, was waiting 1.6 to do that...
Sorry for your possible merge issues ;)
2015-02-28 14:05:09 +01:00
Tanguy Pruvot
26b51a557b
Allow different intensity per device
...
and clean the old variables, no more required
2015-01-24 11:17:29 +01:00
Tanguy Pruvot
2a5233f56e
api: report throughput when default
2015-01-22 06:28:59 +01:00
Tanguy Pruvot
cafd4477d7
Handle a maximum of 16 gpus (vs 8 before)
...
Some cards have 2 gpus on board...
2015-01-22 04:55:27 +01:00
Tanguy Pruvot
9b5d3bf528
Fix whirl algo
2014-12-19 09:16:51 +01:00
Tanguy Pruvot
c3bdb623e8
Check and submit multiple nonces in one loop
...
Added to most algos, checkhash function scans a big range
and can find multiple nonces at once if the difficulty is low.
Stop ignoring them, submit second one if found...
Clean the draft code for rc=2 implemented for blake and pentablake
btw... fix the reduced displayed hashrate when a nonce is found...
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
2014-12-05 15:53:40 +00:00
Tanguy Pruvot
118a6be361
checkhash: simplify the common function
...
use klaus trivial function, the old code has always been a bit weird..
split cuda_check_cpu_hash_64 in two functions, keep old for branched stuff
2014-12-01 00:20:40 +01:00
KlausT
cc9df2d361
use ROTL32 in x14_shabal
...
no impacts on perf but easier to read
2014-11-30 20:31:19 +01:00
Tanguy Pruvot
8ad180cc70
various small changes
...
heavy: reduce by 256 threads default intensity to all -i 20
cuda: put static thread init bools outside the code (made once)
api: fix nvml header to build without
2014-11-28 20:57:35 +01:00
Tanguy Pruvot
6ae28162db
various extern cleanup + api history uids and gpu SM
...
uids could be useful to create graphes from history data
Note: please do a clean build after this commit (changes in miner.h)
2014-11-26 11:55:42 +01:00
sp-hash
f0d91ab8a6
Luffa and simd merged to one kernal.
...
Small echo rewrite. +10KHASH on the 650(compute 3.0)
tpruvot: add Linux Makefile - Force to 80 registers (else -30KH/s)
Note : the hashrate seems more constant with this change
2014-11-23 07:04:07 +01:00
sp-hash
26b9fe3586
faster x15, +23KH or 4ms on whirpool (30ms vs 34ms)
...
tpruvot: i didnt pick the asm replace_hiword, slower on linux
2014-11-20 19:19:27 +01:00
Tanguy Pruvot
73f22b237a
Prepare trap of hardware/mem failures
2014-11-20 18:44:25 +01:00
Tanguy Pruvot
fe4ad36b73
intensity: sign warnings fixes min(i,u)
2014-11-17 14:48:55 +01:00
Tanguy Pruvot
c859041993
quark/blake512 opt. pointed by sp without asm
...
indeed, the pragma unroll doesnt always make things faster
asm part... to check later
2014-11-17 00:01:32 +01:00
Tanguy Pruvot
438308b3a2
Rework benchmark mode and min/max range
...
Was maybe my fault, but the benchmark mode was
always recomputing from nonce 0.
Also fix blake if -d 1 is used (one thread but second gpu)
stats: do not use thread id as key, prefer gpu id...
2014-11-16 23:28:18 +01:00
Tanguy Pruvot
b128312efb
cuda: store device SM in a global var
...
sample usage made for blake and fugue (higher intensity for SM5.2)
add these to cuda_helper and clean unused code
2014-11-11 19:11:16 +01:00
Tanguy Pruvot
11c5ec810d
Handle intensity param in all algos
...
and add a check related to start/max nounce params
2014-11-09 22:27:32 +01:00
Tanguy Pruvot
7cc5222394
Move common check_cpu functions to root
2014-09-10 00:27:01 +02:00
Tanguy Pruvot
95ac1d0f19
x11: adapt some blake 256 opts to 512 one
...
blake512: for the moment 6.2ms vs 7.12 before (+10%)
2014-09-09 17:55:07 +02:00
Tanguy Pruvot
10314d844f
whirlpool: remove dead code, win 2ms in final hash
2014-08-24 04:44:18 +02:00
Tanguy Pruvot
64e8cd3f98
add x17 algo, cleaned djm34 commit
...
todo: visual studio...
2014-08-23 22:44:17 +02:00
Tanguy Pruvot
5cf024cc5d
whirl: add missing merkle for some pools
2014-08-23 01:44:47 +02:00
Tanguy Pruvot
b4e690b486
sources: swith to UTF-8
2014-08-21 08:27:48 +02:00
Tanguy Pruvot
7d430edc25
x15: optimize by 2ms (39ms to 37)
...
and clean whirlpool ifdefs, cost too much to keep both methods
2014-08-20 22:00:26 +02:00
Tanguy Pruvot
1fbcbbacc4
Add whirlcoin and optimize x11 luffa (maxrregcount)
2014-08-20 07:49:22 +02:00
Tanguy Pruvot
4bc23048b5
x15: use djm34 code with asm xor64 + my rot64
...
some optimizations could be done later, after whirlcoin integration
2014-08-20 05:54:47 +02:00
Tanguy Pruvot
194fda87c1
x11: restore simd host2dev memcpytosymbol to reduce used cmem
...
Remove define attempts for SM 2.1 devices, fermi is not compatible
2014-08-19 18:32:14 +02:00
Tanguy Pruvot
d9ea5f72ce
Remove duplicated defines present in cuda_helper.h
...
also add cudaDeviceReset() on Ctrl+C for nvprof
2014-08-19 03:29:11 +02:00
Tanguy Pruvot
e1aa028b9f
x15: optimize, adds some kH/s
2014-08-18 00:58:44 +02:00
Tanguy Pruvot
a9a3ad8afc
cuda: check for errors on cuda mem alloc
2014-08-17 22:41:05 +02:00
Tanguy Pruvot
35c0eb5512
x15: optimize the algo
2014-08-15 03:34:27 +02:00
Tanguy Pruvot
81d7e0421d
x15 now works
2014-08-15 00:11:59 +02:00
Tanguy Pruvot
06763c20b1
Implement x14 (cuda + cpu functions)
...
Project was updated for VS2013 and CUDA SDK 6.5
add also a --cputest function to dump cpu hash results
TODO: x15 is not fully functional, but first loop seems ok
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
2014-08-12 14:47:03 +02:00