Commit Graph

338 Commits

Author SHA1 Message Date
Tanguy Pruvot
768b5ccb76 import bmw512 uint2 changes from sp
+ some cleanup... 15KH/s won (750Ti)
2015-01-24 08:02:41 +01:00
Tanguy Pruvot
9f2dd3ee60 Remove some useless conversions
do not impact perfs neither...
2015-01-24 08:00:22 +01:00
Tanguy Pruvot
45206e49c1 hamsi: TPB of 128 give better results (+10kh) 2015-01-24 07:17:12 +01:00
Tanguy Pruvot
bba2df56d5 cpu-affinity: linux thread affinity was not set correctly 2015-01-23 04:59:19 +01:00
Tanguy Pruvot
833238cae5 cpu-affinity: fix windows process cpu affinity mask 2015-01-23 02:49:30 +01:00
Tanguy Pruvot
dfa9f63886 api: report intensity and throughput for each thread
even when no parameter is set (default set in each algo code)
2015-01-22 06:36:22 +01:00
Tanguy Pruvot
2a5233f56e api: report throughput when default 2015-01-22 06:28:59 +01:00
Tanguy Pruvot
bec89724d5 logs: prevent multiple warnings 2015-01-22 05:55:50 +01:00
Tanguy Pruvot
cafd4477d7 Handle a maximum of 16 gpus (vs 8 before)
Some cards have 2 gpus on board...
2015-01-22 04:55:27 +01:00
Tanguy Pruvot
65cefa4a7d cpu-priority: fix for windows process 2015-01-22 03:09:03 +01:00
Tanguy Pruvot
c3b6162899 Add cpu priority and affinity parameters
priority param value is from 0 (default idle like before) to 5 (highest)

affinity could be enhanced to set multiple cores... todo

Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
2015-01-22 02:08:05 +01:00
Tanguy Pruvot
6500160fe1 api: add cuda threads count 2015-01-19 00:59:25 +01:00
Tanguy Pruvot
b521acb480 groestl: use sp bitslice enhancement, prepare SM 2.x variant
todo: simd512 SM 2.x variant (shfl op), and groestl/myriad functions
2015-01-19 00:42:14 +01:00
Tanguy Pruvot
29c3db5dc6 fix linux configure 2015-01-17 20:10:50 +01:00
Tanguy Pruvot
ec454dd78c api: add fan rpm (windows nvapi only) 2014-12-26 15:33:55 +01:00
Tanguy Pruvot
f761531436 Fix to build with nvcc in path (crux) 2014-12-26 15:28:32 +00:00
Tanguy Pruvot
5cbf239d8a api: add kernel version for linux 2014-12-26 15:20:03 +00:00
Tanguy Pruvot
b3188669e2 lyra2: cleanup
quickly tested with a SM 3.0 binary...
2014-12-20 13:10:33 +01:00
Tanguy Pruvot
da2e2528a7 uint2: fix SM 3.0 ROR and ROL
Not sure its the fastest way, but it works for offsets 0-63 + 64

Also note than asm SM 3.5+ doesn't support ROR with offset 64
2014-12-19 21:45:40 +01:00
Tanguy Pruvot
2dce6733a7 pentablake: cleanup 2014-12-19 20:05:55 +01:00
Tanguy Pruvot
f08bd79a6e Intelligent duplicate shares check
Only enabled on first duplicate share
2014-12-19 12:49:04 +01:00
KlausT
756ea89ecd lyra2 +100kh for GTX970 2014-12-19 11:59:00 +01:00
Tanguy Pruvot
150d6eb290 qubit: fix wrong pool difficulty
hmm... this is weird, should be 256 but make bad shares on pool diff 2
2014-12-19 09:43:35 +01:00
Tanguy Pruvot
90efbdcece simd cleanup 2014-12-19 09:16:55 +01:00
Tanguy Pruvot
ec5a48f420 x11: small simd512 gpu_expand improvement 2014-12-19 09:16:55 +01:00
Tanguy Pruvot
a66d78e692 reduce lyra2 blake and pentablake cpu load 2014-12-19 09:16:55 +01:00
Tanguy Pruvot
010eba1760 start 1.5.2 dev 2014-12-19 09:16:55 +01:00
Tanguy Pruvot
9b5d3bf528 Fix whirl algo 2014-12-19 09:16:51 +01:00
Tanguy Pruvot
ac9b0d27ee Release final 1.5.1 2014-12-17 01:30:42 +01:00
Tanguy Pruvot
63e3387dbb lyra2: add sm30 device compat (skein256) 2014-12-16 14:19:07 +01:00
Tanguy Pruvot
1e24e4899c skein: uint2 optimisation with SM 3.0 compat (+15KH)
Thanks to sp and djm34 for this fast uint64 storage alternative
2014-12-16 13:52:54 +01:00
Tanguy Pruvot
2585e10814 keccak uint2 optimisation for SM>3.0 (x11 +40KH/s)
based on djm34 keccak 256-bit changes, and keep SM3.0 compat

affect most other algos too (quark, nist5, x13...)
2014-12-15 11:34:03 +01:00
Tanguy Pruvot
fa7d744a6c lyra2: make_uint2 and set pool difficulty 2014-12-15 09:48:27 +01:00
Tanguy Pruvot
7a0f7ab276 solo: stop submit of solved blocs (stale work)
rejects in solo mode often means "generated block is stale" (see Debug.log)

Most of current wallets only reports "rejected" reason without explanations.

So prevent this common error,
you can turn off this gbt feature (height check) with --no-gbt option.
2014-12-08 06:31:07 +01:00
Tanguy Pruvot
d09ec41251 api: allow -b <ip>, and set bind retry to 20sec
before, only -b <ip:port>, <port> or 0 (disable) was allowed

easier to set -b 0.0.0.0 with default port
2014-12-08 04:13:41 +01:00
Tanguy Pruvot
70743eb48d solo: get bloc height and trap more errors
and disable multiple nonce on wallets,
a bloc cant be resolved twice ;)

Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
2014-12-07 16:53:05 +01:00
Tanguy Pruvot
49a73971c4 Enhance stale work detection + throughput fixes
seems to resolve solo mining lock on share.
export also computed solo work diff in api (not perfect)

In high rate algos, throughput should be unsigned...
This fixes keccak, blake and doom problems

And change terminal color of debug lines, to be selectable in putty,
color code is not supported in windows but selection is ok there.
2014-12-07 12:58:41 +01:00
Tanguy Pruvot
f9bba0ff1a lyra2: small changes and code cleanup 2014-12-07 09:04:03 +01:00
Tanguy Pruvot
ef8a73d6aa keccak: not compatible with second nonces (was broken)
Use djm34 new uint2 method to get a +40% boost (115 to 153MH/s)
2014-12-06 13:55:13 +01:00
Tanguy Pruvot
78f2dd6ffd lyra2: reduce default intensity on windows 2014-12-06 13:03:25 +01:00
Tanguy Pruvot
5db46b74b5 lyra2: fix cpu hash + validation 2014-12-06 11:51:25 +01:00
Tanguy Pruvot
c5b349e079 Add Lyra2 algo, based on Vertcoin published code
Seems to be djm34 work, i recognize the code style ;)

Code was cleaned/indented and adapted to my fork...

Only usable on the test pool until 16 december 2014!
2014-12-06 11:28:26 +01:00
Tanguy Pruvot
6c7fce187b x11: use KlausT optimisation (+20 KHs)
But use a define in AES to use or not device initial memcpy

I already tried to use everywhere direct device constants
and its not faster for big arrays (difference is small)

also change launch bounds to reduce spills (72 regs)

to check on windows too, could improve the perf... or not
2014-12-06 04:14:36 +01:00
Tanguy Pruvot
c3bdb623e8 Check and submit multiple nonces in one loop
Added to most algos, checkhash function scans a big range
and can find multiple nonces at once if the difficulty is low.

Stop ignoring them, submit second one if found...

Clean the draft code for rc=2 implemented for blake and pentablake

btw... fix the reduced displayed hashrate when a nonce is found...

Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
2014-12-05 15:53:40 +00:00
Tanguy Pruvot
f387898ead Prepare multiple nonces support in one loop (if found)
Tested on x11 which find sometimes 3 nonces in one call,
actually they are ignored because only the biggest was kept...

This commit doesnt fix that, but will allow to enhance shares rate later...
2014-12-05 10:16:06 +01:00
Tanguy Pruvot
a186a4df8b websocket: enhance histo sample for 2 GPUs 2014-12-03 09:53:23 +01:00
Tanguy Pruvot
543de0a73c api: add support for websocket calls
Allow to directly get api data in HTML5

Tested on Chrome... IE>=10 required, not tested

IE11 seems buggy on connection close...  todo

Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
2014-12-03 08:35:27 +01:00
Tanguy Pruvot
2dcf983290 solo: remove remains of stale debug logs 2014-12-01 06:05:03 +01:00
Tanguy Pruvot
56ed0fed05 blake: remove hashharder workaround 2014-12-01 05:58:01 +01:00
Tanguy Pruvot
118a6be361 checkhash: simplify the common function
use klaus trivial function, the old code has always been a bit weird..

split cuda_check_cpu_hash_64 in two functions, keep old for branched stuff
2014-12-01 00:20:40 +01:00