Tanguy Pruvot
e7ae27137e
x11/qubit: remove some extra MyStreamSynchronize
...
only one per loop is required to prevent 100% cpu usage
2015-04-15 05:30:22 +02:00
Tanguy Pruvot
163430daae
Skein/Skein2 SM 3.0 devices support
...
+ code cleanup
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
2015-04-15 01:27:48 +02:00
Tanguy Pruvot
d58d53f2b2
update README, small changes, prepare release 1.6.1
...
still need a SM 3.0 fix for skein...
2015-04-14 23:28:00 +02:00
Tanguy Pruvot
48515ad707
groestl: rename included cuda files
2015-04-06 23:46:34 +02:00
Tanguy Pruvot
37395eefe4
skein: restore previous x11 speed
2015-03-28 13:32:08 +01:00
Tanguy Pruvot
4f43abb402
bmw512: indent and restore SM 3.0 compat
...
could be also the source of the problem seen with CUDA 7
restored the code before sp/klaus changes for SM 3.0 devices...
2015-03-28 12:01:50 +01:00
Tanguy Pruvot
38e6672d70
Allow test of SM 2.1/3.0 binaries on newer cards
...
Implementation based on klausT work.. a bit different
This code must be placed in a common .cu file,
cuda.cpp is not compiled with nvcc and doesnt allow cuda code...
2015-03-28 12:00:53 +01:00
Tanguy Pruvot
f86784ee56
Add skein algo (Skeincoin, Myriad, Unat...)
...
SKEIN512 + SHA256
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
2015-03-27 15:24:27 +01:00
Tanguy Pruvot
a37e909db9
Add zr5 algo (for SM 3.5+)
...
uint4 copy + keccak cleanup, groestl: small uint4 opt
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
2015-03-27 15:16:25 +01:00
Tanguy Pruvot
9734186a37
jh512: import and improve klaus and sp changes
...
did not import the extra final function, which should stay compatible
with the common cuda_check_hash()
2015-03-20 05:36:40 +01:00
KlausT
ae8e863591
remove uint32_t cast
2015-03-12 01:01:47 +01:00
Tanguy Pruvot
e6112e878d
cleanup: use unsigned throughput parameters
...
Yes, its a big commit, was waiting 1.6 to do that...
Sorry for your possible merge issues ;)
2015-02-28 14:05:09 +01:00
Tanguy Pruvot
09c3ac6b4b
linux: fix missing dirname include
2015-02-11 18:36:57 +01:00
Tanguy Pruvot
2d5e8aaced
anime: fix uint2 error (bmw)
2015-02-08 18:32:42 +01:00
KlausT
a452c330dd
quark: remove unused variables
2015-02-02 10:41:14 +01:00
Tanguy Pruvot
26b51a557b
Allow different intensity per device
...
and clean the old variables, no more required
2015-01-24 11:17:29 +01:00
Tanguy Pruvot
768b5ccb76
import bmw512 uint2 changes from sp
...
+ some cleanup... 15KH/s won (750Ti)
2015-01-24 08:02:41 +01:00
Tanguy Pruvot
9f2dd3ee60
Remove some useless conversions
...
do not impact perfs neither...
2015-01-24 08:00:22 +01:00
Tanguy Pruvot
2a5233f56e
api: report throughput when default
2015-01-22 06:28:59 +01:00
Tanguy Pruvot
cafd4477d7
Handle a maximum of 16 gpus (vs 8 before)
...
Some cards have 2 gpus on board...
2015-01-22 04:55:27 +01:00
Tanguy Pruvot
b521acb480
groestl: use sp bitslice enhancement, prepare SM 2.x variant
...
todo: simd512 SM 2.x variant (shfl op), and groestl/myriad functions
2015-01-19 00:42:14 +01:00
Tanguy Pruvot
ec5a48f420
x11: small simd512 gpu_expand improvement
2014-12-19 09:16:55 +01:00
Tanguy Pruvot
1e24e4899c
skein: uint2 optimisation with SM 3.0 compat (+15KH)
...
Thanks to sp and djm34 for this fast uint64 storage alternative
2014-12-16 13:52:54 +01:00
Tanguy Pruvot
2585e10814
keccak uint2 optimisation for SM>3.0 (x11 +40KH/s)
...
based on djm34 keccak 256-bit changes, and keep SM3.0 compat
affect most other algos too (quark, nist5, x13...)
2014-12-15 11:34:03 +01:00
Tanguy Pruvot
c3bdb623e8
Check and submit multiple nonces in one loop
...
Added to most algos, checkhash function scans a big range
and can find multiple nonces at once if the difficulty is low.
Stop ignoring them, submit second one if found...
Clean the draft code for rc=2 implemented for blake and pentablake
btw... fix the reduced displayed hashrate when a nonce is found...
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
2014-12-05 15:53:40 +00:00
Tanguy Pruvot
118a6be361
checkhash: simplify the common function
...
use klaus trivial function, the old code has always been a bit weird..
split cuda_check_cpu_hash_64 in two functions, keep old for branched stuff
2014-12-01 00:20:40 +01:00
Tanguy Pruvot
c218c3f514
quark/anime: +100KH, bmw tpb was not correct
...
This small change also enhance a bit x11..17 algos
2014-11-28 22:18:48 +01:00
Tanguy Pruvot
8ad180cc70
various small changes
...
heavy: reduce by 256 threads default intensity to all -i 20
cuda: put static thread init bools outside the code (made once)
api: fix nvml header to build without
2014-11-28 20:57:35 +01:00
Tanguy Pruvot
6ae28162db
various extern cleanup + api history uids and gpu SM
...
uids could be useful to create graphes from history data
Note: please do a clean build after this commit (changes in miner.h)
2014-11-26 11:55:42 +01:00
Tanguy Pruvot
73f22b237a
Prepare trap of hardware/mem failures
2014-11-20 18:44:25 +01:00
Tanguy Pruvot
fe4ad36b73
intensity: sign warnings fixes min(i,u)
2014-11-17 14:48:55 +01:00
Tanguy Pruvot
c859041993
quark/blake512 opt. pointed by sp without asm
...
indeed, the pragma unroll doesnt always make things faster
asm part... to check later
2014-11-17 00:01:32 +01:00
Tanguy Pruvot
b128312efb
cuda: store device SM in a global var
...
sample usage made for blake and fugue (higher intensity for SM5.2)
add these to cuda_helper and clean unused code
2014-11-11 19:11:16 +01:00
Tanguy Pruvot
7a4e1bb327
Reduce keccak, deep & anime intensity + handle groestl -i param
...
default intensity was the max supported by the card, and perf is
not really better. I prefer to let it one under for cards with lower
memory (1GB)
2014-11-10 18:08:23 +01:00
Tanguy Pruvot
7acf987aba
Add intensity to last algos and fix quark speed
2014-11-10 16:56:03 +01:00
Tanguy Pruvot
11c5ec810d
Handle intensity param in all algos
...
and add a check related to start/max nounce params
2014-11-09 22:27:32 +01:00
Tanguy Pruvot
a747e4ca0f
blake512: use a new SWAPDWORDS asm func (0.05ms)
...
small improvement, do it on pentablake and heavy variants too
based on sp commit (but SWAP32 is already used for 32bit ints)
2014-11-09 01:26:55 +01:00
Tanguy Pruvot
e7beac6b1c
x11: tiny sp_ opt on jh512 (0.05ms)
...
modified a bit.. (and removed the mixed dos end of lines ^M)
also, remove the max reg count, now determined with __launch_bounds__
2014-11-09 00:20:39 +01:00
Tanguy Pruvot
db8681c1db
update readme and fix SM 3.0 build
2014-10-20 06:27:02 +02:00
Tanguy Pruvot
d8a23fa970
Tune quark part of Xn funcs
...
based on klaus commits, will increase a bit speed of most algos
PS: main increase is due to the register count tuning in Makefile
and for skein512 on linux, its the ROTL64
but almost no changes on X11 : 2648MH/s vs 2630 before
2014-10-20 03:15:17 +02:00
Tanguy Pruvot
799b230af2
enhance solo mining, update http headers
...
and prepare next version...
2014-09-28 15:34:44 +02:00
Tanguy Pruvot
7cc5222394
Move common check_cpu functions to root
2014-09-10 00:27:01 +02:00
Tanguy Pruvot
95ac1d0f19
x11: adapt some blake 256 opts to 512 one
...
blake512: for the moment 6.2ms vs 7.12 before (+10%)
2014-09-09 17:55:07 +02:00
Tanguy Pruvot
1fb9becc1f
cpu-miner: sort algos by name, show reject reason
2014-09-01 09:50:19 +02:00
Tanguy Pruvot
c17d11e377
add "blake" 256, 14 rounds (for NEOS blake, not BlakeCoin)
...
also remove "missing" file, its old and not compatible with ubuntu 14.04
2014-09-01 05:53:36 +02:00
Tanguy Pruvot
64e8cd3f98
add x17 algo, cleaned djm34 commit
...
todo: visual studio...
2014-08-23 22:44:17 +02:00
Tanguy Pruvot
b3becb67dd
groestl: small optimisation (nist5 + 100kH on a 750Ti)
...
But, almost nothing on X15, no big changes...
2014-08-22 20:18:29 +02:00
Tanguy Pruvot
a586cee493
quark: dos2unix files to reduce problems later
2014-08-21 08:15:02 +02:00
Tanguy Pruvot
1fbcbbacc4
Add whirlcoin and optimize x11 luffa (maxrregcount)
2014-08-20 07:49:22 +02:00
Tanguy Pruvot
bc2eb75758
Add fresh algo (based on djm34 code)
...
Cleaned up and adapted to my changes (cputest added)
Remove Makefile.in which should be in gitignore
(Plz refresh it with ./config.sh to compile on linux)
2014-08-19 18:31:26 +02:00
Tanguy Pruvot
d9ea5f72ce
Remove duplicated defines present in cuda_helper.h
...
also add cudaDeviceReset() on Ctrl+C for nvprof
2014-08-19 03:29:11 +02:00
Tanguy Pruvot
8e32e5ea09
cputest: add other hashes to cpu tests
2014-08-14 23:35:02 +02:00
Tanguy Pruvot
9d3d09103b
Try to restore compat with 2.1 devices (GTX 460)
2014-08-12 18:07:50 +02:00
Tanguy Pruvot
06763c20b1
Implement x14 (cuda + cpu functions)
...
Project was updated for VS2013 and CUDA SDK 6.5
add also a --cputest function to dump cpu hash results
TODO: x15 is not fully functional, but first loop seems ok
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
2014-08-12 14:47:03 +02:00
Christian Buchner
3b21069504
bump to revision V1.1 with Killer Groestl
2014-06-14 01:43:28 +02:00
Christian Buchner
ea32195a8a
v0.9: adding missing VC++ project file and fixing Win compilation
2014-05-06 16:09:59 +02:00
Christian Buchner
e049f32fee
bump to revision v0.9 (VC++ project files not updated yet)
2014-05-06 13:53:52 +02:00
Christian Buchner
61cbdc62d0
fix the dreaded FEMRI spelling mistake
2014-05-01 10:09:26 +02:00
Christian Buchner
433d653723
bump to revision 0.7
2014-05-01 08:36:56 +02:00
Christian Buchner
0659d697c1
Revision 0.6 with myriad-groestl and jackpot coin
2014-04-27 01:26:08 +02:00
Christian Buchner
07cbafe96c
Revision 0.6 with myriad-groestl and jackpot coin
2014-04-27 01:14:18 +02:00