Tanguy Pruvot
438308b3a2
Rework benchmark mode and min/max range
...
Was maybe my fault, but the benchmark mode was
always recomputing from nonce 0.
Also fix blake if -d 1 is used (one thread but second gpu)
stats: do not use thread id as key, prefer gpu id...
2014-11-16 23:28:18 +01:00
Tanguy Pruvot
b128312efb
cuda: store device SM in a global var
...
sample usage made for blake and fugue (higher intensity for SM5.2)
add these to cuda_helper and clean unused code
2014-11-11 19:11:16 +01:00
Tanguy Pruvot
11c5ec810d
Handle intensity param in all algos
...
and add a check related to start/max nounce params
2014-11-09 22:27:32 +01:00
Tanguy Pruvot
4c3964539f
Fix vc debug builds, missing symbols
2014-11-06 17:42:01 +01:00
Tanguy Pruvot
12fafd5687
Try to reconnect on pool duplicates
...
reduce log announces and define uchar in miner.h
2014-11-04 15:14:24 +01:00
Tanguy Pruvot
187e293f71
blake: some fine tuning + cleanup
2014-11-03 20:55:03 +01:00
Tanguy Pruvot
5bc969fa57
Some work on data alignment
...
linux: add -march=native (we build it ourself) and some other flags
+ remove unused vars (seen with -Wall)
2014-11-03 16:40:13 +01:00
Tanguy Pruvot
93bb428bdf
blake: rewrite the cache system
...
Unlike other hash algos, blake256 compute the hash
with blocks of 64 bytes.
We can do the first part on the cpu, only the 4 last int32
are computed on gpu (including the tested nonce)
Previous method was also using this kind of cache with a crc.
Blake Hash Speed: +5%
2014-11-03 16:33:59 +01:00
Tanguy Pruvot
ba33492592
blake: return to ptarget 6:7 compare
...
clz can be erroneous, ex 0xE0 vs 0xF0
2014-09-19 05:01:16 +02:00
Tanguy Pruvot
91eea0d76b
blake: remove int cudaMemcpyToSymbol for MSVC
...
use clz (leading zeros) asm func for a fast gpu compare of ptarget[6]:[7]
add also missing windows ctz/clz host functions
New NEOS speed: 227MH to 270MH (Gigabyte 750Ti Black Edition)
2014-09-13 17:31:01 +02:00
Tanguy Pruvot
9efe0b965d
blake: only use high part of target on gpu
...
Add another few MH/s boost :)
2014-09-13 00:15:34 +02:00
Tanguy Pruvot
8925a7551f
blake: final cleanup (225MH/s)
2014-09-11 20:16:16 +02:00
Tanguy Pruvot
347d4e4928
blake: +8MH/s on linux, weird optimisation
...
Like doom/luffa, using a int pos make the proc faster
2014-09-11 02:33:34 +02:00
Tanguy Pruvot
cec5baea95
enable colors by default, except for syslog
...
debug: show compared hash diffs in color
2014-09-09 21:59:03 +02:00
Tanguy Pruvot
402e416853
Add pentablake algo (-a penta)
...
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
2014-09-09 21:58:58 +02:00
Tanguy Pruvot
42eafcbe85
Put CRC-32 function in a new unit
...
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
2014-09-06 19:41:48 +02:00
Tanguy Pruvot
5ccd166916
blake: introduce pdata head cache (speed x2)
2014-09-06 16:26:53 +02:00
Tanguy Pruvot
65909ec3b7
blake: handle case when 2 hashes are found in a call
2014-09-06 11:43:32 +02:00
Tanguy Pruvot
52ec8830b1
blake: blakecoin variant now works
2014-09-06 01:21:30 +02:00
Tanguy Pruvot
ecc86af102
blake: sometimes faster, or not
2014-09-05 21:12:38 +02:00
Tanguy Pruvot
3356e6f8bf
blake: some more KH/s on linux
2014-09-05 20:00:07 +02:00
Tanguy Pruvot
12fefe5de0
blake: add a few more MH/s, prepare blakecoin
2014-09-05 19:12:29 +02:00
Tanguy Pruvot
5682b7d241
blake: add also blakecoin (8-rounds) variant
2014-09-05 19:12:29 +02:00
Tanguy Pruvot
e1159629b4
blake: typo for windows on last commit
2014-09-05 09:47:24 +02:00
Tanguy Pruvot
746398f435
blake: fix reduced speed on windows, wtf
2014-09-04 17:34:30 +02:00
Tanguy Pruvot
3341e0324f
blake: speed +10%, no more size conversions
2014-09-04 16:17:11 +02:00
Tanguy Pruvot
806c3e8691
enhance double scan checks
2014-09-04 11:11:39 +02:00
Tanguy Pruvot
69616b37ac
hashlog: prepare store of scanned range
2014-09-03 14:49:13 +02:00
Tanguy Pruvot
1a4391d7ff
hashlog: prevent double computing on jobs already done
2014-09-03 12:20:24 +02:00
Tanguy Pruvot
049e577301
tmp blake log
2014-09-03 09:49:14 +02:00
Tanguy Pruvot
43d3e93e1a
blake: set a max throughput
2014-09-03 09:29:51 +02:00
Tanguy Pruvot
7e595a36ea
blake: cleanup, remove d_hash buf, not in a chain
...
host: only bencode if gpu hash was found
2014-09-03 08:33:55 +02:00
Tanguy Pruvot
de80c7e9d1
blake: remove unused parameter and fix index in d_hash
...
that reduce the speed to 92MH/s but the next commit
give us 30 more
so, todo: merge the whole checkhash proc in gpu_hash
and remove this d_hash buffer...
2014-09-02 20:20:53 +02:00
Tanguy Pruvot
530732458a
blake: use a constant for threads, reduce mallocated d_hash size
...
and clean a bit more...
2014-09-01 20:00:25 +02:00
Tanguy Pruvot
0aeac878ef
blake: tune up and cleanup, ~100 MH/s on a normal 750Ti
...
tested on linux and windows (x86 binary)...
but there is a high number of duplicated shares... weird
2014-09-01 12:24:26 +02:00
Tanguy Pruvot
c17d11e377
add "blake" 256, 14 rounds (for NEOS blake, not BlakeCoin)
...
also remove "missing" file, its old and not compatible with ubuntu 14.04
2014-09-01 05:53:36 +02:00