Tanguy Pruvot
c859041993
quark/blake512 opt. pointed by sp without asm
...
indeed, the pragma unroll doesnt always make things faster
asm part... to check later
10 years ago
Tanguy Pruvot
438308b3a2
Rework benchmark mode and min/max range
...
Was maybe my fault, but the benchmark mode was
always recomputing from nonce 0.
Also fix blake if -d 1 is used (one thread but second gpu)
stats: do not use thread id as key, prefer gpu id...
10 years ago
Tanguy Pruvot
11dbbcc12d
checkhash: some work on a faster variant (wip)
...
This should not be used for all algos... not enabled yet
todo: multiple nounces or blake32 style checkup
10 years ago
Tanguy Pruvot
14a41959f8
x11: switch to intensity 20 for SM>=5.2 750+970
10 years ago
Tanguy Pruvot
b128312efb
cuda: store device SM in a global var
...
sample usage made for blake and fugue (higher intensity for SM5.2)
add these to cuda_helper and clean unused code
10 years ago
Tanguy Pruvot
11c5ec810d
Handle intensity param in all algos
...
and add a check related to start/max nounce params
10 years ago
Tanguy Pruvot
93f4409dde
simd: then reindent the code
...
no changes, only error checks (cuda safe call)
10 years ago
Tanguy Pruvot
d8a23fa970
Tune quark part of Xn funcs
...
based on klaus commits, will increase a bit speed of most algos
PS: main increase is due to the register count tuning in Makefile
and for skein512 on linux, its the ROTL64
but almost no changes on X11 : 2648MH/s vs 2630 before
10 years ago
Tanguy Pruvot
7cc5222394
Move common check_cpu functions to root
10 years ago
Tanguy Pruvot
95ac1d0f19
x11: adapt some blake 256 opts to 512 one
...
blake512: for the moment 6.2ms vs 7.12 before (+10%)
10 years ago
Tanguy Pruvot
b4e690b486
sources: swith to UTF-8
10 years ago
Tanguy Pruvot
d9ea5f72ce
Remove duplicated defines present in cuda_helper.h
...
also add cudaDeviceReset() on Ctrl+C for nvprof
10 years ago
Tanguy Pruvot
a9a3ad8afc
cuda: check for errors on cuda mem alloc
10 years ago
Tanguy Pruvot
06763c20b1
Implement x14 (cuda + cpu functions)
...
Project was updated for VS2013 and CUDA SDK 6.5
add also a --cputest function to dump cpu hash results
TODO: x15 is not fully functional, but first loop seems ok
Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>
10 years ago
Christian Buchner
3b21069504
bump to revision V1.1 with Killer Groestl
11 years ago
Christian Buchner
be5ba30131
massive speed upgrade for the SIMD hash. AMD, be afraid.
11 years ago
Christian Buchner
44d38e3a9a
Simplification of the SIMD hash code (remove unnecessary lookup tables), increase X11 throughput value somewhat
11 years ago
Christian Buchner
af07302b4b
v1.0 - Yo, I heard y'all like X11
11 years ago
Christian Buchner
e049f32fee
bump to revision v0.9 (VC++ project files not updated yet)
11 years ago