Tanguy Pruvot
66d2c5202b
x11: small simd512 gpu_expand improvement
10 years ago
Tanguy Pruvot
1e24e4899c
skein: uint2 optimisation with SM 3.0 compat (+15KH)
...
Thanks to sp and djm34 for this fast uint64 storage alternative
10 years ago
Tanguy Pruvot
b128312efb
cuda: store device SM in a global var
...
sample usage made for blake and fugue (higher intensity for SM5.2)
add these to cuda_helper and clean unused code
10 years ago
Tanguy Pruvot
db8681c1db
update readme and fix SM 3.0 build
10 years ago
Tanguy Pruvot
d8a23fa970
Tune quark part of Xn funcs
...
based on klaus commits, will increase a bit speed of most algos
PS: main increase is due to the register count tuning in Makefile
and for skein512 on linux, its the ROTL64
but almost no changes on X11 : 2648MH/s vs 2630 before
10 years ago
Tanguy Pruvot
a586cee493
quark: dos2unix files to reduce problems later
10 years ago
Tanguy Pruvot
1fbcbbacc4
Add whirlcoin and optimize x11 luffa (maxrregcount)
10 years ago
Tanguy Pruvot
d9ea5f72ce
Remove duplicated defines present in cuda_helper.h
...
also add cudaDeviceReset() on Ctrl+C for nvprof
10 years ago
Christian Buchner
3b21069504
bump to revision V1.1 with Killer Groestl
11 years ago
Christian Buchner
e049f32fee
bump to revision v0.9 (VC++ project files not updated yet)
11 years ago
Christian Buchner
433d653723
bump to revision 0.7
11 years ago