Tanguy Pruvot
94d86ca3b3
lyra2v2: fix tpb condition for SM 5.2
9 years ago
Tanguy Pruvot
41543b5c0b
lyra2v2: add support for SM 2.1 devices
...
and improve a bit SM 3 perf
9 years ago
Tanguy Pruvot
e50556b637
various changes, cleanup for the release
...
small fixes to handle better the multi thread per gpu
explicitly report than quark is not compatible with SM 2.1 (compact shuffle)
9 years ago
Tanguy Pruvot
61ff92b5b4
never interrupt global benchmark with found nonces
...
fix some algo weird hashrates (like blake)
and reset device between algos, for better accuracy
but this reset doesnt seems enough to bench all algos correctly...
to test on linux, could be a driver issue...
heavy: fix first alloc and indent with tabs...
9 years ago
Tanguy Pruvot
03b2bddc16
lyra2v2: fix SM 3.5 support
...
May work also on SM 3.0 (to check)
9 years ago
Tanguy Pruvot
3b7ef923c7
lyra2(v1): use a common uint2x4 include
...
lyrav2 still need more definitions (uint16)
9 years ago
Tanguy Pruvot
4868c412b0
windows: add support for SM 2.1, drop SM 3.5 (x86)
...
Mostly to do compatibilty tests, SM 2.1 support is very limited
SM 3.0 code should run on SM 3.5 (only a few cards use this arch)
As i can't test SM 3.5, its best to let users do their own tests...
9 years ago
Tanguy Pruvot
e5d1cf8416
lyra2v2: typo in type, its a struct of 4x uint2 :p
9 years ago
Tanguy Pruvot
6a9280a045
lyra2v2: set a better TPB for intensity 20 (sm52)
...
use sp forced unroll in skein and do some cleanup...
9 years ago
Tanguy Pruvot
32f212469b
lyra2/v2: fixes for vstudio
9 years ago
Tanguy Pruvot
5bf1f98200
various fixes for SM 2.1 and the benchmark
...
X11+ algos and quark are not compatible for the moment
but these ones are :
Benchmark results for Gigabyte GTX 460 (SM 2.1 / 1 GB):
blakecoin : 159090.5 kH/s, 1 MB, 1048576 thr.
blake : 70208.9 kH/s, 1 MB, 1048576 thr.
bmw : 122802.6 kH/s, 65 MB, 2097152 thr.
deep : 3533.6 kH/s, 33 MB, 524288 thr.
fugue256 : 43177.9 kH/s, 17 MB, 524288 thr.
heavy : 4118.2 kH/s, 147 MB, 524032 thr.
keccak : 18673.1 kH/s, 129 MB, 2097152 thr.
luffa : 28816.0 kH/s, 257 MB, 4194304 thr.
lyra2 : 213.7 kH/s, 570 MB, 65536 thr.
mjollnir : 3895.6 kH/s, 147 MB, 524032 thr.
nist5 : 1101.4 kH/s, 67 MB, 1048576 thr.
penta : 501.6 kH/s, 21 MB, 327680 thr.
skein : 5432.4 kH/s, 65 MB, 1048576 thr.
skein2 : 6788.9 kH/s, 33 MB, 524288 thr.
whirlpool : 688.5 kH/s, 33 MB, 524288 thr.
zr5 : 122.5 kH/s, 86 MB, 262144 thr.
9 years ago
Tanguy Pruvot
8fd2739a65
lyra2: support for SM 2.1 cards (GTX 460)
...
also fix the build (scrypt) for this arch.
else, 318,26 kH/s on a GTX 460...
9 years ago
Tanguy Pruvot
fc84c719e9
lyra2: improve cuda implementation (part 1, SM5+)
...
based on the new djm34 method, 2x faster than first version
cleaned and tuned for the GTX 750/960 (linux / cuda 6.5)
9 years ago
Tanguy Pruvot
9dfa757dc7
warn on cuda errors + various small changes
...
The full benchmark can now be launched with "ccminer --benchmark"
add a new helper function which log a warning with last cuda error
(not shown with the quiet option) : CUDA_LOG_ERROR();
it can be used where miner.h is included (.c/.cpp/.cu)
fix x14 (in ccminer.cpp), a break was missing in switch..case
9 years ago
Tanguy Pruvot
d195f2e8a2
intensity: do not reduce throughput before init
...
Else the memory allocated could be less than required later
btw, use the new "cuda" function to apply intensity/throughput
9 years ago
Tanguy Pruvot
8db5a0bc9e
blake: change dynamic round system
...
blakecoin was conflicting with lyra2, set the rounds more properly
9 years ago
Tanguy Pruvot
4e1e03b891
benchmark: store all algos results + cuda fixes
...
Note: lyra2, lyra2v2 and script seems to have problems
to coexist with other algos... to run after some of them...
moved lyra2 first and skip scrypt/jane for the moment...
Only stored in memory for now.. to display a table after the bench
ccminer -a auto --benchmark
Results may be exported later to a json file...
9 years ago
Tanguy Pruvot
922c2a5cd7
algos: free allocated mem for algo switch
...
All can be freed propertly now, except script (reset) and lyra2 (leak)
9 years ago
Tanguy Pruvot
ee93927fac
diff: use the new function in all algos
9 years ago
Tanguy Pruvot
87edf84bf3
lyra2v2: increase default intensity
...
to be able to say, like sp, that its faster :p
9 years ago
Tanguy Pruvot
b3adebdf2a
lyra2v2: improve speed on SM 5.2 (Cuda 6.5) with sp unrolls
...
Reduce a bit the 750Ti speed but improve a lot the 9xx speed.
Keep compat for SM 3/3.5 in a second file..
Note: With this code and Cuda 7.5, the speed won is the reverse...
May be "reverted" soon
9 years ago
Tanguy Pruvot
e1c4b3042c
algos: add functions to free allocated resources
...
Will be used later for algo switching
not really tested yet...
9 years ago
Tanguy Pruvot
5308898d1c
start v1.7, apply new prototypes to all algos
9 years ago
Tanguy Pruvot
4babf37256
Add --show-diff parameter and fix pool net diff
...
which display submitted block and net difficulty and is able
to detect shares above net diff (solved blocs)
Note: only made on lyra2v2 and zr5 algos
TODO: compute the found diff on all algos...
require changes in all scan hash "kernel" function parameters
to be continued...
9 years ago
Tanguy Pruvot
8f98bde4fb
lyra2v2: improve cubehash with uint2
9 years ago
Tanguy Pruvot
877f25aee5
lyra2v2: SM3.0 compat too
...
and strip most of the unused vectors defines...
9 years ago
Tanguy Pruvot
e4414752c4
vstudio: lyra warning fixes + project dups
9 years ago
Tanguy Pruvot
6500e7401a
lyra2v2: properly exclude SM 3.0 devices
9 years ago
Tanguy Pruvot
53cd591956
lyra2v2, bmw256 and cubehash256 cleanup + diff fix
9 years ago
Tanguy Pruvot
c9a66b696f
return to original lyra2 code
...
until i find the problem...
9 years ago
Tanguy Pruvot
d4e191610e
Import and adapt lyra2v2
...
not tested on windows and with SM <= 5
9 years ago
Tanguy Pruvot
d764b5cca2
lyra2: improve a bit the speed on SM 5.2
...
(simplified sp commit)
10 years ago
Tanguy Pruvot
03c3b7d341
Various algos cleanup + lyra2 sec nonce fix
10 years ago
Tanguy Pruvot
34fd408440
lyra2: get a second nonce per gpu scan
10 years ago
Tanguy Pruvot
3d3f2e2cb5
warnings: use the right device id (device_map[thr_id])
10 years ago
KlausT
ae8e863591
remove uint32_t cast
10 years ago
Tanguy Pruvot
e6112e878d
cleanup: use unsigned throughput parameters
...
Yes, its a big commit, was waiting 1.6 to do that...
Sorry for your possible merge issues ;)
10 years ago
Tanguy Pruvot
26b51a557b
Allow different intensity per device
...
and clean the old variables, no more required
10 years ago
Tanguy Pruvot
2a5233f56e
api: report throughput when default
10 years ago
Tanguy Pruvot
cafd4477d7
Handle a maximum of 16 gpus (vs 8 before)
...
Some cards have 2 gpus on board...
10 years ago
Tanguy Pruvot
b3188669e2
lyra2: cleanup
...
quickly tested with a SM 3.0 binary...
10 years ago
KlausT
756ea89ecd
lyra2 +100kh for GTX970
10 years ago
Tanguy Pruvot
a66d78e692
reduce lyra2 blake and pentablake cpu load
10 years ago
Tanguy Pruvot
ac9b0d27ee
Release final 1.5.1
10 years ago
Tanguy Pruvot
63e3387dbb
lyra2: add sm30 device compat (skein256)
10 years ago
Tanguy Pruvot
f9bba0ff1a
lyra2: small changes and code cleanup
10 years ago
Tanguy Pruvot
78f2dd6ffd
lyra2: reduce default intensity on windows
10 years ago
Tanguy Pruvot
5db46b74b5
lyra2: fix cpu hash + validation
10 years ago
Tanguy Pruvot
c5b349e079
Add Lyra2 algo, based on Vertcoin published code
...
Seems to be djm34 work, i recognize the code style ;)
Code was cleaned/indented and adapted to my fork...
Only usable on the test pool until 16 december 2014!
10 years ago