ccminer

mirror of https://github.com/GOSTSec/ccminer synced 2025-01-10 14:57:53 +00:00

Author	SHA1	Message	Date
Tanguy Pruvot	8d4d4d65ce	cuda: header for common kernel functions (quark/x11) Was thinking about doing that since months ;) lets go	2015-10-25 06:54:17 +01:00
Tanguy Pruvot	d43dc9a021	use blake512 sp kernels on SM 5+ (80+64) import and keep my code for older archs, like skein 64 reduce the gap between our versions... +150kH x11 GTX 960 / +30kH 750Ti +900kH quark GTX 960 / +230kH 750Ti	2015-10-24 13:43:22 +02:00
Tanguy Pruvot	355b835ae0	benchmark: enhance the mem leak detection reduce "false" warnings, and ignore unrelated/small ones <= 1 MB On windows the gpu memory can be allocated by other processes + some cleanup in algos... (free/gpulog)	2015-10-16 22:04:30 +02:00
Tanguy Pruvot	9dfa757dc7	warn on cuda errors + various small changes The full benchmark can now be launched with "ccminer --benchmark" add a new helper function which log a warning with last cuda error (not shown with the quiet option) : CUDA_LOG_ERROR(); it can be used where miner.h is included (.c/.cpp/.cu) fix x14 (in ccminer.cpp), a break was missing in switch..case	2015-10-12 08:46:13 +02:00
Tanguy Pruvot	d195f2e8a2	intensity: do not reduce throughput before init Else the memory allocated could be less than required later btw, use the new "cuda" function to apply intensity/throughput	2015-10-11 05:01:41 +02:00
Tanguy Pruvot	4e1e03b891	benchmark: store all algos results + cuda fixes Note: lyra2, lyra2v2 and script seems to have problems to coexist with other algos... to run after some of them... moved lyra2 first and skip scrypt/jane for the moment... Only stored in memory for now.. to display a table after the bench ccminer -a auto --benchmark Results may be exported later to a json file...	2015-10-09 02:07:08 +02:00
Tanguy Pruvot	922c2a5cd7	algos: free allocated mem for algo switch All can be freed propertly now, except script (reset) and lyra2 (leak)	2015-10-08 21:35:30 +02:00
Tanguy Pruvot	ee93927fac	diff: use the new function in all algos	2015-10-07 20:10:15 +02:00
Tanguy Pruvot	e1c4b3042c	algos: add functions to free allocated resources Will be used later for algo switching not really tested yet...	2015-09-25 07:51:57 +02:00
Tanguy Pruvot	5308898d1c	start v1.7, apply new prototypes to all algos	2015-09-23 15:42:17 +02:00
Tanguy Pruvot	42bcb91ca0	x11: update sp luffa/cube to get closer x11 speeds.. i had to clean it... lot of unused defines...	2015-06-17 02:31:15 +02:00
Tanguy Pruvot	2113be6eec	blake80: some changes and launch bounds, no perf changes	2015-04-24 14:12:21 +02:00
Tanguy Pruvot	3d3f2e2cb5	warnings: use the right device id (device_map[thr_id])	2015-04-23 09:41:56 +02:00
KlausT	ae8e863591	remove uint32_t cast	2015-03-12 01:01:47 +01:00
Tanguy Pruvot	e6112e878d	cleanup: use unsigned throughput parameters Yes, its a big commit, was waiting 1.6 to do that... Sorry for your possible merge issues ;)	2015-02-28 14:05:09 +01:00
Tanguy Pruvot	26b51a557b	Allow different intensity per device and clean the old variables, no more required	2015-01-24 11:17:29 +01:00
Tanguy Pruvot	2a5233f56e	api: report throughput when default	2015-01-22 06:28:59 +01:00
Tanguy Pruvot	cafd4477d7	Handle a maximum of 16 gpus (vs 8 before) Some cards have 2 gpus on board...	2015-01-22 04:55:27 +01:00
Tanguy Pruvot	c3bdb623e8	Check and submit multiple nonces in one loop Added to most algos, checkhash function scans a big range and can find multiple nonces at once if the difficulty is low. Stop ignoring them, submit second one if found... Clean the draft code for rc=2 implemented for blake and pentablake btw... fix the reduced displayed hashrate when a nonce is found... Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>	2014-12-05 15:53:40 +00:00
Tanguy Pruvot	118a6be361	checkhash: simplify the common function use klaus trivial function, the old code has always been a bit weird.. split cuda_check_cpu_hash_64 in two functions, keep old for branched stuff	2014-12-01 00:20:40 +01:00
Tanguy Pruvot	8ad180cc70	various small changes heavy: reduce by 256 threads default intensity to all -i 20 cuda: put static thread init bools outside the code (made once) api: fix nvml header to build without	2014-11-28 20:57:35 +01:00
sp-hash	f0d91ab8a6	Luffa and simd merged to one kernal. Small echo rewrite. +10KHASH on the 650(compute 3.0) tpruvot: add Linux Makefile - Force to 80 registers (else -30KH/s) Note : the hashrate seems more constant with this change	2014-11-23 07:04:07 +01:00
Tanguy Pruvot	73f22b237a	Prepare trap of hardware/mem failures	2014-11-20 18:44:25 +01:00
Tanguy Pruvot	fe4ad36b73	intensity: sign warnings fixes min(i,u)	2014-11-17 14:48:55 +01:00
Tanguy Pruvot	c859041993	quark/blake512 opt. pointed by sp without asm indeed, the pragma unroll doesnt always make things faster asm part... to check later	2014-11-17 00:01:32 +01:00
Tanguy Pruvot	438308b3a2	Rework benchmark mode and min/max range Was maybe my fault, but the benchmark mode was always recomputing from nonce 0. Also fix blake if -d 1 is used (one thread but second gpu) stats: do not use thread id as key, prefer gpu id...	2014-11-16 23:28:18 +01:00
Tanguy Pruvot	b128312efb	cuda: store device SM in a global var sample usage made for blake and fugue (higher intensity for SM5.2) add these to cuda_helper and clean unused code	2014-11-11 19:11:16 +01:00
Tanguy Pruvot	11c5ec810d	Handle intensity param in all algos and add a check related to start/max nounce params	2014-11-09 22:27:32 +01:00
Tanguy Pruvot	7cc5222394	Move common check_cpu functions to root	2014-09-10 00:27:01 +02:00
Tanguy Pruvot	95ac1d0f19	x11: adapt some blake 256 opts to 512 one blake512: for the moment 6.2ms vs 7.12 before (+10%)	2014-09-09 17:55:07 +02:00
Tanguy Pruvot	4bc23048b5	x15: use djm34 code with asm xor64 + my rot64 some optimizations could be done later, after whirlcoin integration	2014-08-20 05:54:47 +02:00
Tanguy Pruvot	194fda87c1	x11: restore simd host2dev memcpytosymbol to reduce used cmem Remove define attempts for SM 2.1 devices, fermi is not compatible	2014-08-19 18:32:14 +02:00
Tanguy Pruvot	d9ea5f72ce	Remove duplicated defines present in cuda_helper.h also add cudaDeviceReset() on Ctrl+C for nvprof	2014-08-19 03:29:11 +02:00
Tanguy Pruvot	a9a3ad8afc	cuda: check for errors on cuda mem alloc	2014-08-17 22:41:05 +02:00
Tanguy Pruvot	81d7e0421d	x15 now works	2014-08-15 00:11:59 +02:00
Tanguy Pruvot	06763c20b1	Implement x14 (cuda + cpu functions) Project was updated for VS2013 and CUDA SDK 6.5 add also a --cputest function to dump cpu hash results TODO: x15 is not fully functional, but first loop seems ok Signed-off-by: Tanguy Pruvot <tanguy.pruvot@gmail.com>	2014-08-12 14:47:03 +02:00

36 Commits