1
0
mirror of https://github.com/GOSTSec/sgminer synced 2025-01-11 23:37:54 +00:00
Commit Graph

6143 Commits

Author SHA1 Message Date
Noel Maersk
5f326d64f5 kernel: allow choosing Alexey Karimov's kernels.
Am I the only one who gets a segfault?..
2014-01-21 10:36:55 +02:00
Noel Maersk
9545c143e8 Merge pull request #25 from gdevenyi/kernels
Replace multiply by 2 (i*2) with left shift.
2014-01-21 00:34:01 -08:00
Noel Maersk
81337e0064 kernel: vanilla Alexey Karimov kernels (alexkarnew and alexkarold).
alexkarnew: (for driver 13.4 and newer, and cgminer 3.3.1)
alexkarold: (for older drivers than 13.4, and cgminer 3.3.1)

https://litecointalk.org/index.php?topic=4082.0

> I was able to optimize the code of cgminer's scrypt.cl.
> It gives 0-3% increase, depending on the drivers and hardware.

> 1. Without optimization, when "CO" is used, every time
> z+x*zSIZE+y*xSIZE*zSIZE is calculated.

> I have created "CO" variable, and made so that x*SIZE is calculated only
> once. Now, when "CO" is used, every time z+y*xSIZE*zSIZE is calculated.

> In one case, variable y is incremented by 1 after 8 "CO" calculations.
> I have created "CO_tmp" variable, where contains result of xSIZE*zSIZE.
> And after 8 "CO" calculations I add "CO_tmp" to "CO".

> Now, when "CO" is used, every time only z is calculated. It is faster as
> z+x*zSIZE+y*xSIZE*zSIZE :)

> In other case when "CO" is used, every time z+y*xSIZE*zSIZE is
> calculated, but it faster than z+x*zSIZE+y*xSIZE*zSIZE too.

> 2. I have replaced multiplication by 2 with bit rotation - it is faster.

> For 7xxx cards you can try to set --thread-concurrency equal to (2^n + 1).
> It may give a little more mining speed.
> For example: 16385 (it is 2^14 + 1), 8193 (2^13 + 1), or 4097 (2^12 + 1).
> I have almost no information, how it works on other series.

> LMqRcHdwnZtTMH6c2kWoxSoKM5KySfaP5C
2014-01-21 09:45:10 +02:00
Gabriel A. Devenyi
a6ccc05e5f Replace multiply by 2 (i*2) with left shift. This should be slightly faster. Can't use OpenCL rotate becase of bit carry overs 2014-01-20 17:13:18 -05:00
Noel Maersk
f796aa51ca kernel/misc: change ckolivas.cl encoding to UTF-8. 2014-01-20 23:47:24 +02:00
Noel Maersk
7950c10374 kernel: integrate Zuikkis' kernel, selectable with --kernel=zuikkis.
Kernel ckolivas is still the default. Needs documentation, too.

https://github.com/veox/sgminer/issues/4
2014-01-20 23:38:39 +02:00
Noel Maersk
afdc8b5ffb kernel: zuikkis' vanilla kernel added.
Changed encoding to UTF-8.

Will not build with sgminer (fix in next commit).

http://www.reddit.com/r/dogecoin/comments/1ui3bx/increase_such_hashrate_1_to_5_scrypt_tweaking/ceir5na

> It is pretty much stock, except that I have removed all the #pragma
> unrolls, and optimized the inner scrypt_core loop. #pragma unroll does
> not give any speedup here.

> The idea is to move the "if (j&1)" comparison to outside of the lookup
> loops. Then, if j&1 happens to be zero, the V[z] and X[z] loops can be
> combined to a single loop, which gives the speedup!

> This loop and the salsa function are the most important places in the
> entire source, it probably spends over 90% of time in here.. There's
> very little to be gained outside of these, I think.

> Donations: DQj4t2DFMQtXofhstouyZw1sYUKWUJn4wv

https://github.com/veox/sgminer/issues/4#issuecomment-32753290

> Most of these optimized kernels (including mine), have fixed
> lookup-gap=2. However, I have never seen anyone use any other value, for
> any GPU, so I think you could just remove the configurable value.

> Or with some #if LOOKUP_GAP==2 magic it is of course possible to make
> such source that allows any value.

> Some users have reported slightly slower hashrate with my kernel as
> well, but this could be some misconfiguration also.. If scrypt kernel
> becomes faster, you may need to lower the GPU engine clock to get full
> speed. Same as if you increase GPU clock too high, you will get a drop
> in hash rate.

> My source is free to use in sgminer. And if you diff to original you
> will see that the changes are not very big.

> Removing of #pragma unrolls helps in any GPU, in my opinion.. Current
> compilers know better when unrolling helps.
2014-01-20 23:36:58 +02:00
Noel Maersk
9d1db95a28 misc: move scrypt130511.cl kernel file to kernel/ckolivas.cl
Prepare for more kernels. :)

Get rid of the kernel date, since this fudges up commit history.

Named after Con Kolivas, the most recent committer.
2014-01-20 16:04:43 +02:00
Noel Maersk
875db702b3 adl: unambiguosly check if FanSpeedInfo specifies support for either mechanism to set FanSpeed.
Reported by olejr: https://github.com/veox/sgminer/issues/15#issuecomment-32715082

Related issue: https://github.com/veox/sgminer/issues/15
2014-01-19 21:12:13 +02:00
Luke Dashjr
74fb5ab4b8 adl: Set iSpeedType to RPM for get-fanspeed requests
...and ensure we don't change do something weird with the fan when initially setting user-defined speed flag.

Fixes crash on R9 cards. Thanks to tkg for hints on what was wrong!
RPM preferred over percent since writing RPM back is always supported (while percent is not in some cases).

Conflicts:
	adl.c
2014-01-19 20:58:02 +02:00
Noel Maersk
99fa25c27f core: don't use magic numbers to set xintensity. 2014-01-19 20:06:55 +02:00
Noel Maersk
463668b878 core: initial implementation of user-settable xintensity. Has a bug.
Changing `intensity` and to it from `xintensity` works fine.

Changing `xintensity` sometimes fails to enqueue kernel.

For example, starting with --xintensity=128 (on a 5850) and
then changing to 64, 42, or 100 is reliable. However, changing to 127 is
not, and produces

    [04:13:01] Error -54: Enqueueing kernel onto command queue. (clEnqueueNDRangeKernel)
    [04:13:01] GPU 0 failure, disabling!

Manually enabling the disabled GPU is successful, but the GPU no longer
submits shares.

This might be a hardware limitation.
2014-01-19 05:14:17 +02:00
Noel Maersk
9d0b7f9658 core: Accidentally left a part of a previously non-merged commit (thread display).
IMO threads don't change dynamically, so they shouldn't be displayed
in the main window, but in "GPU settings" menu instead.
2014-01-19 02:46:21 +02:00
Noel Maersk
1049af3a04 misc: Add Luke-jr back to AUTHORS.md and reorder a bit. 2014-01-19 02:05:53 +02:00
Luke Dashjr
3faf4dd4a6 Use lround instead of llround, since it has better compatibility with older MingW and Cygwin versions 2014-01-19 01:46:09 +02:00
Martin Danielsen
7aeae40af2 EXPERIMENTAL: A new way of setting intensity; introducing xintensity!
All of this is credited to ArGee of RGMiner, he did the initial ground work for this setting.

This new setting allows for a much finer grained intensity setting and also opens up for dual gpu threads on devices not previously able to. Note: make sure to use lower thread-concurrency values when you increase cpu threads.

Intensity is currently used to spawn GPU threads as a simple 2^value setting.
  I:13 =    8192 threads
  I:15 =   32768 threads
  I:17 =  131072 threads
  I:18 =  262144 threads
  I:19 =  524288 threads
  I:20 = 1048576 threads
Notice how the higher settings increase thread count tremendously.

Now enter the xintensity setting (Yes, I am a genius with my naming convention!).
It is simply a shader multiplier, obviously based on the amount of shaders you got on a card, this should allow the same value to scale with different card models.
   6970 with 1536 shaders: xI:64 = 98304 threads
R9 280X with 2048 shaders: xI:64 = 131072 threads
 R9 290 with 2560 shaders: xI:64 = 180224 threads
R9 290X with 2816 shaders: xI:64 = 163840 threads

   6970 with 1536 shaders: xI:300 = 460800 threads
R9 280X with 2048 shaders: xI:300 = 614400 threads
 R9 290 with 2560 shaders: xI:300 = 768000 threads
R9 290X with 2816 shaders: xI:300 = 844800 threads

It's now much easier to control thread intensity and it potentially allows for a uniform way of setting the intensity on your system. I'm very interested in constructive feedback, as I do not have access to a lot of different card models.

This change has been tested on 6970, R9 290, R9 290X - all with equal or a little better speeds than regular intensity setting after a little tuning, but your mileage may vary. Don't fret it, if this doesn't work for you, the regular intensity setting is still available.

Conflicts:
	driver-opencl.c
	sgminer.c
2014-01-18 21:11:31 +02:00
Martin Danielsen
d7e469bd76 Stratum servers can now be named.
The names will be used throughout the display in the program, when not set "Pool 1" will
simply be used instead. The names are not exposed through API yet, it's on my TODO list.

Use "poolname" like this:

	{
		"poolname" : "Example pool",
		"url" : "stratum+tcp://example.com:8080",
		"user" : "y",
		"pass" : "x"
	},

Conflicts:
	sgminer.c
	util.c
2014-01-18 21:05:26 +02:00
Noel Maersk
2f0fac6c66 core: Allow setting kernel.
https://github.com/veox/sgminer/issues/14
2014-01-17 17:44:18 +02:00
Noel Maersk
b16a05aa0e core: lower maximum intensity to 31.
Intensity determines the number of threads started, is an unsigned int and
is used as (1 << intensity), so anything over 31 is meaningless.
2014-01-17 16:00:20 +02:00
Wolf
39b9945c11 Changed API reporting of MH/s for added precision. 2014-01-17 11:55:13 +02:00
Noel Maersk
bc1ce6984b doc: minor FAQ update. 2014-01-17 10:37:34 +02:00
Noel Maersk
f16f1b1b05 Merge pull request #13 from Drogean/patch-1
doc: Added SGMINER specific steps (for building on Windows)
2014-01-16 14:35:27 -08:00
Drogean
b1de4c896b Added SGMINER specific steps
Modified multiple steps for specifically compiling the 4.0.0 SGMINER build
2014-01-16 17:19:39 -05:00
Noel Maersk
3845e89d00 doc: wrong link in ADL_SDK readme.txt.
https://github.com/veox/sgminer/issues/7
2014-01-16 16:27:12 +02:00
Noel Maersk
693a5679e6 Merge pull request #3 from jbruggeman/master
doc: update faq and windows build docs
2014-01-16 04:01:48 -08:00
Joseph Bruggeman
4b7cf40e63 doc: update faq to remove non-scrypt (irrelevent) information 2014-01-15 23:57:52 -05:00
Joseph Bruggeman
9b37a0131b doc: update windows doc to reflect changes in configure.ac 2014-01-15 23:46:45 -05:00
Noel Maersk
3af2534e22 doc: missing 'make' in README + file name misspelled. 2014-01-16 02:48:31 +02:00
Noel Maersk
3bf3df09b7 Bump version to 4.0.0.
First sgminer release.
2014-01-15 17:26:18 +02:00
Noel Maersk
cbad2d67df doc: clarify what doc/GPU is. 2014-01-15 17:25:27 +02:00
Noel Maersk
11920081ec ui: Don't pad most displayed strings with spaces. 2014-01-15 17:20:54 +02:00
Noel Maersk
e46581dea1 Remove --vectors (current kernel only supports 1 vector).
The functionality to set the number of vectors still remains, though.
2014-01-15 17:03:50 +02:00
Noel Maersk
e6b5fb7aa8 core: remove deprecated --scrypt option. 2014-01-15 16:45:04 +02:00
Noel Maersk
50066cf3c7 misc: Replace all remaining instances of 'cgminer' with 'sgminer'.
sed again.
2014-01-15 16:36:48 +02:00
Noel Maersk
6ea3b93809 core: fix build error due to CGMINER_PREFIX reference. 2014-01-15 16:03:24 +02:00
Noel Maersk
9d16ff98f5 doc: reference veox/sgminer in ChangeLog. 2014-01-15 15:58:51 +02:00
Noel Maersk
683abd0b17 build: fix warning + other small fixes. 2014-01-15 15:52:05 +02:00
Noel Maersk
5b977125fa misc: Replace many (but not all) instances of 'cgminer' with 'sgminer'.
Just ran a `sed -i 's/cgminer/sgminer/'` on several files.
2014-01-15 15:36:19 +02:00
Noel Maersk
909af927c3 Merge pull request #2 from veox/forward-port
Forward-port relevant changes up to ckolivas/cgminer 3.10.0
2014-01-15 04:43:24 -08:00
Kano
65518d3712 api.c no decref if not json 2014-01-15 14:32:31 +02:00
Con Kolivas
350fe7f135 Minimise risk of nonce2 overflow with small nonce2 lengths by always encoding the work little endian, and increasing the maximum size of nonce2 to 8 bytes. 2014-01-15 14:31:28 +02:00
Con Kolivas
9628207066 Provide a helper function that can reset cgsems to zero. 2014-01-15 14:28:24 +02:00
Con Kolivas
155945094f Add to cgminer_CPPFLAGS instead of redefining them.
Conflicts:
	Makefile.am
2014-01-15 13:54:14 +02:00
Noel Maersk
747d870d32 build: remove reference to a linux-usb-miner 2014-01-15 13:49:01 +02:00
Con Kolivas
d26fb09e7c Replace deprecated use of INCLUDES with _CPPFLAGS.
Conflicts:
	Makefile.am
2014-01-15 13:48:33 +02:00
Con Kolivas
1ca4198a45 Quieten down jansson component of build. 2014-01-15 13:44:49 +02:00
Tim Bateman
0a187bdd41 Fixed one byte stack overflow in mcast recvfrom.
The actual overflow happens when enforcing the NULL termination shortly
after the recvfrom.
2014-01-15 13:43:10 +02:00
Con Kolivas
65d9136dd5 Fix json parsing in api.c 2014-01-15 13:39:51 +02:00
Con Kolivas
79521fe8f0 Initialise devices before attempting to connect to pools to allow their thread prepare function to be called before having to connect to pools.
Conflicts:
	cgminer.c
2014-01-15 13:38:44 +02:00
Con Kolivas
7d2cee3066 Drop json stratum auth failed message log level to verbose. 2014-01-15 13:35:19 +02:00