Sanjin Trošelj
5d5769156f
VS2010 build: Prepare ocl.c
...
Cherry-picked by veox.
2014-01-28 00:24:51 +02:00
Noel Maersk
e8978db55f
kernel: slight refactor, add warnings about new kernels, and known limitations of zuikkis.
...
Limitations of alexkar* kernels are not known to me.
Closes https://github.com/veox/sgminer/issues/28
2014-01-23 20:08:29 +02:00
Noel Maersk
5f326d64f5
kernel: allow choosing Alexey Karimov's kernels.
...
Am I the only one who gets a segfault?..
2014-01-21 10:36:55 +02:00
Noel Maersk
7950c10374
kernel: integrate Zuikkis' kernel, selectable with --kernel=zuikkis
.
...
Kernel ckolivas is still the default. Needs documentation, too.
https://github.com/veox/sgminer/issues/4
2014-01-20 23:38:39 +02:00
Noel Maersk
9d1db95a28
misc: move scrypt130511.cl kernel file to kernel/ckolivas.cl
...
Prepare for more kernels. :)
Get rid of the kernel date, since this fudges up commit history.
Named after Con Kolivas, the most recent committer.
2014-01-20 16:04:43 +02:00
Martin Danielsen
7aeae40af2
EXPERIMENTAL: A new way of setting intensity; introducing xintensity!
...
All of this is credited to ArGee of RGMiner, he did the initial ground work for this setting.
This new setting allows for a much finer grained intensity setting and also opens up for dual gpu threads on devices not previously able to. Note: make sure to use lower thread-concurrency values when you increase cpu threads.
Intensity is currently used to spawn GPU threads as a simple 2^value setting.
I:13 = 8192 threads
I:15 = 32768 threads
I:17 = 131072 threads
I:18 = 262144 threads
I:19 = 524288 threads
I:20 = 1048576 threads
Notice how the higher settings increase thread count tremendously.
Now enter the xintensity setting (Yes, I am a genius with my naming convention!).
It is simply a shader multiplier, obviously based on the amount of shaders you got on a card, this should allow the same value to scale with different card models.
6970 with 1536 shaders: xI:64 = 98304 threads
R9 280X with 2048 shaders: xI:64 = 131072 threads
R9 290 with 2560 shaders: xI:64 = 180224 threads
R9 290X with 2816 shaders: xI:64 = 163840 threads
6970 with 1536 shaders: xI:300 = 460800 threads
R9 280X with 2048 shaders: xI:300 = 614400 threads
R9 290 with 2560 shaders: xI:300 = 768000 threads
R9 290X with 2816 shaders: xI:300 = 844800 threads
It's now much easier to control thread intensity and it potentially allows for a uniform way of setting the intensity on your system. I'm very interested in constructive feedback, as I do not have access to a lot of different card models.
This change has been tested on 6970, R9 290, R9 290X - all with equal or a little better speeds than regular intensity setting after a little tuning, but your mileage may vary. Don't fret it, if this doesn't work for you, the regular intensity setting is still available.
Conflicts:
driver-opencl.c
sgminer.c
2014-01-18 21:11:31 +02:00
Noel Maersk
50066cf3c7
misc: Replace all remaining instances of 'cgminer' with 'sgminer'.
...
sed again.
2014-01-15 16:36:48 +02:00
Noel Maersk
199cb10e18
HAVE_OPENCL is mandatory, remove checks form code an build system.
2013-12-05 20:49:18 +02:00
Noel Maersk
95586f72f0
Squash-merge branch dead-end with unnecessary check removals.
2013-11-24 16:18:10 +02:00
Noel Maersk
d0070c0424
HARD RESET to: Remove Bitcoin detection code.
...
I'm pretty sure no one was working on this but me.
The code was unusable (gave 100% hardware errors) due to me not
testing it sufficiently. I hope this will not happen again.
ADL doesn't work for some reason, too.
2013-11-19 02:32:58 +02:00
Con Kolivas
578fabe07c
Do not perform bfi int patching for opencl1.2 or later.
2013-10-12 16:54:39 +11:00
Con Kolivas
466d3beca0
Provide a --disable-libcurl config option to build support for stratum mining only.
2013-09-21 22:49:28 +10:00
Kano
383eeabc71
ocl.c fix applog warnings on windows
2013-06-16 18:39:09 +10:00
Con Kolivas
b196d4fa0c
Support more shares to be returned for scrypt mining.
2013-05-11 16:19:02 +10:00
Kano
54d5d26095
ocl correct applog typing
2013-05-02 23:41:51 +10:00
Con Kolivas
2643ad1b22
Use only the one jump in ocl.c to bypass binary saves for osx opencl.
2013-04-25 00:16:09 +10:00
Con Kolivas
0a8f584909
Initialise variables not set on OSX in ocl.c.
2013-04-25 00:09:09 +10:00
Con Kolivas
9aae2256d3
Bypass attempting to read and save binary files on OSX to avoid crashes on >1 GPU.
2013-04-24 23:53:44 +10:00
Con Kolivas
57e5bfbb25
Set default ocl work size for scrypt to 256.
2013-04-21 09:36:49 +10:00
ckolivas
6ffba7e9d8
Convert error getting device IDs in ocl code to info log level only since multiple platforms may be installed and the error is harmless there.
2013-04-17 22:03:00 +10:00
ckolivas
a797898fc3
Unnecessary extra array in ocl code.
2013-04-17 22:01:38 +10:00
Kano
ed480de9c1
LTC text typo
2013-04-03 09:57:16 +11:00
Con Kolivas
132ee4c981
Do not scan other gpu platforms if one is specified.
2013-03-21 14:56:07 +11:00
Con Kolivas
584fc013ab
Use a new algorithm for choosing a thread concurrency when none or no shader value is specified for scrypt.
2013-03-15 22:31:46 +11:00
Con Kolivas
d0f18e83ad
Do not round up the bufsize to the maximum allocable with scrypt.
2013-03-15 22:00:52 +11:00
Con Kolivas
3c3fbdce1c
Remove the rounding-up of the scrypt padbuffer which was not effectual and counter-productive on devices with lots of ram, limiting thread concurrencies and intensities.
2013-03-15 21:48:48 +11:00
Con Kolivas
1c6d8a36d8
bufsize is an unsigned integer, make it so for debug.
2013-03-15 19:43:38 +11:00
Con Kolivas
767d6df1a5
Whitelist AMD APP SDK 2.8 for diablo kernel.
2013-03-09 16:45:58 +11:00
Con Kolivas
87b62bde43
Cope with the highest opencl platform not having usable devices.
2013-03-09 16:19:00 +11:00
Con Kolivas
266d31271a
Make the numbuf larger to accept larger scrypt parameters.
2013-02-10 15:07:49 +11:00
Con Kolivas
69494c12ed
BeaverCreek doesn't like BFI INT patching.
2012-12-10 15:38:21 +11:00
Con Kolivas
25c39c96bb
Ease the checking on allocation of padbuffer8 in the hope it works partially anyway on an apparently failed call.
2012-10-15 12:31:57 +11:00
Con Kolivas
cc3b693c6d
Minor warning fixes.
2012-10-07 12:27:02 +11:00
Con Kolivas
40b747bae6
Put scrypt warning on separate line to avoid 0 being shown on windows as bufsize.
2012-10-07 10:08:45 +11:00
Con Kolivas
d91af893c8
Use correct sdk version detection for SDK 2.7
2012-08-28 18:08:39 +10:00
Con Kolivas
69983b778b
Revert "Pick worksize 256 with Cypress if none is specified."
...
This reverts commit 482322a4b7
.
Worksize 256 was only helpful on cypress with ultra-low memory speeds with old SDKs and the new kernels require higher memory clocks, having the opposite net effect.
2012-08-28 17:19:38 +10:00
Con Kolivas
4fbe5bed15
OpenCL 1.0 does not have native atomic_add and extremely slow support with atom_add so detect opencl1.0 and use a non-atomic workaround.
2012-08-23 23:25:32 +10:00
Con Kolivas
482322a4b7
Pick worksize 256 with Cypress if none is specified.
2012-08-23 12:47:28 +10:00
Con Kolivas
be06cf7083
Give warning with sdk2.7 and phatk as well.
2012-08-23 12:44:42 +10:00
Con Kolivas
cce19d9005
Whitelist sdk2.7 for diablo kernel as well.
2012-08-23 12:42:10 +10:00
Con Kolivas
fc44b6d7a1
Use different variables for command line specified lookup gap and thread concurrency to differentiate user defined versus auto chosen values.
2012-08-05 15:32:44 +10:00
Con Kolivas
97aa6ea492
Fix build error without scrypt enabled.
2012-07-29 19:13:45 +10:00
Con Kolivas
43752ee58c
Limit thread concurrency for scrypt to 5xshaders if shaders is specified.
2012-07-26 16:12:45 +10:00
Con Kolivas
da1b996a39
Simplify repeated use of gpus[gpu]. in ocl.c
2012-07-26 16:10:21 +10:00
Con Kolivas
ea10b08dce
Find the nearest power of 2 maximum alloc size for the scrypt buffer that can successfully be allocated and is large enough to accomodate the thread concurrency chosen, thus mapping it to an intensity.
2012-07-25 22:02:14 +10:00
Con Kolivas
9a6c082ad1
Make the thread concurrency and lookup gap options hidden on the command line and autotune parameters with a newly parsed --shaders option.
2012-07-24 20:27:37 +10:00
Con Kolivas
3a0d60cfe1
Always create the largest possible padbuffer for scrypt kernels even if not needed for thread_concurrency, giving us some headroom for intensity levels.
2012-07-23 21:30:30 +10:00
Con Kolivas
d8f81c18ee
Use the detected maximum allocable memory on a GPU to determine the optimal scrypt settings when lookup_gap and thread_concurrency parameters are not given.
2012-07-23 17:51:57 +10:00
Con Kolivas
89eb1fa393
Check the maximum allocable memory size per opencl device.
2012-07-23 17:41:31 +10:00
Con Kolivas
5087ff9069
Add debugging output if buffer allocation fails for scrypt and round up bufsize to a multiple of 256.
2012-07-23 16:37:13 +10:00