Con Kolivas
2643ad1b22
Use only the one jump in ocl.c to bypass binary saves for osx opencl.
12 years ago
Con Kolivas
0a8f584909
Initialise variables not set on OSX in ocl.c.
12 years ago
Con Kolivas
9aae2256d3
Bypass attempting to read and save binary files on OSX to avoid crashes on >1 GPU.
12 years ago
Con Kolivas
57e5bfbb25
Set default ocl work size for scrypt to 256.
12 years ago
ckolivas
6ffba7e9d8
Convert error getting device IDs in ocl code to info log level only since multiple platforms may be installed and the error is harmless there.
12 years ago
ckolivas
a797898fc3
Unnecessary extra array in ocl code.
12 years ago
Kano
ed480de9c1
LTC text typo
12 years ago
Con Kolivas
132ee4c981
Do not scan other gpu platforms if one is specified.
12 years ago
Con Kolivas
584fc013ab
Use a new algorithm for choosing a thread concurrency when none or no shader value is specified for scrypt.
12 years ago
Con Kolivas
d0f18e83ad
Do not round up the bufsize to the maximum allocable with scrypt.
12 years ago
Con Kolivas
3c3fbdce1c
Remove the rounding-up of the scrypt padbuffer which was not effectual and counter-productive on devices with lots of ram, limiting thread concurrencies and intensities.
12 years ago
Con Kolivas
1c6d8a36d8
bufsize is an unsigned integer, make it so for debug.
12 years ago
Con Kolivas
767d6df1a5
Whitelist AMD APP SDK 2.8 for diablo kernel.
12 years ago
Con Kolivas
87b62bde43
Cope with the highest opencl platform not having usable devices.
12 years ago
Con Kolivas
266d31271a
Make the numbuf larger to accept larger scrypt parameters.
12 years ago
Con Kolivas
69494c12ed
BeaverCreek doesn't like BFI INT patching.
12 years ago
Con Kolivas
25c39c96bb
Ease the checking on allocation of padbuffer8 in the hope it works partially anyway on an apparently failed call.
12 years ago
Con Kolivas
cc3b693c6d
Minor warning fixes.
12 years ago
Con Kolivas
40b747bae6
Put scrypt warning on separate line to avoid 0 being shown on windows as bufsize.
12 years ago
Con Kolivas
d91af893c8
Use correct sdk version detection for SDK 2.7
12 years ago
Con Kolivas
69983b778b
Revert "Pick worksize 256 with Cypress if none is specified."
This reverts commit 482322a4b7.
Worksize 256 was only helpful on cypress with ultra-low memory speeds with old SDKs and the new kernels require higher memory clocks, having the opposite net effect.
12 years ago
Con Kolivas
4fbe5bed15
OpenCL 1.0 does not have native atomic_add and extremely slow support with atom_add so detect opencl1.0 and use a non-atomic workaround.
12 years ago
Con Kolivas
482322a4b7
Pick worksize 256 with Cypress if none is specified.
12 years ago
Con Kolivas
be06cf7083
Give warning with sdk2.7 and phatk as well.
12 years ago
Con Kolivas
cce19d9005
Whitelist sdk2.7 for diablo kernel as well.
12 years ago
Con Kolivas
fc44b6d7a1
Use different variables for command line specified lookup gap and thread concurrency to differentiate user defined versus auto chosen values.
13 years ago
Con Kolivas
97aa6ea492
Fix build error without scrypt enabled.
13 years ago
Con Kolivas
43752ee58c
Limit thread concurrency for scrypt to 5xshaders if shaders is specified.
13 years ago
Con Kolivas
da1b996a39
Simplify repeated use of gpus[gpu]. in ocl.c
13 years ago
Con Kolivas
ea10b08dce
Find the nearest power of 2 maximum alloc size for the scrypt buffer that can successfully be allocated and is large enough to accommodate the thread concurrency chosen, thus mapping it to an intensity.
13 years ago
Con Kolivas
9a6c082ad1
Make the thread concurrency and lookup gap options hidden on the command line and autotune parameters with a newly parsed --shaders option.
13 years ago
Con Kolivas
3a0d60cfe1
Always create the largest possible padbuffer for scrypt kernels even if not needed for thread_concurrency, giving us some headroom for intensity levels.
13 years ago
Con Kolivas
d8f81c18ee
Use the detected maximum allocable memory on a GPU to determine the optimal scrypt settings when lookup_gap and thread_concurrency parameters are not given.
13 years ago
Con Kolivas
89eb1fa393
Check the maximum allocable memory size per opencl device.
13 years ago
Con Kolivas
5087ff9069
Add debugging output if buffer allocation fails for scrypt and round up bufsize to a multiple of 256.
13 years ago
Con Kolivas
1711b4eb77
Display size of scrypt buffer used in debug.
13 years ago
Con Kolivas
39f7d2fa74
Allow lookup gap and thread concurrency to be passed per device and store details in kernel binary filename.
13 years ago
Con Kolivas
7d53fba1ad
Reinstate GPU only opencl device detection.
13 years ago
Con Kolivas
d13a3f1d50
Decrease lookup gap to 1. Does not seem to help in any way being 2.
13 years ago
Con Kolivas
d72add9af3
Send correct values to scrypt kernel to get it finally working.
13 years ago
Con Kolivas
3e61db105d
Create command queue before compiling program in opencl.
13 years ago
Con Kolivas
471daecb5f
Initialise mdplatform.
13 years ago
Con Kolivas
07292f73a1
Initialise mdplatform.
13 years ago
Con Kolivas
ffd21f8db3
Find the gpu platform with the most devices and use that if no platform option is passed.
13 years ago
Con Kolivas
f99ac0ca78
Allow more platforms to be probed if first does not return GPUs.
13 years ago
Con Kolivas
428d5e5d4d
Limit scrypt to 1 vector.
13 years ago
Con Kolivas
a9a0bba18b
Set the correct data for cldata and prepare for pad8 fixes.
13 years ago
Con Kolivas
04edf4bfa2
Temporarily set opencl to use all devices to allow debugging of scrypt kernel rapidly.
13 years ago
Con Kolivas
53e9c61c02
Find the gpu platform with the most devices and use that if no platform option is passed.
13 years ago
Con Kolivas
884f83f313
Allow more platforms to be probed if first does not return GPUs.
13 years ago