Con Kolivas
cc3b693c6d
Minor warning fixes.
2012-10-07 12:27:02 +11:00
Con Kolivas
40b747bae6
Put scrypt warning on separate line to avoid 0 being shown on windows as bufsize.
2012-10-07 10:08:45 +11:00
Con Kolivas
d91af893c8
Use correct sdk version detection for SDK 2.7
2012-08-28 18:08:39 +10:00
Con Kolivas
69983b778b
Revert "Pick worksize 256 with Cypress if none is specified."
...
This reverts commit 482322a4b7
.
Worksize 256 was only helpful on cypress with ultra-low memory speeds with old SDKs and the new kernels require higher memory clocks, having the opposite net effect.
2012-08-28 17:19:38 +10:00
Con Kolivas
4fbe5bed15
OpenCL 1.0 does not have native atomic_add and extremely slow support with atom_add so detect opencl1.0 and use a non-atomic workaround.
2012-08-23 23:25:32 +10:00
Con Kolivas
482322a4b7
Pick worksize 256 with Cypress if none is specified.
2012-08-23 12:47:28 +10:00
Con Kolivas
be06cf7083
Give warning with sdk2.7 and phatk as well.
2012-08-23 12:44:42 +10:00
Con Kolivas
cce19d9005
Whitelist sdk2.7 for diablo kernel as well.
2012-08-23 12:42:10 +10:00
Con Kolivas
fc44b6d7a1
Use different variables for command line specified lookup gap and thread concurrency to differentiate user defined versus auto chosen values.
2012-08-05 15:32:44 +10:00
Con Kolivas
97aa6ea492
Fix build error without scrypt enabled.
2012-07-29 19:13:45 +10:00
Con Kolivas
43752ee58c
Limit thread concurrency for scrypt to 5xshaders if shaders is specified.
2012-07-26 16:12:45 +10:00
Con Kolivas
da1b996a39
Simplify repeated use of gpus[gpu]. in ocl.c
2012-07-26 16:10:21 +10:00
Con Kolivas
ea10b08dce
Find the nearest power of 2 maximum alloc size for the scrypt buffer that can successfully be allocated and is large enough to accomodate the thread concurrency chosen, thus mapping it to an intensity.
2012-07-25 22:02:14 +10:00
Con Kolivas
9a6c082ad1
Make the thread concurrency and lookup gap options hidden on the command line and autotune parameters with a newly parsed --shaders option.
2012-07-24 20:27:37 +10:00
Con Kolivas
3a0d60cfe1
Always create the largest possible padbuffer for scrypt kernels even if not needed for thread_concurrency, giving us some headroom for intensity levels.
2012-07-23 21:30:30 +10:00
Con Kolivas
d8f81c18ee
Use the detected maximum allocable memory on a GPU to determine the optimal scrypt settings when lookup_gap and thread_concurrency parameters are not given.
2012-07-23 17:51:57 +10:00
Con Kolivas
89eb1fa393
Check the maximum allocable memory size per opencl device.
2012-07-23 17:41:31 +10:00
Con Kolivas
5087ff9069
Add debugging output if buffer allocation fails for scrypt and round up bufsize to a multiple of 256.
2012-07-23 16:37:13 +10:00
Con Kolivas
1711b4eb77
Display size of scrypt buffer used in debug.
2012-07-22 00:58:09 +10:00
Con Kolivas
39f7d2fa74
Allow lookup gap and thread concurrency to be passed per device and store details in kernel binary filename.
2012-07-21 17:31:06 +10:00
Con Kolivas
7d53fba1ad
Reinstate GPU only opencl device detection.
2012-07-21 02:49:50 +10:00
Con Kolivas
d13a3f1d50
Decrease lookup gap to 1. Does not seem to help in any way being 2.
2012-07-21 02:47:27 +10:00
Con Kolivas
d72add9af3
Send correct values to scrypt kernel to get it finally working.
2012-07-20 16:16:18 +10:00
Con Kolivas
3e61db105d
Create command queue before compiling program in opencl.
2012-07-18 21:58:27 +10:00
Con Kolivas
471daecb5f
Initialise mdplatform.
2012-07-16 20:15:27 +10:00
Con Kolivas
428d5e5d4d
Limit scrypt to 1 vector.
2012-07-16 13:22:35 +10:00
Con Kolivas
a9a0bba18b
Set the correct data for cldata and prepare for pad8 fixes.
2012-07-16 11:53:18 +10:00
Con Kolivas
04edf4bfa2
Temporarily set opencl to use all devices to allow debugging of scrypt kernel rapidly.
2012-07-15 13:40:56 +10:00
Con Kolivas
53e9c61c02
Find the gpu platform with the most devices and use that if no platform option is passed.
2012-07-15 13:40:11 +10:00
Con Kolivas
884f83f313
Allow more platforms to be probed if first does not return GPUs.
2012-07-15 13:31:03 +10:00
Con Kolivas
243d005b1b
Set scrypt settings and buffer size in ocl.c code to be future modifiable.
2012-07-14 16:21:27 +10:00
Con Kolivas
aabc723326
Make sure goffset is set for scrypt and drop padbuffer8 to something manageable for now.
2012-07-14 00:30:25 +10:00
Con Kolivas
e0296c411b
Set up buffer8 for scrypt.
2012-07-13 21:35:25 +10:00
Con Kolivas
0f43eb5eb7
Don't test nonce with sha and various fixes for scrypt.
2012-07-13 20:35:44 +10:00
Con Kolivas
b085c338f6
Make scrypt buffers and midstate compatible with cgminer.
2012-07-13 20:28:36 +10:00
Con Kolivas
dd740caa98
Provide initial support for the scrypt kernel to compile with and mine scrypt with the --scrypt option.
2012-07-13 19:02:43 +10:00
Philip Kaufmann
f479be0700
add goffset support for diakgcn with -v 1 and update kernel version
2012-04-27 09:58:30 +02:00
Con Kolivas
9a3ae2660e
Add support for latest ATI SDK on windows.
2012-04-27 10:22:53 +10:00
Con Kolivas
bb31988347
Detect poorly performing combination of SDK and phatk kernel and add verbose warning at startup.
2012-04-25 11:41:35 +10:00
Con Kolivas
9175e4f25c
Display all OpenCL devices when -n is called as well to allow debugging of differential mapping of OpenCL to ADL.
2012-04-23 17:56:31 +10:00
Con Kolivas
6274fbe727
Change the preferred vector width to 1 for Tahiti only, not all poclbm kernels.
2012-03-30 09:32:42 +11:00
Con Kolivas
621bcca7f5
Use global offset parameter to diablo and poclbm kernel ONLY for 1 vector kernels.
2012-03-27 22:58:46 +11:00
Con Kolivas
39395eb1e0
Use poclbm preferentially on Tahiti now regardless of SDK.
2012-03-27 22:58:46 +11:00
Con Kolivas
edb070c833
Fixes.
2012-02-24 14:14:05 +11:00
Con Kolivas
fb077c6d59
Pass vectors * worksize to kernel to avoid one op.
2012-02-24 14:14:04 +11:00
Con Kolivas
709c4cd8e1
Use diablo kernel on all future SDKs for Tahiti and set preferred vector width to 1 on poclbm kernel only.
2012-02-23 20:24:32 +11:00
ckolivas
dfcb98debf
Use the SDK and hardware information to choose good performing default kernels.
2012-02-23 00:45:40 +11:00
ckolivas
d3ad87f5d2
Allow writing of multiple worksizes to the configuration file.
2012-02-22 20:13:23 +11:00
ckolivas
1b1fa5cd89
Allow writing of multiple vector sizes to the configuration file.
2012-02-22 20:08:29 +11:00
ckolivas
994cd77501
Allow writing of multiple kernels to the configuration file.
2012-02-22 20:01:09 +11:00