Con Kolivas
|
4fbe5bed15
|
OpenCL 1.0 does not have native atomic_add and extremely slow support with atom_add so detect opencl1.0 and use a non-atomic workaround.
|
13 years ago |
Con Kolivas
|
482322a4b7
|
Pick worksize 256 with Cypress if none is specified.
|
13 years ago |
Con Kolivas
|
be06cf7083
|
Give warning with sdk2.7 and phatk as well.
|
13 years ago |
Con Kolivas
|
cce19d9005
|
Whitelist sdk2.7 for diablo kernel as well.
|
13 years ago |
Con Kolivas
|
fc44b6d7a1
|
Use different variables for command line specified lookup gap and thread concurrency to differentiate user defined versus auto chosen values.
|
13 years ago |
Con Kolivas
|
97aa6ea492
|
Fix build error without scrypt enabled.
|
13 years ago |
Con Kolivas
|
43752ee58c
|
Limit thread concurrency for scrypt to 5xshaders if shaders is specified.
|
13 years ago |
Con Kolivas
|
da1b996a39
|
Simplify repeated use of gpus[gpu]. in ocl.c
|
13 years ago |
Con Kolivas
|
ea10b08dce
|
Find the nearest power of 2 maximum alloc size for the scrypt buffer that can successfully be allocated and is large enough to accomodate the thread concurrency chosen, thus mapping it to an intensity.
|
13 years ago |
Con Kolivas
|
9a6c082ad1
|
Make the thread concurrency and lookup gap options hidden on the command line and autotune parameters with a newly parsed --shaders option.
|
13 years ago |
Con Kolivas
|
3a0d60cfe1
|
Always create the largest possible padbuffer for scrypt kernels even if not needed for thread_concurrency, giving us some headroom for intensity levels.
|
13 years ago |
Con Kolivas
|
d8f81c18ee
|
Use the detected maximum allocable memory on a GPU to determine the optimal scrypt settings when lookup_gap and thread_concurrency parameters are not given.
|
13 years ago |
Con Kolivas
|
89eb1fa393
|
Check the maximum allocable memory size per opencl device.
|
13 years ago |
Con Kolivas
|
5087ff9069
|
Add debugging output if buffer allocation fails for scrypt and round up bufsize to a multiple of 256.
|
13 years ago |
Con Kolivas
|
1711b4eb77
|
Display size of scrypt buffer used in debug.
|
13 years ago |
Con Kolivas
|
39f7d2fa74
|
Allow lookup gap and thread concurrency to be passed per device and store details in kernel binary filename.
|
13 years ago |
Con Kolivas
|
7d53fba1ad
|
Reinstate GPU only opencl device detection.
|
13 years ago |
Con Kolivas
|
d13a3f1d50
|
Decrease lookup gap to 1. Does not seem to help in any way being 2.
|
13 years ago |
Con Kolivas
|
d72add9af3
|
Send correct values to scrypt kernel to get it finally working.
|
13 years ago |
Con Kolivas
|
3e61db105d
|
Create command queue before compiling program in opencl.
|
13 years ago |
Con Kolivas
|
471daecb5f
|
Initialise mdplatform.
|
13 years ago |
Con Kolivas
|
07292f73a1
|
Initialise mdplatform.
|
13 years ago |
Con Kolivas
|
ffd21f8db3
|
Find the gpu platform with the most devices and use that if no platform option is passed.
|
13 years ago |
Con Kolivas
|
f99ac0ca78
|
Allow more platforms to be probed if first does not return GPUs.
|
13 years ago |
Con Kolivas
|
428d5e5d4d
|
Limit scrypt to 1 vector.
|
13 years ago |
Con Kolivas
|
a9a0bba18b
|
Set the correct data for cldata and prepare for pad8 fixes.
|
13 years ago |
Con Kolivas
|
04edf4bfa2
|
Temporarily set opencl to use all devices to allow debugging of scrypt kernel rapidly.
|
13 years ago |
Con Kolivas
|
53e9c61c02
|
Find the gpu platform with the most devices and use that if no platform option is passed.
|
13 years ago |
Con Kolivas
|
884f83f313
|
Allow more platforms to be probed if first does not return GPUs.
|
13 years ago |
Con Kolivas
|
243d005b1b
|
Set scrypt settings and buffer size in ocl.c code to be future modifiable.
|
13 years ago |
Con Kolivas
|
aabc723326
|
Make sure goffset is set for scrypt and drop padbuffer8 to something manageable for now.
|
13 years ago |
Con Kolivas
|
e0296c411b
|
Set up buffer8 for scrypt.
|
13 years ago |
Con Kolivas
|
0f43eb5eb7
|
Don't test nonce with sha and various fixes for scrypt.
|
13 years ago |
Con Kolivas
|
b085c338f6
|
Make scrypt buffers and midstate compatible with cgminer.
|
13 years ago |
Con Kolivas
|
dd740caa98
|
Provide initial support for the scrypt kernel to compile with and mine scrypt with the --scrypt option.
|
13 years ago |
Philip Kaufmann
|
f479be0700
|
add goffset support for diakgcn with -v 1 and update kernel version
|
13 years ago |
Con Kolivas
|
9a3ae2660e
|
Add support for latest ATI SDK on windows.
|
13 years ago |
Con Kolivas
|
bb31988347
|
Detect poorly performing combination of SDK and phatk kernel and add verbose warning at startup.
|
13 years ago |
Con Kolivas
|
9175e4f25c
|
Display all OpenCL devices when -n is called as well to allow debugging of differential mapping of OpenCL to ADL.
|
13 years ago |
Con Kolivas
|
6274fbe727
|
Change the preferred vector width to 1 for Tahiti only, not all poclbm kernels.
|
13 years ago |
Con Kolivas
|
621bcca7f5
|
Use global offset parameter to diablo and poclbm kernel ONLY for 1 vector kernels.
|
13 years ago |
Con Kolivas
|
39395eb1e0
|
Use poclbm preferentially on Tahiti now regardless of SDK.
|
13 years ago |
Con Kolivas
|
edb070c833
|
Fixes.
|
13 years ago |
Con Kolivas
|
fb077c6d59
|
Pass vectors * worksize to kernel to avoid one op.
|
13 years ago |
Con Kolivas
|
709c4cd8e1
|
Use diablo kernel on all future SDKs for Tahiti and set preferred vector width to 1 on poclbm kernel only.
|
13 years ago |
ckolivas
|
dfcb98debf
|
Use the SDK and hardware information to choose good performing default kernels.
|
13 years ago |
ckolivas
|
d3ad87f5d2
|
Allow writing of multiple worksizes to the configuration file.
|
13 years ago |
ckolivas
|
1b1fa5cd89
|
Allow writing of multiple vector sizes to the configuration file.
|
13 years ago |
ckolivas
|
994cd77501
|
Allow writing of multiple kernels to the configuration file.
|
13 years ago |
ckolivas
|
93efb726bb
|
Allow multiple different kernels to be chosen per device.
|
13 years ago |