4 Commits (3ab48f9d1ed0049ae1386cc26a5fa8caca3123f9)

Author SHA1 Message Date
Con Kolivas b0a01be319 Revert use of any() in output code in poclbm kernel. Slower. 13 years ago
Con Kolivas df58517626 Extra byte was being unused and leading to failure on some platforms. 13 years ago
Con Kolivas 93459839c8 Explicitly type the constants in poclbm kernel as uint. 13 years ago
Con Kolivas 0bde957912 Update all kernel version names. 13 years ago
Con Kolivas 8f08a775ad Use any() in kernel output code and revert breakage of diakgcn kernel. 13 years ago
Con Kolivas 145f3c0b1d Put the nonce for each vector offset in advance, avoiding one extra addition in the kernel. 13 years ago
Con Kolivas 5e31785e7b Increase poclbm version number. 13 years ago
Con Kolivas 49c28b3929 Use PreVal4addT1 instead of PreVal4 in poclbm kernel. 13 years ago
Con Kolivas 5c4df1309a Import PreVal4 and PreVal0 into poclbm kernel. 13 years ago
Con Kolivas f5c296785f Import more prepared constants into poclbm kernel. 13 years ago
Con Kolivas 734dfecec5 Keep variables in one array but use Vals[] name for consistency with other kernel designs. 13 years ago
Con Kolivas 3f9e34a53c Replace constants that are mandatorily added in poclbm kernel with one value. 13 years ago
Con Kolivas b941146c29 Remove addition of final constant before testing for result in poclbm kernel. 13 years ago
Con Kolivas 81cb584586 Hand optimise variable addition order. 13 years ago
Con Kolivas dc2d553d5b Hand optimise first variable declaration order in poclbm kernel. 13 years ago
Con Kolivas f39fac9e4d Third pass reorder. 13 years ago
Con Kolivas b754fb8f4e 2nd pass radical reorder. 13 years ago
ckolivas e2b3c85d59 Radical reordering machine based first pass to change variables as late as possible, bringing their usage close together. 13 years ago
Con Kolivas 57dad38d04 Unroll all additions to enable further optimisations. 13 years ago
Con Kolivas 64acb9dae7 Increase version numbers of modified kernels. 13 years ago
Con Kolivas 210fe9d5b9 Constify nonce in poclbm. 13 years ago
Con Kolivas 60f8ccb313 Use local and group id on poclbm kernel as well. 13 years ago
Con Kolivas 8be9d13ff2 Further generic microoptimisations to poclbm kernel. 13 years ago
Con Kolivas cad84c6f2c Change poclbm version number. 13 years ago
Con Kolivas 4f1676f67f One array is faster than 2 separate arrays so change to that in poclbm kernel. 13 years ago
Con Kolivas f5903e609d Microoptimisations to poclbm kernel which increase throughput slightly. 13 years ago
Con Kolivas 2fa142d1ce One array is faster than 2 separate arrays so change to that in poclbm kernel. 13 years ago
Con Kolivas 1355859742 Microoptimisations to poclbm kernel which increase throughput slightly. 13 years ago
Con Kolivas ebaa2be1df Update poclbm kernel for better performance on GCN and new SDKs with bitalign support when not BFI INT patching. 13 years ago
Con Kolivas 3567b69e5e Remove fragile source patching for bitalign, vectors et al. and simply pass it with the compiler options. 13 years ago
Con Kolivas 6d10ef2f6e Bump version numbers of kernels to indicate slightly different versions. 13 years ago
Con Kolivas bd79a61c43 Move poclbm to new branch optimisation as well. 13 years ago
Con Kolivas cf54f9b850 Move to 256 sized buffers and don't risk overwrite by using only 127 mask. 13 years ago
Con Kolivas 0f782ba6bd Update poclbm kernel to FF sized mask and only check that range. 13 years ago
Con Kolivas 95f878294f The extra shift in the output actually appears detrimental in cgminer and there is a minuscule chance of missing the actual result if it ends up in the same spot as MAXBUFFERS. 13 years ago
Con Kolivas a7707a26cb Rename the poclbm file to ensure a new binary is built since. 14 years ago
Con Kolivas b198badcf4 The poclbm kernel needs to be updated to work with the change to 4k sized output buffers. 14 years ago
Con Kolivas 13b43cfad1 Update copyright and authors. 14 years ago
Con Kolivas 2b6e841673 Use a buffer of up to 512 * 4 integers when retrieving work from the GPU. 14 years ago
Con Kolivas 2dbb39444d Base was being set wrongly meaning we were repeating searches and the rate was actually lower than displayed :( 14 years ago
Con Kolivas 623b9b9fd8 Patch bitalign separately from bfi_int. 14 years ago
Con Kolivas 8253f1414b Use some line breaks in the kernel. 14 years ago
Con Kolivas 4257deafdb Convert abcd... to an array. 14 years ago
Con Kolivas 75cf5ccda6 Replace Ws with an array. 14 years ago
ckolivas 19eea9067f Implement code detecting max work size and optimal vector width. 14 years ago
Con Kolivas f54d2cc0ed Make poclbm use 4 vectors and decrease worksize to keep pipelines fullish. 14 years ago
ckolivas b4d2733cfc Convert to poclbm kernel. 14 years ago