Con Kolivas
|
1b5c676de7
|
Use 256 output slots for kernels to allow 1 for each worksize.
|
13 years ago |
Con Kolivas
|
7367f57d14
|
Reuse Vals[] array more than W[] till they're re-initialised on the second sha256 cycle in poclbm kernel.
|
13 years ago |
Con Kolivas
|
baadafa298
|
Minor variable alignment in poclbm kernel.
|
13 years ago |
Con Kolivas
|
0a39a22c2d
|
Microoptimise poclbm kernel by ordering Val variables according to usage frequency.
|
13 years ago |
Con Kolivas
|
fb881eac62
|
Revert the crossover of variables from Vals to W in poclbm kernel now that Vals are the first declared variables so they're used more frequently.
|
13 years ago |
Con Kolivas
|
1a2130cc53
|
Vals variables appearing first in the array in poclbm is faster.
|
13 years ago |
Con Kolivas
|
4325cff2b0
|
Update version numbers of poclbm and diablo kernels as their APIs have also changed.
|
13 years ago |
Con Kolivas
|
621bcca7f5
|
Use global offset parameter to diablo and poclbm kernel ONLY for 1 vector kernels.
|
13 years ago |
Con Kolivas
|
746fe8b2e6
|
Remove unused constant passed to poclbm.
|
13 years ago |
Con Kolivas
|
2a3b7359eb
|
Clean up use of macros in poclbm and use bitselect everywhere possible.
|
13 years ago |
Con Kolivas
|
db873ba202
|
Add worksize and vector attribute hints to the poclbm kernel.
|
13 years ago |
Con Kolivas
|
325246d309
|
Spaces for non-aligned variables in poclbm.
|
13 years ago |
Con Kolivas
|
ab3e63ff42
|
More tidying of poclbm.
|
13 years ago |
Con Kolivas
|
0af7bbd74f
|
Swap Vals and W variables where they can overlap in poclbm.
|
13 years ago |
Con Kolivas
|
00796251e8
|
More tidying of poclbm.
|
13 years ago |
Con Kolivas
|
e1d580be70
|
Tidy up first half of poclbm.
|
13 years ago |
Con Kolivas
|
338f6d5788
|
Clean up use of any() by diablo and poclbm kernels.
|
13 years ago |
Con Kolivas
|
a7a9dbcf90
|
Minor variable symmetry changes in poclbm.
|
13 years ago |
Con Kolivas
|
c297f63e5e
|
Put additions on separate lines for consistency in poclbm.
|
13 years ago |
Con Kolivas
|
3fa8613557
|
Consolidate last use of W11 into Vals4 in poclbm.
|
13 years ago |
Con Kolivas
|
4885b02e32
|
Add last value in vectors in diablo and poclbm kernel for consistency with original code.
|
13 years ago |
Con Kolivas
|
40b18d5d01
|
Use the unrolled option for no vectors return code.
|
13 years ago |
Con Kolivas
|
d8f14fd666
|
Cluster Vals7 for use on output.
|
13 years ago |
Con Kolivas
|
76d0554d76
|
Get rid of extra char which is just truncated in poclbm kernel.
|
13 years ago |
Con Kolivas
|
d32cd583ac
|
Reinstate the old output mechanism setting output[FOUND] per vector.
|
13 years ago |
Con Kolivas
|
e9889a384d
|
Revert kernels that are designed for newer hardware and SDKs to 2.3.0 release style.
|
13 years ago |
Con Kolivas
|
fb077c6d59
|
Pass vectors * worksize to kernel to avoid one op.
|
13 years ago |
Con Kolivas
|
70e8ade54f
|
Revert behaviour to old nonce init code.
|
13 years ago |
Con Kolivas
|
bce47064b6
|
Revert use of any() in output code in poclbm kernel. Slower.
|
13 years ago |
Con Kolivas
|
b0a01be319
|
Revert use of any() in output code in poclbm kernel. Slower.
|
13 years ago |
Con Kolivas
|
df58517626
|
Extra byte was being unused and leading to failure on some platforms.
|
13 years ago |
Con Kolivas
|
93459839c8
|
Explicitly type the constants in poclbm kernel as uint.
|
13 years ago |
Con Kolivas
|
0bde957912
|
Update all kernel version names.
|
13 years ago |
Con Kolivas
|
8f08a775ad
|
Use any() in kernel output code and revert breakage of diakgcn kernel.
|
13 years ago |
Con Kolivas
|
145f3c0b1d
|
Put the nonce for each vector offset in advance, avoiding one extra addition in the kernel.
|
13 years ago |
Con Kolivas
|
5e31785e7b
|
Increase poclbm version number.
|
13 years ago |
Con Kolivas
|
49c28b3929
|
Use PreVal4addT1 instead of PreVal4 in poclbm kernel.
|
13 years ago |
Con Kolivas
|
5c4df1309a
|
Import PreVal4 and PreVal0 into poclbm kernel.
|
13 years ago |
Con Kolivas
|
f5c296785f
|
Import more prepared constants into poclbm kernel.
Conflicts:
poclbm120213.cl
|
13 years ago |
Con Kolivas
|
734dfecec5
|
Keep variables in one array but use Vals[] name for consistency with other kernel designs.
|
13 years ago |
Con Kolivas
|
3f9e34a53c
|
Replace constants that are mandatorily added in poclbm kernel with one value.
|
13 years ago |
Con Kolivas
|
b941146c29
|
Remove addition of final constant before testing for result in poclbm kernel.
|
13 years ago |
Con Kolivas
|
81cb584586
|
Hand optimise variable addition order.
|
13 years ago |
Con Kolivas
|
dc2d553d5b
|
Hand optimise first variable declaration order in poclbm kernel.
|
13 years ago |
Con Kolivas
|
f39fac9e4d
|
Third pass reorder.
|
13 years ago |
Con Kolivas
|
b754fb8f4e
|
2nd pass radical reorder.
|
13 years ago |
ckolivas
|
e2b3c85d59
|
Radical reordering machine based first pass to change variables as late as possible, bringing their usage close together.
|
13 years ago |
Con Kolivas
|
57dad38d04
|
Unroll all additions to enable further optimisations.
|
13 years ago |
Con Kolivas
|
64acb9dae7
|
Increase version numbers of modified kernels.
|
13 years ago |
Con Kolivas
|
210fe9d5b9
|
Constify nonce in poclbm.
|
13 years ago |