1
0
mirror of https://github.com/GOSTSec/ccminer synced 2025-01-08 22:07:56 +00:00
ccminer/quark
Tanguy Pruvot b9da6c67f5 improve jh512 with vectors (nist5,quark,sib,x11+,zr5)
the main improvement is to reduce asm calls to read global mem

but, a few more regs are used (68 mini vs 64 on SM 5.2)
so reduce the forced launch bounds to allow 80 or 128 regs per thread

Note: cuda 6.5 seems not able to store with v4.u32... (7.5 is fine)
        st.global.v4.u32        [%rd2], {%r3783, %r3824, %r3823, %r3822};
        st.global.v2.u32        [%rd2+16], {%r3821, %r3820};
        st.global.u32   [%rd2+24], %r3819;
        st.global.u32   [%rd2+28], %r3818;
        st.global.u32   [%rd2+44], %r3814;
        st.global.u32   [%rd2+40], %r3815;
        ...

todo, check alexis variant.. but wanted to keep this code before in git...
2017-01-30 07:27:01 +01:00
..
cuda_bmw512_sm3.cuh quark: add support for SM 2 devices 2015-11-06 04:10:06 +01:00
cuda_bmw512.cu vstudio: special ifdef for the constant (bmw) 2015-10-24 15:13:35 +02:00
cuda_jh512.cu improve jh512 with vectors (nist5,quark,sib,x11+,zr5) 2017-01-30 07:27:01 +01:00
cuda_quark_blake512_sp.cuh vstudio: clean and fix blake ifdef for x64 2015-10-24 18:21:45 +02:00
cuda_quark_blake512.cu ifdef some unused kernels on SM5+ 2015-10-28 07:25:52 +01:00
cuda_quark_compactionTest.cu quark: static shared memory allocation for SM3+ 2015-11-06 15:16:43 +01:00
cuda_quark_groestl512_sm2.cuh groestl: explain code and improve perf on SM 2.x 2017-01-29 00:51:42 +01:00
cuda_quark_groestl512.cu groestl: explain code and improve perf on SM 2.x 2017-01-29 00:51:42 +01:00
cuda_quark_keccak512.cu windows: fix missing off_t include 2015-06-08 16:58:12 +02:00
cuda_quark.h cuda: header for common kernel functions (quark/x11) 2015-10-25 06:54:17 +01:00
cuda_skein512.cu skein: merge the double implementations in one 2017-01-29 00:01:06 +01:00
groestl_functions_quad.h groestl: rename included cuda files 2015-04-06 23:46:34 +02:00
groestl_simple.cuh groestl: explain code and improve perf on SM 2.x 2017-01-29 00:51:42 +01:00
groestl_transf_quad.h groestl: rename included cuda files 2015-04-06 23:46:34 +02:00
nist5.cu migrate 2nd nonce storage of most algos 2017-01-29 05:46:45 +01:00
quarkcoin.cu migrate 2nd nonce storage of most algos 2017-01-29 05:46:45 +01:00