133 Commits (573d249e1f9a223a97f9fb0cf9925d43d449b54d)
 

Author SHA1 Message Date
yggverse 573d249e1f fix snap filesize 8 months ago
yggverse 1b12153183 add body cache decoration 8 months ago
yggverse eac2734b9f display document body cache 8 months ago
yggverse 900e3a453f Disable keywords collection from headers as body index enabled 8 months ago
yggverse 1f3ee435e9 fix custom encoding conversion 8 months ago
yggverse e09440b44a strip code content 8 months ago
yggverse b5cd219f47 strip css content from index 8 months ago
yggverse 5588668728 update link rules 8 months ago
yggverse 2909091e72 update link rules 8 months ago
yggverse 19272733e4 update url rules 8 months ago
yggverse b440e6edff disable configuration changes cleanup 8 months ago
yggverse ad3fd31f67 update cleanup condition 8 months ago
yggverse dd914e0e1b fix cleanup query 8 months ago
yggverse 25fed9f1dc add new link rules 8 months ago
yggverse 3c62dc0fd5 add new url blacklist rule 8 months ago
yggverse 36972cab19 implement alter index tool 8 months ago
yggverse 44e2836de4 add new link rules 8 months ago
yggverse 2257ce771f apply cleaner to the current url configuration 8 months ago
yggverse d9bc24c8f8 add url substrings skip rules 8 months ago
yggverse 3884f375d4 save document body text to index 8 months ago
ghost 1f27a7e105 trim extra spaces before query escape 9 months ago
ghost d6b5f8b210 build combined search query 9 months ago
ghost 1c2e8dafb2 collect keywords from document headers 10 months ago
ghost cfbc84cbaf sort queue by rank asc 10 months ago
ghost db9dc8d4ba force results to string 10 months ago
ghost ff8461835d calculate initial rank 10 months ago
ghost 50dc9d315a add rank field 10 months ago
ghost 6f4abe4729 set crc32url as document id 10 months ago
ghost 93baed4b90 delete deprecated documents with HTTP code not 200 on second scan 11 months ago
ghost 17d6171d95 fix directory existion check #2 12 months ago
ghost 100806af02 complete local snaps feature #2 12 months ago
ghost 3be2f3ce09 ignore all config files in this folder 12 months ago
ghost 33cc778999 crawl newest pages by rand in queue 12 months ago
ghost 811c700049 add http code notice 12 months ago
ghost 35ad144a9e add stripos url rules for crawl snaps 12 months ago
ghost 0e06ff3c0f fix debug message 12 months ago
ghost e066223bd2 fix link container 12 months ago
ghost 51d52dea7d fix destination name 12 months ago
ghost 87ca594860 add debug levels 12 months ago
ghost 33d657cb72 apply sleep on timeout value provided only 12 months ago
ghost bc00f0c851 make tmp subfolders storage optimization 12 months ago
ghost f613b44d3f disable sort by RAND() in crawler queue 12 months ago
ghost 646269c4d9 fix link name 12 months ago
ghost 761cac9f3e remove target="_blank" 12 months ago
ghost fa3c0491e2 fix chromium -webkit-autofill input colors 12 months ago
ghost 9087c4b0d7 add footer links settings, implement nodes registry with database download list 12 months ago
ghost 4cec81c893 make extended search mode disabled by default #7 12 months ago
ghost f0da3caaf5 add extended search mode option 12 months ago
ghost 2f2eea6821 fix registry 12 months ago
ghost 5a730c09fc update readme 12 months ago