75 Commits (8bc8a943e755f9c1e80ea94529ceebe551ed6bbf)

Author SHA1 Message Date
ghost 8bc8a943e7 add lemmatize_de_all 2 years ago
ghost 17f69b9661 add min_word_len, min_prefix_len, html_strip, index_exact_words presets example 2 years ago
ghost 1e2736d67b skip empty mime type index 2 years ago
ghost 4fa33afe40 prevent infinite connection on detected streaming resources 2 years ago
ghost 982be2a949 add the description text source 2 years ago
ghost cb60d52a0b update documentation 2 years ago
ghost 45c4f7b7b0 add database optimization settings 2 years ago
ghost 2853db6207 fix mimes separator 2 years ago
ghost f827c37691 add MEGAcmd/FTP launch examples 2 years ago
ghost 81f7ea1e1e implement multi-storage snap downloads 2 years ago
ghost 1969707eeb integrate optional MEGA/cmd snap storage 2 years ago
ghost 0d19004e86 make local snap storage optimization 2 years ago
ghost 2f7d99079d implement local snaps 2 years ago
ghost d98b8f5c94 remove `hostPageToHostPage`.`quantity` field because it counts duplicates incorrectly on reindex 2 years ago
ghost 28e8bcf8d7 add audio/video media crawl support 2 years ago
ghost 566d3b442e make mime details grouped 2 years ago
ghost 746cc228a9 update page rank query 2 years ago
ghost db0e66c846 refactor to mime-based content index #1 2 years ago
ghost e7c5e2ca9d GROUP_CONCAT host image descriptions 2 years ago
ghost 28bf526d53 add host nsfw settings 2 years ago
ghost d186fff48f skip curl download on response data size reached 2 years ago
ghost 23ead4e12c update page / image description models, implement history snap crawling 2 years ago
ghost 77bd25f587 add line separators 2 years ago
ghost 0e9d29675f implement host page description history crawling 2 years ago
ghost e9d5137dfe allow svg images mime content type 2 years ago
ghost 25b6bce2ec add crawler/cleaner logs 2 years ago
ghost fdd18de373 remove abstraction 2 years ago
ghost 4801360a51 update api version 2 years ago
ghost b6605b9132 implement not reachable resources ban feature with timeout to prevent extra http requests 2 years ago
ghost f88d2ee9ff implement MIME content-type crawler filter 2 years ago
ghost 5999fb3a73 add distributed hosts crawling using yggo nodes manifest 2 years ago
ghost 297563d4a5 display related pages in priority to the unique host by rank, rand() order 2 years ago
ghost 834ac68cce create separated pagination settings for page/image search types 2 years ago
ghost 79878d17fe add crawler / proxy user agent settings 2 years ago
ghost 9ed8411d2f add image queue crawler 2 years ago
ghost d905e33b4f update host images info on search requests 2 years ago
ghost 63b51f71c6 fix space offset 2 years ago
ghost f980b6318c add page meta to the image index 2 years ago
ghost baf78e2bf5 add hostImage examples to sphinx configuration 2 years ago
ghost 0741a3e9ef implement image crawler 2 years ago
ghost 56c79d8f3a update config documentation 2 years ago
ghost 6d8f4f4882 create manifests registry 2 years ago
ghost 219a56d6cd update manifest API 2 years ago
ghost d7bbf1d96a update default settings preset 2 years ago
ghost 0a199fce72 add project description and support links 2 years ago
ghost a16a13b395 add application mode settings 2 years ago
ghost a2fc14c8cf implement manifest API 2 years ago
ghost 74dd15e544 add page rank sort order attribute 2 years ago
ghost d20487acfd add stem_enru, stem_cz, stem_ar morphology support 2 years ago
ghost 2a79671cf1 add missed option example 2 years ago