551 Commits (8fd422b5c2ca43dd0cef3ea0628e5b589674c859)
 

Author SHA1 Message Date
ghost 0af5d165d3 remove logCrawler column not in use 2 years ago
ghost 4b16b41440 make transaction for each item in crawl queue 2 years ago
ghost b585b16d31 fix datatype error detection 2 years ago
ghost 8726512cf0 change morphology from stem_enru to lemmatize_ru_all/lemmatize_en_all 2 years ago
ghost 8bc8a943e7 add lemmatize_de_all 2 years ago
ghost 3d3fcdda87 update readme 2 years ago
ghost a1249859a8 update readme 2 years ago
ghost eef23cc830 update readme 2 years ago
ghost 17f69b9661 add min_word_len, min_prefix_len, html_strip, index_exact_words presets example 2 years ago
ghost 1e2736d67b skip empty mime type index 2 years ago
ghost c5e25d17fb prevent page ban when it MIME in the whitelist, skip steps below only (make multimedia/streaming resources visible in search results) 2 years ago
ghost 4fa33afe40 prevent infinitive connection on streaming resources detected 2 years ago
ghost 345c59b5f4 collect target location links on page redirect available 2 years ago
ghost 5d7f2bf68c fix snap foreign keys deletion 2 years ago
ghost 242e0abd86 ban pages only on data type error codes only 2 years ago
ghost 62a4f33b53 load missed dependency 2 years ago
ghost 512bd56056 ban page that throws the error and stuck the crawl queue 2 years ago
ghost 5a47c66e55 fix readme description 2 years ago
ghost f49076bb0c index homepages and shorter URL with higher priority 2 years ago
ghost 3b1590cf7b update readme 2 years ago
ghost 69eb8d4ed4 update readme 2 years ago
ghost 52bd900ad4 update readme 2 years ago
ghost 982be2a949 add the description text source 2 years ago
ghost cb60d52a0b update documentation 2 years ago
ghost 45c4f7b7b0 add database optimization settings 2 years ago
ghost fc687f7f2c update readme 2 years ago
ghost 3113057a36 update readme 2 years ago
ghost a2b1dc4aa7 update meow 2 years ago
ghost 6e45c85ce4 update readme 2 years ago
ghost d88b30925c update readme 2 years ago
ghost 3d100568d7 update readme 2 years ago
ghost 940d5b0042 update readme 2 years ago
ghost 56a706948e update readme 2 years ago
ghost 35acabeab9 update readme 2 years ago
ghost 2853db6207 fix mimes separator 2 years ago
ghost a69270034a update robots.txt 2 years ago
ghost f827c37691 add MEGAcmd/FTP launch examples 2 years ago
ghost ae7467ad75 update readme 2 years ago
ghost df195249df update readme 2 years ago
ghost 81f7ea1e1e implement multi-storage snap downloads 2 years ago
ghost 653be0b79d add missed snap folder 2 years ago
ghost d277610b49 update .gitignore 2 years ago
ghost afb014cdb2 update readme 2 years ago
ghost 1969707eeb integrate optional MEGA/cmd snap storage 2 years ago
ghost f55a2dd26a remove subject link shortener on explore page view 2 years ago
ghost cdb9160c79 decrease max link preview length to 32 chars 2 years ago
ghost bd99dcb023 add leading zero to mkdir access code 2 years ago
ghost 5c1d8faa93 remove init file 2 years ago
ghost 48664f0caf fix zip close, loop brake condition 2 years ago
ghost 50c9066f62 add tables optimization to the cron/cleaner task 2 years ago