Commit Graph

498 Commits

Author SHA1 Message Date
ghost
8726512cf0 change morphology from stem_enru to lemmatize_ru_all/lemmatize_en_all 2023-06-05 18:20:49 +03:00
ghost
8bc8a943e7 add lemmatize_de_all 2023-06-05 18:13:31 +03:00
ghost
3d3fcdda87 update readme 2023-06-05 13:44:23 +03:00
ghost
a1249859a8 update readme 2023-06-05 13:42:19 +03:00
ghost
eef23cc830 update readme 2023-06-05 13:40:35 +03:00
ghost
17f69b9661 add min_word_len, min_prefix_len, html_strip, index_exact_words presets example 2023-06-05 13:36:15 +03:00
ghost
1e2736d67b skip empty mime type index 2023-06-04 18:10:59 +03:00
ghost
c5e25d17fb prevent page ban when it MIME in the whitelist, skip steps below only (make multimedia/streaming resources visible in search results) 2023-06-04 17:44:09 +03:00
ghost
4fa33afe40 prevent infinitive connection on streaming resources detected 2023-06-04 17:02:32 +03:00
ghost
345c59b5f4 collect target location links on page redirect available 2023-06-04 14:58:33 +03:00
ghost
5d7f2bf68c fix snap foreign keys deletion 2023-06-04 13:39:47 +03:00
ghost
242e0abd86 ban pages only on data type error codes only 2023-06-04 13:10:32 +03:00
ghost
62a4f33b53 load missed dependency 2023-06-04 12:27:20 +03:00
ghost
512bd56056 ban page that throws the error and stuck the crawl queue 2023-06-04 12:04:41 +03:00
ghost
5a47c66e55 fix readme description 2023-06-04 11:49:25 +03:00
ghost
f49076bb0c index homepages and shorter URL with higher priority 2023-06-04 11:38:56 +03:00
ghost
3b1590cf7b update readme 2023-06-01 00:07:11 +03:00
ghost
69eb8d4ed4 update readme 2023-05-31 16:30:05 +03:00
ghost
52bd900ad4 update readme 2023-05-31 16:28:58 +03:00
ghost
982be2a949 add the description text source 2023-05-30 21:46:52 +03:00
ghost
cb60d52a0b update documentation 2023-05-29 22:36:13 +03:00
ghost
45c4f7b7b0 add database optimization settings 2023-05-29 22:13:41 +03:00
ghost
fc687f7f2c update readme 2023-05-18 14:13:00 +03:00
ghost
3113057a36 update readme 2023-05-18 14:11:42 +03:00
ghost
a2b1dc4aa7 update meow 2023-05-17 14:25:31 +03:00
ghost
6e45c85ce4 update readme 2023-05-17 13:52:50 +03:00
ghost
d88b30925c update readme 2023-05-17 13:50:04 +03:00
ghost
3d100568d7 update readme 2023-05-17 13:48:56 +03:00
ghost
940d5b0042 update readme 2023-05-15 21:54:37 +03:00
ghost
56a706948e update readme 2023-05-15 21:25:54 +03:00
ghost
35acabeab9 update readme 2023-05-15 21:18:30 +03:00
ghost
2853db6207 fix mimes separator 2023-05-15 17:18:33 +03:00
ghost
a69270034a update robots.txt 2023-05-15 17:08:15 +03:00
ghost
f827c37691 add MEGAcmd/FTP launch examples 2023-05-15 11:51:27 +03:00
ghost
ae7467ad75 update readme 2023-05-15 09:34:36 +03:00
ghost
df195249df update readme 2023-05-15 09:33:16 +03:00
ghost
81f7ea1e1e implement multi-storage snap downloads 2023-05-15 09:18:18 +03:00
ghost
653be0b79d add missed snap folder 2023-05-14 21:09:56 +03:00
ghost
d277610b49 update .gitignore 2023-05-14 19:44:20 +03:00
ghost
afb014cdb2 update readme 2023-05-14 19:42:55 +03:00
ghost
1969707eeb integrate optional MEGA/cmd snap storage 2023-05-14 19:41:20 +03:00
ghost
f55a2dd26a remove subject link shortener on explore page view 2023-05-14 10:43:33 +03:00
ghost
cdb9160c79 decrease max link preview length to 32 chars 2023-05-14 10:37:27 +03:00
ghost
bd99dcb023 add leading zero to mkdir access code 2023-05-14 05:43:03 +03:00
ghost
5c1d8faa93 remove init file 2023-05-14 05:41:50 +03:00
ghost
48664f0caf fix zip close, loop brake condition 2023-05-14 04:33:35 +03:00
ghost
50c9066f62 add tables optimization to the cron/cleaner task 2023-05-14 02:39:32 +03:00
ghost
134b70e130 update readme 2023-05-14 01:51:30 +03:00
ghost
0d19004e86 make local snap storage optimization 2023-05-14 01:45:55 +03:00
ghost
8a3b25b31c update .gitignore 2023-05-13 11:07:29 +03:00