Commit Graph

483 Commits

Author SHA1 Message Date
ghost
1d7deffc4c update PR generation, delegate PR value from redirecting pages, update method names 2023-08-02 15:43:44 +03:00
ghost
bba718c901 remove host pages total column 2023-08-02 15:36:26 +03:00
ghost
b7a48b905e update method names 2023-08-02 14:25:48 +03:00
ghost
e65c24f6f3 uodate roadmap 2023-08-02 12:44:10 +03:00
ghost
1655ec63b2 skip xmpp links 2023-08-02 11:57:54 +03:00
ghost
06c136f05c fix meta/nofollow attribute processing 2023-08-02 10:56:25 +03:00
ghost
39ba77fce5 fix page info conditions 2023-08-01 22:17:54 +03:00
ghost
ef170f62f3 update cli 2023-08-01 21:55:18 +03:00
ghost
43776b5ff4 fix semaphores 2023-08-01 17:53:14 +03:00
ghost
48e0482dbd update Filter::searchQuery method 2023-08-01 17:20:42 +03:00
ghost
cc0cca346b allow empty search queries 2023-08-01 16:47:39 +03:00
ghost
d119756a41 fix index size 2023-08-01 16:23:40 +03:00
ghost
662351cc46 make meta fields index separated, set search priority by document title 2023-08-01 14:15:14 +03:00
ghost
5791877a4e update Filter::searchQuery method, fix search by URL 2023-08-01 13:50:07 +03:00
ghost
0bda87fbe6 fix priority calculation on zero value in PR 2023-08-01 11:17:29 +03:00
ghost
bf69d894ca change search results piority, add PR to the page weight 2023-08-01 11:13:06 +03:00
ghost
d3c628b477 update Filter::searchQuery method 2023-08-01 11:01:08 +03:00
ghost
61a0652f51 update Filter::searchQuery method 2023-07-31 23:33:06 +03:00
ghost
3235133cd0 extract keywords from URI 2023-07-31 22:42:49 +03:00
ghost
3d6bc54b66 update Filter::searchQuery method 2023-07-31 22:07:59 +03:00
ghost
2ef9948342 change default CRAWL_PAGE_HOME_SECONDS_OFFSET value to 1 month 2023-07-31 22:04:27 +03:00
ghost
fd3444a379 change timestamp sort order 2023-07-31 14:25:38 +03:00
ghost
9c0f361601 refactor snap storage 2023-07-31 13:33:30 +03:00
ghost
5d7bcba42b minor optimization 2023-07-31 01:45:34 +03:00
ghost
a242d7b05a fix hostPageSnapId to hostPageId 2023-07-31 01:28:11 +03:00
ghost
aacbdfebc8 enable localhost DB-FS relations sync 2023-07-30 23:33:31 +03:00
ghost
0ea665f6e7 change method name 2023-07-30 23:32:02 +03:00
ghost
9e53618193 add snap file existing message 2023-07-30 22:24:16 +03:00
ghost
1e6451f863 update output message on snap index deletion 2023-07-30 22:12:30 +03:00
ghost
3546e07d6a fix roadmap in the CLI hostPageSnap section 2023-07-30 22:08:05 +03:00
ghost
9f23a0ebe4 move clean/crawl commans to the crontab options 2023-07-30 22:05:37 +03:00
ghost
1dbb9f0366 keep native functions inside the nlistr method 2023-07-30 22:00:18 +03:00
ghost
000b9ad8dd add FS cleaning features, lock execution on active crontab tasks, disable hostPageSnap/localhost untested constructions 2023-07-30 21:53:30 +03:00
ghost
547cd6717b prevent scheduled execution on cli/yggo running 2023-07-30 21:47:09 +03:00
ghost
3e3b7ee2ef optimize snaps, delete unused constructions 2023-07-30 19:09:41 +03:00
ghost
36becf6fe1 remove dots 2023-07-30 18:13:23 +03:00
ghost
db7e92391d remove undefined variable from CLI output 2023-07-30 18:06:46 +03:00
ghost
fde30da74c fix variable name 2023-07-30 17:59:15 +03:00
ghost
30a81ca6fb delete snaps from registry when not found in the any of storage available 2023-07-30 17:42:36 +03:00
ghost
1972b3411c fix snap file location 2023-07-30 14:46:07 +03:00
ghost
3c4a89d16d fix host page snap storage id 2023-07-30 14:39:21 +03:00
ghost
307eb03600 build host/host page URL in SQL query 2023-07-30 13:02:24 +03:00
ghost
b13293988a add search index by host and host page URL 2023-07-30 12:39:41 +03:00
ghost
1e664ba4cd cover snap deletion in transaction 2023-07-30 12:18:35 +03:00
ghost
1f33205236 add script tag support 2023-07-30 00:52:55 +03:00
ghost
b9ec787bbb decrease length of link previews 2023-07-30 00:21:45 +03:00
ghost
b433fa6b3c add link tag support 2023-07-30 00:17:28 +03:00
ghost
a5a48f37f7 read sitemap location in user-agent:* only 2023-07-29 23:37:25 +03:00
ghost
051bfb2e81 update snap reindex CLI output 2023-07-29 20:23:43 +03:00
ghost
43f53f0967 fix method name 2023-07-29 20:07:44 +03:00