Commit Graph

88 Commits

Author SHA1 Message Date
ghost c5ae6974bd fix PDO calls 2023-08-05 21:36:28 +03:00
ghost 513addc7af add query totals counting, update crawler debug 2023-08-05 21:03:45 +03:00
ghost 142d496108 fix SQL syntax error 2023-08-05 19:31:29 +03:00
ghost d024ffd770 implement unlimited settings customization for each host 2023-08-05 19:06:39 +03:00
ghost ab6c0379c8 implement hosts crawl queue, move robots, sitemaps, manifests to this task 2023-08-04 09:32:12 +03:00
ghost 71724ae33f refactor manifest crawling 2023-08-04 09:00:03 +03:00
ghost b24d31f360 refactor cleaner, delegate tasks to crawler, init hostSetting table 2023-08-03 15:25:38 +03:00
ghost 02612d098b delete getFoundHostPage method, update API version 2023-08-03 14:08:45 +03:00
ghost 11e02da66d memory usage optimization, rename methods, remove memchached dependency from the model 2023-08-03 10:48:27 +03:00
ghost 1d7deffc4c update PR generation, delegate PR value from redirecting pages, update method names 2023-08-02 15:43:44 +03:00
ghost b7a48b905e update method names 2023-08-02 14:25:48 +03:00
ghost 9c0f361601 refactor snap storage 2023-07-31 13:33:30 +03:00
ghost a242d7b05a fix hostPageSnapId to hostPageId 2023-07-31 01:28:11 +03:00
ghost 0ea665f6e7 change method name 2023-07-30 23:32:02 +03:00
ghost 000b9ad8dd add FS cleaning features, lock execution on active crontab tasks, disable hostPageSnap/localhost untested constructions 2023-07-30 21:53:30 +03:00
ghost 3e3b7ee2ef optimize snaps, delete unused constructions 2023-07-30 19:09:41 +03:00
ghost 307eb03600 build host/host page URL in SQL query 2023-07-30 13:02:24 +03:00
ghost 8a2a79b65c add hostPageSnapStorage table to the optimization queue 2023-07-29 17:27:55 +03:00
ghost 712d67f6bf implement unlimited snap storage mirrors, delete megaCMD integration 2023-07-29 14:37:01 +03:00
ghost 9b52e3b7f5 delete unused constructions 2023-07-28 12:55:25 +03:00
ghost 1dd0a8ee2c make page rank procedural, optimize performance 2023-07-28 12:49:43 +03:00
ghost 4a4394fb27 add memcached support 2023-07-27 17:53:36 +03:00
ghost 0fb2e8a78c fix active pages total 2023-07-27 16:53:25 +03:00
ghost a4b4ea324b change top rating from hosts to pages 2023-07-27 15:09:58 +03:00
ghost 2e2501b437 implement sitemap support 2023-07-27 11:44:42 +03:00
ghost 407e0d7f18 implement top page 2023-07-26 12:51:00 +03:00
ghost 3a610d5ccb add hostPageSnap truncate command 2023-07-25 20:33:25 +03:00
ghost 443eaec64e autodelete failed snaps 2023-07-07 12:30:07 +03:00
ghost 01d5356791 remove extra brackets 2023-06-30 13:41:07 +03:00
ghost 3218add372 add custom home page reindex settings 2023-06-30 13:28:22 +03:00
ghost f4bf6b9fa4 fix crawl queue message 2023-06-27 13:14:53 +03:00
ghost 29197ab904 remove testing construction 2023-06-26 15:59:08 +03:00
ghost ed240d53b0 show available snaps only 2023-06-25 23:29:30 +03:00
ghost 5346b13602 implement custom hostPageDom elements index 2023-06-25 22:10:47 +03:00
ghost dc2d971ba0 clean up banned pages extra data 2023-06-16 16:53:14 +03:00
ghost a657d31e1d fix enum data type 2023-06-16 16:32:46 +03:00
ghost f3475035c2 show page size in explorer view, hide not available data 2023-06-13 23:20:22 +03:00
ghost ab78e17ca8 add hostPage.size collection 2023-06-13 12:45:12 +03:00
ghost 7892784f5c add httpCode column to hostPageSnapDownload table 2023-06-12 13:34:25 +03:00
ghost 0af5d165d3 remove logCrawler column not in use 2023-06-05 22:06:55 +03:00
ghost f49076bb0c index homepages and shorter URL with higher priority 2023-06-04 11:38:56 +03:00
ghost 81f7ea1e1e implement multi-storage snap downloads 2023-05-15 09:18:18 +03:00
ghost 1969707eeb integrate optional MEGA/cmd snap storage 2023-05-14 19:41:20 +03:00
ghost 50c9066f62 add tables optimization to the cron/cleaner task 2023-05-14 02:39:32 +03:00
ghost 0d19004e86 make local snap storage optimization 2023-05-14 01:45:55 +03:00
ghost 2f7d99079d implement local snaps 2023-05-13 10:15:07 +03:00
ghost d98b8f5c94 remove hostPageToHostPage.quantity field because of implements wrong duplicates counting on reindex 2023-05-13 06:30:40 +03:00
ghost eeeb3dceac implement index explorer 2023-05-13 05:54:15 +03:00
ghost 377b519a2c implement host page info mode 2023-05-13 03:51:34 +03:00
ghost 371670fadf add media referrers info 2023-05-13 03:01:00 +03:00