Commit Graph

  • baf78e2bf5 add hostImage examples to sphinx configuration ghost 2023-05-04 01:34:12 +0300
  • 0741a3e9ef implement image crawler ghost 2023-05-04 01:04:39 +0300
  • 78931ebc74 normalize host image description storage ghost 2023-05-03 21:52:00 +0300
  • 1122cb9798 update DB prototype ghost 2023-05-03 21:51:26 +0300
  • db617f9939 refactor image storage model ghost 2023-05-03 21:27:15 +0300
  • 74fb0d50be add DB prototype scheme ghost 2023-05-03 21:26:32 +0300
  • 1ee2ac4f0b add yggo:manifest namespace ghost 2023-05-03 09:38:58 +0300
  • 56c79d8f3a update config documentation ghost 2023-05-03 09:31:40 +0300
  • f8e0a50db6 add manifest url filter ghost 2023-05-03 09:26:48 +0300
  • 6d8f4f4882 create manifests registry ghost 2023-05-03 09:22:14 +0300
  • 219a56d6cd update manifest API ghost 2023-05-03 05:47:02 +0300
  • eb3e70a7b7 fix robots.txt conditions ghost 2023-05-03 04:17:58 +0300
  • d7bbf1d96a update default settings preset ghost 2023-05-03 04:17:13 +0300
  • ec20435790 remove presets registry (because provided in the node API) ghost 2023-05-03 04:13:32 +0300
  • 0bd765064b implement extended search mode support #9 ghost 2023-05-01 20:09:28 +0300
  • fb5cfe4f50 update readme ghost 2023-05-01 19:23:51 +0300
  • c3ff4de3bb update readme ghost 2023-05-01 19:10:42 +0300
  • b2d3cf1c13 update readme ghost 2023-05-01 19:09:28 +0300
  • 84fd82f294 fix replacement typo #9 ghost 2023-05-01 19:03:14 +0300
  • d40b914983 add new chars quoting #9 ghost 2023-05-01 18:58:03 +0300
  • f7807cf43e add extended syntax filter to prevent sphinxql query error #9 ghost 2023-05-01 18:39:46 +0300
  • a5f5541395 skip robots:noindex page without extra actions ghost 2023-04-29 08:58:48 +0300
  • 00140e30a8 update readme ghost 2023-04-29 07:49:09 +0300
  • c592edbd82 update readme ghost 2023-04-29 07:47:55 +0300
  • 9ae91ee187 remove phrase search mask, allow sphinx macroses ghost 2023-04-29 07:41:59 +0300
  • e418ddcd32 fix data type ghost 2023-04-25 21:20:35 +0300
  • 11aa404807 add metaYggo field index ghost 2023-04-25 21:10:59 +0300
  • 0a199fce72 add project description and support links ghost 2023-04-25 20:33:06 +0300
  • a16a13b395 add application mode settings ghost 2023-04-25 20:25:12 +0300
  • 9396c52313 change manifest API key names ghost 2023-04-25 19:53:52 +0300
  • e396a3a848 update readme ghost 2023-04-25 19:44:25 +0300
  • 6ade7e9fcd update readme ghost 2023-04-25 19:43:10 +0300
  • 957f15188b add CRAWL_PAGE_SECONDS_OFFSET info ghost 2023-04-25 19:38:17 +0300
  • a2fc14c8cf implement manifest API ghost 2023-04-25 19:35:52 +0300
  • 5875dd58c9 fix PR update condition ghost 2023-04-25 18:19:22 +0300
  • 74dd15e544 add page rank sort order attribute ghost 2023-04-25 17:07:57 +0300
  • 8671fc4bde implement page ranking ghost 2023-04-25 16:54:01 +0300
  • 57f64f6b90 add hostPage weight and rank info ghost 2023-04-25 16:53:13 +0300
  • d20487acfd add stem_enru, stem_cz, stem_ar morphology support ghost 2023-04-25 16:10:44 +0300
  • 2a79671cf1 add missed option example ghost 2023-04-25 16:09:38 +0300
  • afd4375e4d add hostPagesTotal info to the hosts API ghost 2023-04-25 16:01:55 +0300
  • 1d7031e4f7 make protocol settings adaptive ghost 2023-04-24 02:32:03 +0300
  • fcee7f62ef fix max_matches error ghost 2023-04-23 09:29:24 +0300
  • 3917ca8d4f move crontab configuration example to the config directory ghost 2023-04-23 09:07:06 +0300
  • bf976058c8 update readme ghost 2023-04-23 09:03:53 +0300
  • 74300cdf71 update readme ghost 2023-04-23 09:03:02 +0300
  • 12836660e4 update readme ghost 2023-04-23 07:16:40 +0300
  • 82042a52bf update readme ghost 2023-04-23 07:12:40 +0300
  • ef50716696 update readme ghost 2023-04-23 07:10:32 +0300
  • 3eb44d1aef update readme ghost 2023-04-23 07:09:28 +0300
  • 2d985f5851 update demo media ghost 2023-04-23 07:07:33 +0300
  • 4f8c0bc498 update home page demo ghost 2023-04-23 07:05:36 +0300
  • c4f30b0b94 update readme ghost 2023-04-23 07:03:12 +0300
  • e66e6832bd update crontab task example ghost 2023-04-23 04:48:15 +0300
  • 5936fa9a30 fix quota check condition ghost 2023-04-23 04:31:32 +0300
  • 8dbb4a06af add disk quota validation ghost 2023-04-23 04:05:00 +0300
  • 7bee0ebb4d update readme ghost 2023-04-23 03:29:02 +0300
  • 0df47efa8b fix API_ENABLED condition ghost 2023-04-23 03:25:43 +0300
  • 13431008c4 add options documentation ghost 2023-04-23 03:16:54 +0300
  • 1c4904d333 update readme ghost 2023-04-23 03:08:49 +0300
  • 14ba97f46a update readme ghost 2023-04-23 03:04:01 +0300
  • 5b16d83ca1 update readme ghost 2023-04-23 03:03:27 +0300
  • 9916fb701f implement basic api ghost 2023-04-23 03:01:51 +0300
  • 81cb970248 add options documentation ghost 2023-04-23 01:54:10 +0300
  • 8da150b295 add options documentation ghost 2023-04-23 01:46:34 +0300
  • 8f09db5045 add options documentation ghost 2023-04-23 01:32:34 +0300
  • c4dfb58fe3 add options documentation ghost 2023-04-23 01:14:31 +0300
  • 24472ea452 update readme ghost 2023-04-12 13:11:09 +0300
  • 921317c667 update readme ghost 2023-04-12 13:09:46 +0300
  • 7104cf19b7 update readme ghost 2023-04-12 12:53:51 +0300
  • fb18a9b955 update README.md ghost 2023-04-10 03:24:10 +0300
  • 352466ad03 update host.robotsPostfix registry ghost 2023-04-10 03:19:08 +0300
  • e6b1e8029c add missed regex replacement rule ghost 2023-04-10 03:18:50 +0300
  • dfbc6132c9 fix robots:noindex condition, add robots:nofollow attribute support ghost 2023-04-09 15:25:15 +0300
  • 5c8d299a4a add meta:robots tag support #2 ghost 2023-04-09 03:28:31 +0300
  • 6550eb310f update host.robotsPostfix rules ghost 2023-04-09 03:10:42 +0300
  • 6cee58214e update host.robotsPostfix rules ghost 2023-04-09 03:05:43 +0300
  • 6f4daf7a25 update host.robotsPostfix rule ghost 2023-04-09 02:19:07 +0300
  • f4db66d53f add new host.robotsPostfix rules ghost 2023-04-09 02:14:13 +0300
  • 9018acd0e2 update meta tags ghost 2023-04-09 01:22:36 +0300
  • 139e2c88eb add robots.txt ghost 2023-04-09 01:16:53 +0300
  • e505c76aaa update roadmap item by #5 answer ghost 2023-04-09 00:37:19 +0300
  • be7eae501b add host.status registry #1, #5 ghost 2023-04-09 00:28:51 +0300
  • bee5086f22 add crontab configuration example, check roadmap item ghost 2023-04-09 00:07:06 +0300
  • 8e8d89db0e implement database cleaner ghost 2023-04-09 00:06:28 +0300
  • 3c9bc1adaa add required user-agent construction #5 ghost 2023-04-09 00:02:31 +0300
  • 0484d43482 fix trim path levels in the relative links ghost 2023-04-08 23:52:46 +0300
  • b819fda025 init yggdrasil robots.txt registry #5 ghost 2023-04-08 22:29:33 +0300
  • df6f2a1869 implement CRAWL_ROBOTS_POSTFIX_RULES configuration #5 ghost 2023-04-08 22:28:31 +0300
  • 505544c8c9 add affiliate link ghost 2023-04-08 20:13:13 +0300
  • b3c668706b trim path levels in the relative links ghost 2023-04-08 19:14:04 +0300
  • 71a3e7dd0e skip x-raw-image links crawl ghost 2023-04-08 19:11:12 +0300
  • 50b6e90380 Merge branch 'main' of https://github.com/YGGverse/YGGo into main ghost 2023-04-08 18:23:51 +0300
  • 8d102ecdf7 index hosts with enabled status only ghost 2023-04-08 18:23:48 +0300
  • 0b12e872a3 add host name to the search index ghost 2023-04-08 18:22:53 +0300
  • a29d6d5d0a
    Update README.md d47081 2023-04-07 18:24:50 +0300
  • ab71b3823a update readme ghost 2023-04-07 15:03:00 +0300
  • e98146b78b index only 200 http code pages ghost 2023-04-07 05:34:45 +0300
  • 9b9d40a97c skip javascript/mailto links index ghost 2023-04-07 05:19:32 +0300
  • 2a843449e0 add process locked notice to the debug output ghost 2023-04-07 04:58:56 +0300