Commit Graph

  • 28e8bcf8d7 add audio/video media crawl support ghost 2023-05-13 01:23:09 +0300
  • 89d1b2230b update readme ghost 2023-05-11 08:46:44 +0300
  • ced7d7c9d6 remove unused css construction ghost 2023-05-11 08:00:46 +0300
  • 9ad03c8153 add meta data description ghost 2023-05-11 07:40:09 +0300
  • b83ad6cc3a fix default mime ghost 2023-05-11 01:45:36 +0300
  • acafdfcf3a add method filters ghost 2023-05-11 01:34:09 +0300
  • 566d3b442e make mime details grouped ghost 2023-05-10 23:37:24 +0300
  • 4486bdc215 show mime type options that match search results only ghost 2023-05-10 20:37:05 +0300
  • 307ebcf0b1 add page description on title | description | keywords not empty, remove deprecated constructions ghost 2023-05-10 19:35:01 +0300
  • 7c5ba050b2 fix media crawling ghost 2023-05-10 18:35:18 +0300
  • 746cc228a9 update page rank query ghost 2023-05-10 15:42:48 +0300
  • 0fed16621a fix mime content type update ghost 2023-05-10 14:47:33 +0300
  • 34e25a1d94 update readme ghost 2023-05-10 14:32:36 +0300
  • db0e66c846 refactor to mime-based content index #1 ghost 2023-05-10 12:47:36 +0300
  • 272a885039 add line separator ghost 2023-05-09 16:37:56 +0300
  • 12c33d8ed6 add line separator ghost 2023-05-09 16:34:33 +0300
  • c13842b6c0 remove extra query ghost 2023-05-09 16:30:36 +0300
  • e7c5e2ca9d GROUP_CONCAT host image descriptions ghost 2023-05-09 16:27:31 +0300
  • 0ffcee1efb fix image description updates timing ghost 2023-05-09 15:53:21 +0300
  • 2c5ca1b630 fix image description duplicate ghost 2023-05-09 15:23:32 +0300
  • 1c7cca1446 fix UNIQUE index relation ghost 2023-05-09 14:10:08 +0300
  • 28bf526d53 add host nsfw settings ghost 2023-05-09 13:26:19 +0300
  • 8ce0324e94 convert page data to string ghost 2023-05-09 12:52:07 +0300
  • dfca5570c6 remove unused construction ghost 2023-05-09 12:10:42 +0300
  • 7dc7c89d9e update readme ghost 2023-05-09 10:22:41 +0300
  • d186fff48f skip curl download on response data size reached ghost 2023-05-09 10:21:37 +0300
  • d7a5f7ef84 remove content filter, snap raw the data ghost 2023-05-09 09:02:17 +0300
  • ef4de6b245 fix image search page errors ghost 2023-05-09 08:53:33 +0300
  • 377d4935ad update readme ghost 2023-05-09 08:28:09 +0300
  • 23ead4e12c update page / image description models, implement history snap crawling ghost 2023-05-09 08:19:49 +0300
  • 77bd25f587 add line separators ghost 2023-05-09 01:39:56 +0300
  • 0e9d29675f implement host page description history crawling ghost 2023-05-09 01:29:32 +0300
  • 6371def666 fix attributes passing ghost 2023-05-08 17:52:17 +0300
  • 32d0f390d3 update http code and mime type on page/image ban event ghost 2023-05-08 14:13:53 +0300
  • 84dcecf50b add svg images support, fix mime validation ghost 2023-05-08 13:12:16 +0300
  • e9d5137dfe allow svg images mime content type ghost 2023-05-08 13:00:37 +0300
  • e6da2e729a fix images ban update ghost 2023-05-08 13:00:02 +0300
  • 8fbd7f3516 count totals using sphinx index instead of database ghost 2023-05-08 12:28:49 +0300
  • bf1eeb332c fix page/image mime content type detection ghost 2023-05-08 12:10:57 +0300
  • 25b6bce2ec add crawler/cleaner logs ghost 2023-05-08 11:04:59 +0300
  • dcdc2c50ad update debug string names ghost 2023-05-08 08:31:34 +0300
  • ea04220de3 add curl requests debug ghost 2023-05-08 08:27:21 +0300
  • 1aba060d34 fix variable name ghost 2023-05-08 07:23:50 +0300
  • fdd18de373 remove abstraction ghost 2023-05-06 14:03:43 +0300
  • 4801360a51 update api version ghost 2023-05-06 13:55:05 +0300
  • 6c41dd5831 fix ban time update / count affected rows only ghost 2023-05-06 10:11:25 +0300
  • 20514c455f add banned items counters ghost 2023-05-06 08:50:41 +0300
  • b6605b9132 implement not reachable resources ban feature with timeout to prevent extra http requests ghost 2023-05-06 08:45:37 +0300
  • cfa5d01db1 update readme ghost 2023-05-06 07:33:34 +0300
  • 702a14b634 add mime content type crawling #1 ghost 2023-05-06 07:25:54 +0300
  • 0bd95d7f4d fix comments ghost 2023-05-05 21:39:48 +0300
  • f88d2ee9ff implement MIME content-type crawler filter ghost 2023-05-05 21:25:57 +0300
  • d945fdfd91 update readme ghost 2023-05-05 20:29:12 +0300
  • 0e7220f7f8 display url decoded links ghost 2023-05-05 20:09:15 +0300
  • bca05e66e9 update readme ghost 2023-05-05 19:53:54 +0300
  • bed5d3f149 fix offset out of bounds error ghost 2023-05-05 15:16:36 +0300
  • c45592b459 update readme ghost 2023-05-05 13:32:56 +0300
  • ff187cfd60 update readme ghost 2023-05-05 13:27:57 +0300
  • b1ac5e64a1 update readme ghost 2023-05-05 13:22:53 +0300
  • 376476c7b0 update readme ghost 2023-05-05 13:20:50 +0300
  • 44b4b50336 update readme ghost 2023-05-05 13:20:23 +0300
  • d1312a2ade update readme ghost 2023-05-05 13:15:49 +0300
  • e99dae84dd update readme ghost 2023-05-05 13:09:38 +0300
  • 14541ae5e3 update readme ghost 2023-05-05 12:59:46 +0300
  • 052c829b64 update readme ghost 2023-05-05 12:52:20 +0300
  • a1d8522006 update readme ghost 2023-05-05 12:49:07 +0300
  • 05fedc6fa6 update readme ghost 2023-05-05 12:25:22 +0300
  • ce16d2dd9f update readme ghost 2023-05-05 12:17:48 +0300
  • 6a3f4d1904 update readme ghost 2023-05-05 11:56:53 +0300
  • 747caccea5 update readme ghost 2023-05-05 11:55:50 +0300
  • b09ac5eb43 update readme ghost 2023-05-05 11:53:32 +0300
  • 463b81af5e update readme ghost 2023-05-05 11:49:16 +0300
  • 4d54e5cc8f update readme ghost 2023-05-05 11:46:03 +0300
  • 32e81ede31 update readme ghost 2023-05-05 11:41:40 +0300
  • 5999fb3a73 add distributed hosts crawling using yggo nodes manifest ghost 2023-05-05 05:26:53 +0300
  • f0b2eb1613 show images total instead of pages in placeholder on image search page ghost 2023-05-05 01:42:44 +0300
  • 5297e6e918 fix condition error ghost 2023-05-04 11:35:22 +0300
  • 297563d4a5 display related pages in priority to the unique host by rank, rand() order ghost 2023-05-04 10:53:37 +0300
  • 34b7291228 add related to image hostpages limit ghost 2023-05-04 10:17:47 +0300
  • adc791f378 fix updateTime init ghost 2023-05-04 10:11:13 +0300
  • 317a58cfa2 add main container paddings for mobile browsers ghost 2023-05-04 09:55:52 +0300
  • 5310c3423b update search page header ghost 2023-05-04 09:52:08 +0300
  • 0cc712f24e fix variable definition ghost 2023-05-04 09:24:21 +0300
  • 834ac68cce create separated pagination settings for page/image search types ghost 2023-05-04 09:20:34 +0300
  • d8449d4f7d update db prototype ghost 2023-05-04 08:55:03 +0300
  • dad9a2633c increase h3 margins ghost 2023-05-04 08:53:41 +0300
  • d4f66c83e7 fix image crawling errors ghost 2023-05-04 08:51:45 +0300
  • baa8b0d2f0 fix data type formatting ghost 2023-05-04 07:58:07 +0300
  • 79878d17fe add crawler / proxy user agent settings ghost 2023-05-04 07:38:22 +0300
  • 73f212e3d7 set crawler queue order priority to item rank, rand() ghost 2023-05-04 06:55:05 +0300
  • 9ed8411d2f add image queue crawler ghost 2023-05-04 06:45:04 +0300
  • d905e33b4f update host images info on search requests ghost 2023-05-04 06:12:51 +0300
  • 68581960a3 add image.data field ghost 2023-05-04 05:19:29 +0300
  • 9c24eda833 switch to native curl library ghost 2023-05-04 04:56:25 +0300
  • 100d12c6ab update curl library constructor ghost 2023-05-04 04:55:26 +0300
  • bb4e97eea3 use curl for image connections to prevent queue timeout ghost 2023-05-04 04:42:07 +0300
  • 63b51f71c6 fix space offset ghost 2023-05-04 04:20:54 +0300
  • f980b6318c add page meta to the image index ghost 2023-05-04 04:20:20 +0300
  • 250e20bbcd remove separator ghost 2023-05-04 04:19:38 +0300
  • 6b18202588 implement proxied image search #1 ghost 2023-05-04 03:48:57 +0300