590 Commits (main)
 

Author SHA1 Message Date
ghost 28e8bcf8d7 add audio/video media crawl support 2 years ago
ghost 89d1b2230b update readme 2 years ago
ghost ced7d7c9d6 remove unused css construction 2 years ago
ghost 9ad03c8153 add meta data description 2 years ago
ghost b83ad6cc3a fix default mime 2 years ago
ghost acafdfcf3a add method filters 2 years ago
ghost 566d3b442e make mime details grouped 2 years ago
ghost 4486bdc215 show mime type options that match search results only 2 years ago
ghost 307ebcf0b1 add page description on title | description | keywords not empty, remove deprecated constructions 2 years ago
ghost 7c5ba050b2 fix media crawling 2 years ago
ghost 746cc228a9 update page rank query 2 years ago
ghost 0fed16621a fix mime content type update 2 years ago
ghost 34e25a1d94 update readme 2 years ago
ghost db0e66c846 refactor to mime-based content index #1 2 years ago
ghost 272a885039 add line separator 2 years ago
ghost 12c33d8ed6 add line separator 2 years ago
ghost c13842b6c0 remove extra query 2 years ago
ghost e7c5e2ca9d GROUP_CONCAT host image descriptions 2 years ago
ghost 0ffcee1efb fix image description updates timing 2 years ago
ghost 2c5ca1b630 fix image description duplicate 2 years ago
ghost 1c7cca1446 fix UNIQUE index relation 2 years ago
ghost 28bf526d53 add host nsfw settings 2 years ago
ghost 8ce0324e94 convert page data to string 2 years ago
ghost dfca5570c6 remove unused construction 2 years ago
ghost 7dc7c89d9e update readme 2 years ago
ghost d186fff48f skip curl download on response data size reached 2 years ago
ghost d7a5f7ef84 remove content filter, snap raw the data 2 years ago
ghost ef4de6b245 fix image search page errors 2 years ago
ghost 377d4935ad update readme 2 years ago
ghost 23ead4e12c update page / image description models, implement history snap crawling 2 years ago
ghost 77bd25f587 add line separators 2 years ago
ghost 0e9d29675f implement host page description history crawling 2 years ago
ghost 6371def666 fix attributes passing 2 years ago
ghost 32d0f390d3 update http code and mime type on page/image ban event 2 years ago
ghost 84dcecf50b add svg images support, fix mime validation 2 years ago
ghost e9d5137dfe allow svg images mime content type 2 years ago
ghost e6da2e729a fix images ban update 2 years ago
ghost 8fbd7f3516 count totals using sphinx index instead of database 2 years ago
ghost bf1eeb332c fix page/image mime content type detection 2 years ago
ghost 25b6bce2ec add crawler/cleaner logs 2 years ago
ghost dcdc2c50ad update debug string names 2 years ago
ghost ea04220de3 add curl requests debug 2 years ago
ghost 1aba060d34 fix variable name 2 years ago
ghost fdd18de373 remove abstraction 2 years ago
ghost 4801360a51 update api version 2 years ago
ghost 6c41dd5831 fix ban time update / count affected rows only 2 years ago
ghost 20514c455f add banned items counters 2 years ago
ghost b6605b9132 implement not reachable resources ban feature with timeout to prevent extra http requests 2 years ago
ghost cfa5d01db1 update readme 2 years ago
ghost 702a14b634 add mime content type crawling #1 2 years ago