115 Commits (2ef994834278675ab5ce798b2ab47dd3568c0277)

Author SHA1 Message Date
ghost 377b519a2c implement host page info mode 2 years ago
ghost 371670fadf add media referrers info 2 years ago
ghost 4486bdc215 show mime type options that match search results only 2 years ago
ghost 307ebcf0b1 add page description on title | description | keywords not empty, remove deprecated constructions 2 years ago
ghost 7c5ba050b2 fix media crawling 2 years ago
ghost 0fed16621a fix mime content type update 2 years ago
ghost db0e66c846 refactor to mime-based content index #1 2 years ago
ghost 0ffcee1efb fix image description updates timing 2 years ago
ghost 2c5ca1b630 fix image description duplicate 2 years ago
ghost 28bf526d53 add host nsfw settings 2 years ago
ghost 8ce0324e94 convert page data to string 2 years ago
ghost dfca5570c6 remove unused construction 2 years ago
ghost d186fff48f skip curl download on response data size reached 2 years ago
ghost ef4de6b245 fix image search page errors 2 years ago
ghost 23ead4e12c update page / image description models, implement history snap crawling 2 years ago
ghost 0e9d29675f implement host page description history crawling 2 years ago
ghost 32d0f390d3 update http code and mime type on page/image ban event 2 years ago
ghost 8fbd7f3516 count totals using sphinx index instead of database 2 years ago
ghost 25b6bce2ec add crawler/cleaner logs 2 years ago
ghost ea04220de3 add curl requests debug 2 years ago
ghost 6c41dd5831 fix ban time update / count affected rows only 2 years ago
ghost b6605b9132 implement not reachable resources ban feature with timeout to prevent extra http requests 2 years ago
ghost 702a14b634 add mime content type crawling #1 2 years ago
ghost f88d2ee9ff implement MIME content-type crawler filter 2 years ago
ghost bed5d3f149 fix offset out of bounds error 2 years ago
ghost 5999fb3a73 add distributed hosts crawling using yggo nodes manifest 2 years ago
ghost f0b2eb1613 show images total instead of pages in placeholder on image search page 2 years ago
ghost 297563d4a5 display related pages in priority to the unique host by rank, rand() order 2 years ago
ghost 34b7291228 add related to image hostpages limit 2 years ago
ghost adc791f378 fix updateTime init 2 years ago
ghost d4f66c83e7 fix image crawling errors 2 years ago
ghost baa8b0d2f0 fix data type formatting 2 years ago
ghost 79878d17fe add crawler / proxy user agent settings 2 years ago
ghost 73f212e3d7 set crawler queue order priority to item rank, rand() 2 years ago
ghost 9ed8411d2f add image queue crawler 2 years ago
ghost d905e33b4f update host images info on search requests 2 years ago
ghost 68581960a3 add image.data field 2 years ago
ghost 100d12c6ab update curl library constructor 2 years ago
ghost 250e20bbcd remove separator 2 years ago
ghost 6b18202588 implement proxied image search #1 2 years ago
ghost 0741a3e9ef implement image crawler 2 years ago
ghost 6d8f4f4882 create manifests registry 2 years ago
ghost 0bd765064b implement extended search mode support #9 2 years ago
ghost 84fd82f294 fix replacement typo #9 2 years ago
ghost d40b914983 add new chars quoting #9 2 years ago
ghost f7807cf43e add extended syntax filter to prevent sphinxql query error #9 2 years ago
ghost a5f5541395 skip robots:noindex page without extra actions 2 years ago
ghost 11aa404807 add metaYggo field index 2 years ago
ghost 8671fc4bde implement page ranking 2 years ago
ghost fcee7f62ef fix max_matches error 2 years ago