138 Commits (1b287c8d281ac963f1321cc7bd8c64272c9b4778)

Author SHA1 Message Date
ghost 297563d4a5 display related pages in priority to the unique host by rank, rand() order 2 years ago
ghost 34b7291228 add related to image hostpages limit 2 years ago
ghost adc791f378 fix updateTime init 2 years ago
ghost d4f66c83e7 fix image crawling errors 2 years ago
ghost baa8b0d2f0 fix data type formatting 2 years ago
ghost 79878d17fe add crawler / proxy user agent settings 2 years ago
ghost 73f212e3d7 set crawler queue order priority to item rank, rand() 2 years ago
ghost 9ed8411d2f add image queue crawler 2 years ago
ghost d905e33b4f update host images info on search requests 2 years ago
ghost 68581960a3 add image.data field 2 years ago
ghost 100d12c6ab update curl library constructor 2 years ago
ghost 250e20bbcd remove separator 2 years ago
ghost 6b18202588 implement proxied image search #1 2 years ago
ghost 0741a3e9ef implement image crawler 2 years ago
ghost 6d8f4f4882 create manifests registry 2 years ago
ghost 0bd765064b implement extended search mode support #9 2 years ago
ghost 84fd82f294 fix replacement typo #9 2 years ago
ghost d40b914983 add new chars quoting #9 2 years ago
ghost f7807cf43e add extended syntax filter to prevent sphinxql query error #9 2 years ago
ghost a5f5541395 skip robots:noindex page without extra actions 2 years ago
ghost 11aa404807 add metaYggo field index 2 years ago
ghost 8671fc4bde implement page ranking 2 years ago
ghost fcee7f62ef fix max_matches error 2 years ago
ghost 9916fb701f implement basic api 2 years ago
ghost e6b1e8029c add missed regex replacement rule 2 years ago
ghost 5c8d299a4a add meta:robots tag support #2 2 years ago
ghost 8e8d89db0e implement database cleaner 2 years ago
ghost df6f2a1869 implement CRAWL_ROBOTS_POSTFIX_RULES configuration #5 2 years ago
ghost 2495a2bbc7 implement MySQL/Sphinx data model #3, add basical robots.txt support #2 2 years ago
ghost c9cd38f6ac update variable names #2 2 years ago
ghost ed2d4047b4 implement robots.txt library #2 2 years ago
ghost e7e4bb686c fix curl exec double call 2 years ago
ghost ff95df72c1 implement hostname identicons 2 years ago
ghost 4ea01bf8b4 implement search results pagination 2 years ago
ghost 04dbbc3adf make url/src column ukeys digital by using crc32 2 years ago
ghost b218b8bbc3 make url/src columns unique keys, add insert/ignore construction 2 years ago
ghost d5f33ad643 add ceawl in queue notification 2 years ago
ghost 72985eaf9e initial commit 2 years ago