38 Commits (fd90e2d517a05e3be864b11819b6c4eddb3c7d1c)

Author SHA1 Message Date
ghost fd90e2d517 keep banned pages data 1 year ago
ghost 11e02da66d memory usage optimization, rename methods, remove memchached dependency from the model 1 year ago
ghost 43776b5ff4 fix semaphores 1 year ago
ghost 9c0f361601 refactor snap storage 1 year ago
ghost 547cd6717b prevent scheduled execution on cli/yggo running 1 year ago
ghost 307eb03600 build host/host page URL in SQL query 1 year ago
ghost 6eb45fdad2 fix snap crc32name index 1 year ago
ghost 712d67f6bf implement unlimited snap storage mirrors, delete megaCMD integration 1 year ago
ghost 2c17c93e2f fix broken snaps autodelection 1 year ago
ghost 443eaec64e autodelete failed snaps 1 year ago
ghost 4298203cab make paths absolute 2 years ago
ghost d912caeb0c fix variable name 2 years ago
ghost 5346b13602 implement custom hostPageDom elements index 2 years ago
ghost dc2d971ba0 clean up banned pages extra data 2 years ago
ghost acba2816e2 remove transaction from tables optimization case 2 years ago
ghost b2cf9fc6a5 do table optimization in separated transaction 2 years ago
ghost 5d7f2bf68c fix snap foreign keys deletion 2 years ago
ghost 62a4f33b53 load missed dependency 2 years ago
ghost 45c4f7b7b0 add database optimization settings 2 years ago
ghost 81f7ea1e1e implement multi-storage snap downloads 2 years ago
ghost 1969707eeb integrate optional MEGA/cmd snap storage 2 years ago
ghost 50c9066f62 add tables optimization to the cron/cleaner task 2 years ago
ghost 0d19004e86 make local snap storage optimization 2 years ago
ghost 2f7d99079d implement local snaps 2 years ago
ghost db0e66c846 refactor to mime-based content index #1 2 years ago
ghost d186fff48f skip curl download on response data size reached 2 years ago
ghost 23ead4e12c update page / image description models, implement history snap crawling 2 years ago
ghost 0e9d29675f implement host page description history crawling 2 years ago
ghost 25b6bce2ec add crawler/cleaner logs 2 years ago
ghost dcdc2c50ad update debug string names 2 years ago
ghost ea04220de3 add curl requests debug 2 years ago
ghost b6605b9132 implement not reachable resources ban feature with timeout to prevent extra http requests 2 years ago
ghost f88d2ee9ff implement MIME content-type crawler filter 2 years ago
ghost 5999fb3a73 add distributed hosts crawling using yggo nodes manifest 2 years ago
ghost 79878d17fe add crawler / proxy user agent settings 2 years ago
ghost 0741a3e9ef implement image crawler 2 years ago
ghost eb3e70a7b7 fix robots.txt conditions 2 years ago
ghost 8e8d89db0e implement database cleaner 2 years ago