129 Commits

Author  SHA1  Message  Date
ghost  68581960a3  add image.data field  2023-05-04 05:19:29 +03:00
ghost  100d12c6ab  update curl library constructor  2023-05-04 04:55:26 +03:00
ghost  250e20bbcd  remove separator  2023-05-04 04:19:38 +03:00
ghost  6b18202588  implement proxied image search #1  2023-05-04 03:48:57 +03:00
ghost  0741a3e9ef  implement image crawler  2023-05-04 01:04:39 +03:00
ghost  6d8f4f4882  create manifests registry  2023-05-03 09:22:14 +03:00
ghost  0bd765064b  implement extended search mode support #9  2023-05-01 20:09:28 +03:00
ghost  84fd82f294  fix replacement typo #9  2023-05-01 19:03:14 +03:00
ghost  d40b914983  add new chars quoting #9  2023-05-01 18:58:03 +03:00
ghost  f7807cf43e  add extended syntax filter to prevent sphinxql query error #9  2023-05-01 18:39:46 +03:00
ghost  a5f5541395  skip robots:noindex page without extra actions  2023-04-29 08:58:48 +03:00
ghost  11aa404807  add metaYggo field index  2023-04-25 21:10:59 +03:00
ghost  8671fc4bde  implement page ranking  2023-04-25 16:54:01 +03:00
ghost  fcee7f62ef  fix max_matches error  2023-04-23 09:29:24 +03:00
ghost  9916fb701f  implement basic api  2023-04-23 03:01:51 +03:00
ghost  e6b1e8029c  add missed regex replacement rule  2023-04-10 03:18:50 +03:00
ghost  5c8d299a4a  add meta:robots tag support #2  2023-04-09 03:28:31 +03:00
ghost  8e8d89db0e  implement database cleaner  2023-04-09 00:06:28 +03:00
ghost  df6f2a1869  implement CRAWL_ROBOTS_POSTFIX_RULES configuration #5  2023-04-08 22:28:31 +03:00
ghost  2495a2bbc7  implement MySQL/Sphinx data model #3, add basical robots.txt support #2  2023-04-07 04:04:24 +03:00
ghost  c9cd38f6ac  update variable names #2  2023-04-04 01:38:32 +03:00
ghost  ed2d4047b4  implement robots.txt library #2  2023-04-04 00:27:32 +03:00
ghost  e7e4bb686c  fix curl exec double call  2023-04-03 04:47:31 +03:00
ghost  ff95df72c1  implement hostname identicons  2023-04-03 01:30:09 +03:00
ghost  4ea01bf8b4  implement search results pagination  2023-04-02 23:36:35 +03:00
ghost  04dbbc3adf  make url/src column ukeys digital by using crc32  2023-04-02 18:56:56 +03:00
ghost  b218b8bbc3  make url/src columns unique keys, add insert/ignore construction  2023-04-02 18:09:44 +03:00
ghost  d5f33ad643  add ceawl in queue notification  2023-04-02 01:30:50 +03:00
ghost  72985eaf9e  initial commit  2023-04-01 19:29:39 +03:00