mirror of https://github.com/YGGverse/YGGo.git
YGGo! - Open Source Web Search Engine
Written out of a wish to explore the Yggdrasil ecosystem, because the only YaCy node was down. It can also be used to crawl regular websites, small business resources, and local networks.
The goal is a simple interface, a clear architecture, and lightweight server requirements, combined with effective content discovery.
Online examples
An official node that indexes only the local network:
http://94.140.114.241/yggo (web mirror)
Requirements
php 8
php-php
php-pdo
php-curl
sqlite / fts5
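
A quick way to verify these requirements before installing is a small probe script. This is only a sketch: `checkRequirements()` is a hypothetical helper that is not part of YGGo, and the extension names (`pdo`, `pdo_sqlite`, `curl`) are an assumed mapping from the package names above. FTS5 is a compile-time option of SQLite, so the script probes for it by creating a virtual table in an in-memory database.

```php
<?php
// Hypothetical helper, not part of YGGo: returns a list of unmet requirements.
function checkRequirements(): array
{
    $problems = [];

    if (PHP_MAJOR_VERSION < 8) {
        $problems[] = 'PHP 8 or newer is required, found ' . PHP_VERSION;
    }

    // Extension names as PHP reports them (assumed mapping from the package names).
    foreach (['pdo', 'pdo_sqlite', 'curl'] as $extension) {
        if (!extension_loaded($extension)) {
            $problems[] = "Missing PHP extension: $extension";
        }
    }

    // Probe FTS5 by creating a virtual table in a throwaway in-memory database.
    if (extension_loaded('pdo_sqlite')) {
        try {
            $db = new PDO('sqlite::memory:');
            $db->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);
            $db->exec('CREATE VIRTUAL TABLE probe USING fts5(content)');
        } catch (PDOException $e) {
            $problems[] = 'SQLite appears to be built without FTS5 support';
        }
    }

    return $problems;
}

// Print every problem found; no output means all checks passed.
foreach (checkRequirements() as $problem) {
    echo $problem, PHP_EOL;
}
```

Running it with the same PHP binary that the web server and cron use matters, because the CLI and web configurations can load different extensions.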
Installation
- The webroot dir is /public
- The single configuration file is located at /config/app.php, where the important options are the DB settings
- By default, the script automatically generates the database file in the /storage folder, although I still need to play a bit with environment variables in the code for that
- Set up the /crontab/crawler.php script to run every minute, though the right interval depends mostly on the configs and the target network volume
- The script has no MVC model because it is super simple. I have doubts about making it AJAX, but...
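
The automatic database generation mentioned above could look roughly like the following. This is a sketch under assumptions, not the code YGGo ships: the function name `openDatabase()`, the environment variable `YGGO_STORAGE`, the file name `example.sqlite`, and the `page` table layout are all hypothetical. It relies on two facts: SQLite creates the database file on first write, and `CREATE VIRTUAL TABLE IF NOT EXISTS` makes repeated runs (for example from cron) harmless.

```php
<?php
// Sketch only: function, file, table, and column names are hypothetical.
function openDatabase(string $storageDir, string $fileName = 'example.sqlite'): PDO
{
    // Create the storage folder on first run.
    if (!is_dir($storageDir) && !mkdir($storageDir, 0770, true) && !is_dir($storageDir)) {
        throw new RuntimeException("Cannot create storage folder: $storageDir");
    }

    // SQLite creates the database file itself when it does not exist yet.
    $db = new PDO('sqlite:' . $storageDir . '/' . $fileName);
    $db->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);

    // IF NOT EXISTS keeps repeated runs idempotent.
    $db->exec('CREATE VIRTUAL TABLE IF NOT EXISTS page USING fts5(url UNINDEXED, title, body)');

    return $db;
}

// Hypothetical environment variable, with a temp-dir fallback for this example.
$storageDir = getenv('YGGO_STORAGE') ?: sys_get_temp_dir() . '/yggo-example-storage';
$db = openDatabase($storageDir);
```

Reading the folder from an environment variable keeps the path out of the code, which is one possible answer to the environment-variable question raised in the list.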
TODO / ideas
- Full-text ranked search for web pages
- Add search results pagination
- Improve Yggdrasil link detection, add .ygg domain zone support
- Image search (basically implemented but requires testing and some performance optimization)
- Distributed index data sharing between nodes through a service API
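
The ranking and pagination items could build on the FTS5 requirement. Below is a minimal sketch, assuming a hypothetical `page` table and hypothetical function names (`toFtsQuery()`, `search()`); it is not YGGo's implementation. It orders matches with FTS5's built-in `bm25()` function, which returns lower values for better matches, and pages through them with `LIMIT` and `OFFSET`. Each user term is quoted so that punctuation in the input is not parsed as FTS5 query syntax.

```php
<?php
// Sketch only: table layout and function names are hypothetical.

// Quote every whitespace-separated term; FTS5 treats adjacent terms as AND.
function toFtsQuery(string $input): string
{
    $terms = preg_split('/\s+/', trim($input), -1, PREG_SPLIT_NO_EMPTY);

    return implode(' ', array_map(
        fn(string $term): string => '"' . str_replace('"', '""', $term) . '"',
        $terms
    ));
}

// Return one page of results, best match first.
function search(PDO $db, string $input, int $page = 1, int $perPage = 10): array
{
    $query = toFtsQuery($input);
    if ($query === '') {
        return []; // An empty MATCH expression is not valid FTS5 syntax.
    }

    $statement = $db->prepare(
        'SELECT url, title FROM page
          WHERE page MATCH :query
          ORDER BY bm25(page)
          LIMIT :limit OFFSET :offset'
    );
    $statement->bindValue(':query', $query);
    $statement->bindValue(':limit', $perPage, PDO::PARAM_INT);
    $statement->bindValue(':offset', max(0, $page - 1) * $perPage, PDO::PARAM_INT);
    $statement->execute();

    return $statement->fetchAll(PDO::FETCH_ASSOC);
}

// Demo data in an in-memory database.
$db = new PDO('sqlite::memory:');
$db->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);
$db->exec('CREATE VIRTUAL TABLE page USING fts5(url UNINDEXED, title, body)');

$insert = $db->prepare('INSERT INTO page (url, title, body) VALUES (?, ?, ?)');
foreach ([
    ['http://a.example/', 'Yggdrasil', 'yggdrasil mesh network yggdrasil'],
    ['http://b.example/', 'Notes', 'a long page that mentions yggdrasil once among many other unrelated words'],
    ['http://c.example/', 'Other', 'nothing relevant here'],
] as $row) {
    $insert->execute($row);
}

// First page, one result per page.
print_r(search($db, 'yggdrasil', 1, 1));
```

`OFFSET` pagination is the simplest option and fits a small index; for very deep result pages a keyset approach would scale better, but that is beyond this sketch.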