YGGo! Distributed Web Search Engine
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
d47081 d356a9f92f
Update README.md
2 years ago
config initial commit 2 years ago
crontab initial commit 2 years ago
library initial commit 2 years ago
public fix meow 2 years ago
LICENSE Initial commit 2 years ago
README.md Update README.md 2 years ago

README.md

YGGo! - Open Source Web Search Engine

Written by inspiration to research Yggdrasil ecosystem, because of single Yacy node was down. Could be using for crawling regular websites, small business resources, local networks.

The goal - simple interface, clear architecture and lightweight server requirements but effective content discovery.

Online examples

An official node, that indexing only the local network
http://94.140.114.241/yggo (web mirror)

Requirements

php 8
php-php
php-pdo
curl-curl
sqlite / fts5

Installation

  • The webroot dir is /public
  • Single configuration file placed here /config/app.php.txt and need to be configured and renamed to /config/app.php
  • By the idea, script automaticaly generates database structure in /storage folder (where could be nice to collect other variative and tmp data - like logs, etc)
  • Set up the /crontab/crawler.php script for execution every the minute, but it mostly related of the configs and targetal network volume, there is no debug implemented yet, so let's silentize it by /dev/null
  • Script has no MVC model, because of super simple. It's is just 2 files, and everything else stored incapsulated in /library classes.

TODO / ideas

  • Web pages full text ranking search
  • Make search results pagination
  • Improve yggdrasil links detection, add .ygg domain zone support
  • Make page description visible - based on the cached content dump, when website description tag not available, add condition highlights
  • Images search (basically implemented but requires testing and some performance optimization)
  • Distributed index data sharing between the nodes trough service API
  • An idea to make unique gravatars for sites without favicons, because simpler to ident, comparing to ipv6
  • An idea to make some visitors counters, like in good old times?

Feedback

Please, feel free to share your ideas and bug reports here or use sources for your own implementations.

Have a good time.