Micro Web Crawler in PHP & Manticore

Yo!

Micro Web Crawler in PHP & Manticore

CLI

Index

Init

Create initial index

php src/cli/index/init.php [reset]
  • reset - optional, reset existing index

Document

Add

php src/cli/document/add.php URL
  • URL - add new URL to the crawl queue

Crawl

php src/cli/document/crawl.php

Search

php src/cli/document/search.php '@title "*"' [limit]
  • query - required, search query (shown above as '@title "*"')
  • limit - optional search results limit
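
The commands above can be chained into one session. The following is only a sketch, not part of the project: the `run` and `yo_session` helpers, the `DRY_RUN` variable, the placeholder URL and the limit of 10 are all assumptions made for illustration. It also assumes the scripts are called from the repository root, that a Manticore server is already reachable, and that the index has to exist before a URL is queued.

```shell
#!/bin/sh
# Hypothetical helper: execute a command, or only print it when DRY_RUN=1.
# Note: the printed form loses the original quoting of the arguments.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# Hypothetical wrapper around the documented CLI scripts.
# $1 - URL to add to the crawl queue.
# Stops at the first step that fails.
yo_session() {
  url=$1
  run php src/cli/index/init.php &&             # create initial index
  run php src/cli/document/add.php "$url" &&    # queue the URL
  run php src/cli/document/crawl.php &&         # crawl
  run php src/cli/document/search.php '@title "*"' 10   # search, limit 10 (assumed value)
}
```

To preview the commands without running them, set `DRY_RUN=1` and call `yo_session https://example.com`. Add `reset` to the init step by hand if an existing index should be dropped first.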