Go to file
Askill 61dd1190b1 test 2024-07-17 19:54:31 +02:00
.github/workflows added ci 2024-07-17 19:50:34 +02:00
cache added Dockerfile and tested with updated packages 2024-07-13 15:11:29 +02:00
src refactored to utilize dependency injection to make code more testable, added some tests 2024-07-17 19:45:06 +02:00
tests refactored to utilize dependency injection to make code more testable, added some tests 2024-07-17 19:45:06 +02:00
.gitignore added k8s deployment 2024-07-16 18:10:13 +02:00
Dockerfile added k8s deployment 2024-07-16 18:10:13 +02:00
__init__.py refactored to utilize dependency injection to make code more testable, added some tests 2024-07-17 19:45:06 +02:00
deployment.yaml deploy latest 2024-07-16 19:53:37 +02:00
dev.py refactored to utilize dependency injection to make code more testable, added some tests 2024-07-17 19:45:06 +02:00
keywords.txt wip 2022-10-15 15:38:58 +02:00
license.txt license 2022-11-27 16:07:17 +01:00
main.py refactored to utilize dependency injection to make code more testable, added some tests 2024-07-17 19:45:06 +02:00
prod.py refactored to utilize dependency injection to make code more testable, added some tests 2024-07-17 19:45:06 +02:00
readme.md test 2024-07-17 19:54:31 +02:00
requirements.txt added ci 2024-07-17 19:49:53 +02:00
sites.txt added Dockerfile and tested with updated packages 2024-07-13 15:11:29 +02:00

readme.md

Optar Pytest

This tool crawles all pages on a given website to the provided deapth and finds new pages by comparing the new site tree to the cached one. All new pages are then checked for containing any of the provided keywords. If there is a match the page will be higlighted for the reader. Default timeout 1h, list of keywords and sites can be changed while the software is running.

Only retrieves static content, client side rendered content crawling is not implemented.