Mini Google
Course project for the Architecture of Computer Systems course.
Overview:
Architecture:
We are working on multiple components of the web crawler at the same time:
- Website backend
- Elasticsearch database backend
- Two crawlers (one in Python, and one in Rust)
- Language detection backend in Rust and Python.
Each component is intended to run as a separate Docker container, for us to be able to freely mix them in different amounts and on different computers/servers.
Progress can be tracked over here.
Usage:
Launch each container independently with instructions in respective directories, or launch all of them together:
# Rust takes some time to download and compile all of the dependencies since it
# produces static binaries, so be aware! We are figuring out a way to fix this.
docker-compose build
docker-compose up