IMPORTANT!!!! This file needs to be updated to reflect recent changes. Information below, in the "how to run this" sections, is partially inaccurate.
This part of the project demonstrates a distributed approach to calculating π using three main components:
- Scheduler (Rust) – Coordinates jobs, assigns work chunks, and tracks completion.
- Worker (Rust) – Performs Monte Carlo simulations for a given chunk of points.
- WebApp (Python Flask) – Provides a front-end for users to create Pi calculation jobs and view progress/results in real time.
In addition to calculating pi, Hydra can also generate the Mandelbrot set using the same distributed approach. Here’s how it works:
- Image Partitioning: The Mandelbrot image is divided into horizontal rows, and each row is treated as a single work chunk. This ensures that every row of the final image is computed and no rows are skipped.
- Mapping Pixels to the Complex Plane: For each pixel in a row, the Worker maps its column and row coordinates to a corresponding complex number $c$. Typically, the real part is scaled from $-2.0$ to $1.0$ and the imaginary part from $-1.5$ to $1.5$.
- Iterative Computation: Each Worker runs an iterative algorithm for each pixel using the formula $z_{n+1} = z_n^2 + c$, starting from $z_0 = 0$. The iteration continues until either the magnitude of $z$ exceeds 2 (indicating divergence) or a preset maximum number of iterations (e.g., 300) is reached (see the sketch after this list).
- Coloring Based on Iterations:
  - Inside the Set: If the point does not diverge within the maximum iterations, it is considered to be in the Mandelbrot set and is colored black.
  - Outside the Set: If the point diverges, a color is calculated (using a hue-based method) based on the number of iterations it took to diverge. This produces the characteristic gradient seen in Mandelbrot images.
- Aggregation and Rendering: Once a Worker computes a row of pixels, it sends the pixel data (including pixel indices and corresponding colors) back to the Scheduler. The Scheduler aggregates all rows, and the WebApp periodically fetches these updates to redraw the complete image on an HTML canvas.
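A minimal Rust sketch of the pixel-to-complex mapping and escape-time iteration described above. The dimensions, iteration cap, and function names (`WIDTH`, `HEIGHT`, `MAX_ITER`, `pixel_to_complex`, `escape_time`) are illustrative assumptions, not the actual Worker code:

```rust
// Illustrative sketch of the per-pixel Mandelbrot computation described above.
const WIDTH: u32 = 800;
const HEIGHT: u32 = 600;
const MAX_ITER: u32 = 300;

/// Map a pixel (col, row) to the complex plane: real part in [-2.0, 1.0],
/// imaginary part in [-1.5, 1.5].
fn pixel_to_complex(col: u32, row: u32) -> (f64, f64) {
    let re = -2.0 + (col as f64 / WIDTH as f64) * 3.0;
    let im = -1.5 + (row as f64 / HEIGHT as f64) * 3.0;
    (re, im)
}

/// Iterate z_{n+1} = z_n^2 + c from z_0 = 0. Returns Some(iterations) if the
/// point diverges (|z| > 2), or None if it stays bounded for MAX_ITER iterations.
fn escape_time(c_re: f64, c_im: f64) -> Option<u32> {
    let (mut z_re, mut z_im) = (0.0_f64, 0.0_f64);
    for n in 0..MAX_ITER {
        // |z| > 2  <=>  |z|^2 > 4; compare squared magnitude to avoid a sqrt.
        if z_re * z_re + z_im * z_im > 4.0 {
            return Some(n);
        }
        let new_re = z_re * z_re - z_im * z_im + c_re;
        let new_im = 2.0 * z_re * z_im + c_im;
        z_re = new_re;
        z_im = new_im;
    }
    None // treated as inside the set and colored black
}

fn main() {
    // Example: compute one pixel's escape time.
    let (re, im) = pixel_to_complex(200, 300);
    match escape_time(re, im) {
        Some(n) => println!("pixel diverged after {n} iterations"),
        None => println!("pixel is inside the set (colored black)"),
    }
}
```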
- The user visits the WebApp, inputs a large number of random points (e.g., 1 billion).
- The WebApp instructs the Scheduler to create a new job.
- Workers (which can be on the same machine or multiple machines) query the Scheduler for an available job, receive a chunk of random points to compute, run the Monte Carlo step, and submit partial results back to the Scheduler.
- The Scheduler aggregates partial results (points inside the circle vs. total points) to estimate $\pi$.
- The WebApp periodically polls the Scheduler to update the progress and final result in a browser chart.
- Each Worker randomly generates points $(x, y)$ in the unit square $[0,1) \times [0,1)$.
- A point lies inside the unit circle if $x^2 + y^2 \leq 1$.
- The fraction of points inside vs. total points, multiplied by 4, approximates $\pi$. (The quarter of the unit circle that lies inside the square has area $\pi / 4$, and the square's area is 1, so a ratio of $\pi / 4$ is expected if points are uniformly distributed; see the sketch below.)
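A minimal, self-contained Rust sketch of this Monte Carlo step, assuming the `rand` crate; the function name `monte_carlo_chunk` is illustrative:

```rust
use rand::Rng;

/// Count how many of `total_points` random points fall inside the unit circle.
fn monte_carlo_chunk(total_points: u64) -> u64 {
    let mut rng = rand::thread_rng();
    let mut inside = 0u64;
    for _ in 0..total_points {
        let x: f64 = rng.gen(); // uniform in [0, 1)
        let y: f64 = rng.gen();
        if x * x + y * y <= 1.0 {
            inside += 1;
        }
    }
    inside
}

fn main() {
    let total = 1_000_000u64;
    let inside = monte_carlo_chunk(total);
    // pi ≈ 4 * (points inside the quarter circle) / (total points)
    let pi_estimate = 4.0 * inside as f64 / total as f64;
    println!("pi ≈ {pi_estimate}");
}
```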
- Instead of splitting the total points in a static way, the Scheduler assigns multiple, smaller “chunks” of points to each Worker.
- Over time, the Scheduler measures each Worker’s throughput (points computed per second).
- When a Worker requests more work, the Scheduler sizes the chunk based on that Worker’s observed performance, aiming for ~2 seconds of compute.
- This ensures that faster Workers get larger chunks and slower Workers get smaller ones, maximizing overall throughput and preventing idle time (a sizing sketch follows).
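A hypothetical sketch of how the Scheduler might size the next chunk from a Worker's measured throughput; the constants and the function name `next_chunk_size` are illustrative assumptions, not the actual Scheduler code:

```rust
const TARGET_CHUNK_SECONDS: f64 = 2.0; // aim for ~2 seconds of compute per chunk
const DEFAULT_CHUNK: u64 = 1_000_000;  // used before any throughput is measured
const MAX_CHUNK: u64 = 100_000_000;    // hard cap on a single assignment

/// Size the next chunk so it takes roughly TARGET_CHUNK_SECONDS for this Worker,
/// based on its observed average points per second.
fn next_chunk_size(avg_points_per_sec: Option<f64>, points_remaining: u64) -> u64 {
    let sized = match avg_points_per_sec {
        Some(rate) if rate > 0.0 => (rate * TARGET_CHUNK_SECONDS) as u64,
        _ => DEFAULT_CHUNK,
    };
    sized.clamp(1, MAX_CHUNK).min(points_remaining)
}

fn main() {
    // A Worker measured at ~3M points/sec with 10M points left in the job:
    println!("{}", next_chunk_size(Some(3_000_000.0), 10_000_000)); // ≈ 6_000_000
}
```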
- Language: Rust
- Location: `scheduler/` directory
- Responsibilities:
  - Maintains a list of jobs. Each job tracks the number of points, partial result, percentage complete, and status (see the sketch below).
  - Receives chunk submissions (how many points in the circle vs. total).
  - Tracks each Worker's measured speed (`avg_points_per_sec`) so it can adapt chunk sizes.
  - Cleans up inactive or stalled jobs.
  - Provides a REST API for the WebApp and Workers.
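A hypothetical sketch of the per-job state the Scheduler might keep and how a submitted chunk could be folded into it; the struct and field names are illustrative assumptions:

```rust
struct Job {
    total_points: u64,
    points_done: u64,
    points_in_circle: u64,
    status: JobState,
}

enum JobState {
    Running,
    Finished,
}

impl Job {
    /// Fold one chunk result into the running totals.
    fn submit_chunk(&mut self, in_circle: u64, total: u64) {
        self.points_in_circle += in_circle;
        self.points_done += total;
        if self.points_done >= self.total_points {
            self.status = JobState::Finished;
        }
    }

    /// Current partial estimate of pi and percent complete.
    fn progress(&self) -> (f64, f64) {
        let pi = 4.0 * self.points_in_circle as f64 / self.points_done.max(1) as f64;
        let pct = 100.0 * self.points_done as f64 / self.total_points as f64;
        (pi, pct)
    }
}

fn main() {
    let mut job = Job {
        total_points: 1_000,
        points_done: 0,
        points_in_circle: 0,
        status: JobState::Running,
    };
    job.submit_chunk(400, 500);
    let (pi, pct) = job.progress();
    println!("pi ≈ {pi:.4}, {pct:.0}% complete");
}
```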
- Endpoints (subset):
  - `POST /api/create_job`: Creates a new job (given number of points, etc.).
  - `GET /api/assign_chunk/:job_id?worker_id=...`: Returns a custom chunk for the worker, based on that worker's performance.
  - `POST /api/submit_chunk`: Worker submits chunk results.
  - `GET /api/job_status/:job_id`: WebApp queries job progress.
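One way to picture the data flowing through these endpoints is as serde-serializable shapes. These structs are assumptions for illustration (field names may differ from the real API), written with `serde` and `serde_json`:

```rust
use serde::{Deserialize, Serialize};

#[derive(Serialize)]
struct CreateJobRequest {
    total_points: u64, // e.g., 1_000_000_000
}

#[derive(Deserialize)]
struct AssignedChunk {
    job_id: u64,
    chunk_points: u64, // sized from this worker's avg_points_per_sec
}

#[derive(Serialize)]
struct SubmitChunkRequest {
    job_id: u64,
    worker_id: String,
    points_in_circle: u64,
    points_total: u64,
}

#[derive(Deserialize)]
struct JobStatusResponse {
    percent_complete: f64,
    pi_estimate: f64,
    status: String, // e.g., "running" or "finished"
}

fn main() {
    // Example request body for POST /api/create_job:
    let body = serde_json::to_string(&CreateJobRequest { total_points: 1_000_000_000 }).unwrap();
    println!("{body}"); // {"total_points":1000000000}
}
```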
- Language: Rust
- Location: `worker/` directory
- Responsibilities:
  - Registers itself with the Scheduler (`/api/register_worker`).
  - Repeatedly checks for an available job.
  - Calls `assign_chunk?worker_id=...` to get a chunk sized for its performance.
  - Performs the Monte Carlo simulation for that chunk, then `submit_chunk` with results.
  - Loops indefinitely, requesting new chunks until the job is finished or no jobs remain (see the loop sketch below).
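A hedged sketch of the Worker's main loop, assuming blocking `reqwest`, `serde_json`, and `rand` as dependencies. The endpoint paths match the list above, but the payload field names, the job-discovery step, and the error handling are simplified assumptions, not the actual Worker code:

```rust
use rand::Rng;
use std::{thread, time::Duration};

fn main() -> Result<(), Box<dyn std::error::Error>> {
    let scheduler = "https://127.0.0.1:8443";
    let worker_id = "worker-1";
    let client = reqwest::blocking::Client::builder()
        .danger_accept_invalid_certs(true) // trust the local self-signed cert
        .build()?;

    // Register once with the Scheduler.
    client
        .post(format!("{scheduler}/api/register_worker"))
        .json(&serde_json::json!({ "worker_id": worker_id }))
        .send()?;

    loop {
        // Ask for a chunk sized for this Worker's measured throughput.
        // (Job discovery is simplified; job id 1 is assumed here.)
        let chunk: serde_json::Value = client
            .get(format!("{scheduler}/api/assign_chunk/1?worker_id={worker_id}"))
            .send()?
            .json()?;

        let Some(points) = chunk["chunk_points"].as_u64() else {
            thread::sleep(Duration::from_secs(2)); // no work right now; retry later
            continue;
        };

        // Monte Carlo step (same as the earlier sketch).
        let mut rng = rand::thread_rng();
        let mut inside = 0u64;
        for _ in 0..points {
            let (x, y): (f64, f64) = (rng.gen(), rng.gen());
            if x * x + y * y <= 1.0 {
                inside += 1;
            }
        }

        // Submit the partial result back to the Scheduler.
        client
            .post(format!("{scheduler}/api/submit_chunk"))
            .json(&serde_json::json!({
                "worker_id": worker_id,
                "points_in_circle": inside,
                "points_total": points,
            }))
            .send()?;
    }
}
```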
- Language: Python (Flask)
- Location: `webapp/` directory
- Responsibilities:
  - Renders an HTML page (`index.html`) with a user form to create a new Pi calculation job.
  - Sends a request to the Scheduler to create a job, then starts a background polling process to track progress.
  - Displays progress (percent complete) and partial π approximations on a Chart.js graph in real time.
  - Can kill a previously running job if the user starts a new one.
Follow these steps to run the Scheduler, Worker, and WebApp on the same machine:
- Prerequisites
  - Rust (Cargo) installed (for building Scheduler & Worker).
  - Python 3 and `pip` (for the WebApp).
  - Local TLS certificate and key for the Scheduler (`certs/scheduler_cert.pem`, `certs/scheduler_key.pem`). Copy these into `certs/` directories in the `scheduler/` and `worker/` directories.
  - Configure or trust the self-signed certificate so the Worker can connect to `https://127.0.0.1:8443`.
- Build & Run the Scheduler

  ```bash
  cd scheduler
  cargo build --release
  cargo run
  ```

  The Scheduler listens on port 8443 by default.
- Build & Run the Worker

  ```bash
  cd worker
  cargo build --release
  cargo run
  ```

  The Worker registers itself, then polls the Scheduler for available jobs.
- Run the WebApp

  ```bash
  cd webapp
  # (Optional) python -m venv venv && source venv/bin/activate
  # pip install -r requirements.txt (if used)
  python app.py
  ```

  The Flask server starts on port 5000 by default.
- Open the Web Interface
  - Navigate to `https://127.0.0.1:5000`
  - Enter the desired number of random points and hit Calculate
  - Watch the progress bar and chart update as Workers submit chunks
- Chunk Overhead: Chunk assignments have communication overhead, so chunk sizing is important. I am still working on this.
- Scaling: You can run multiple Workers on different machines, or on the same machine, all talking to the same Scheduler.