Skip to content

Latest commit

 

History

History
414 lines (309 loc) · 15.6 KB

SETUP.md

File metadata and controls

414 lines (309 loc) · 15.6 KB

Nest Setup

This is a complete and comprehensive guide of how Nest is setup!

Hardware

Nest is running on a Hetzner EX101 dedicated server, located in Helsinki, Finland. It runs with the following specs:

Proxmox

The server runs Proxmox VE 8.2.4 over Debian 12 Bookworm. It has 2 VMs: the Secure VM (ID secure, #100) and the Nest VM (ID nest #101).

Nest VM

The Nest VM is the VM that users will access and host their stuff on. It runs Debian 12 Bookworm. It's configured with all 32 CPU threads, 64 GiB of RAM, and 750GB of storage.

Resource Limits

Every user on Nest has resource limits setup to prevent abuse of Nest and to make sure there's enough resources for everyone. The default limits are:

  • 6.25% (total, technically it's 200% since there's 20 cores) CPU time
  • 2G memory (2G high, 2500M max)
  • 15G disk (hard quota)

CPU time and memory are configured through SystemD & CGroups. These default limits are defined in /etc/systemd/system/user-.slice.d/limit.conf, and can be overridden for users in /etc/systemd/system/user-<id>.slice.d/limit.conf. User overrides override the entire configuration - so if you want to increase the max memory for a user, you must specify the high memory and CPU quota in the file as well.

/etc/systemd/system/user-.slice.d/limit.conf:

[Slice]
MemoryHigh=2G
MemoryMax=2500M
CPUQuota=200%

(see https://www.freedesktop.org/software/systemd/man/latest/systemd.resource-control.html and https://www.kernel.org/doc/Documentation/cgroup-v2.txt for reference)

Note that to apply changes to either the CPU time or memory limits, you must run systemctl daemon-reload. You can verify that a limit is applied by checking the appropriate file in /sys/fs/cgroup/user.slice/user-<id>.slice (ex. memory.max for MemoryMax).

Disk space limits are configured through quota. The following command is run by Quetzal to set the default disk space limit:

setquota -u <user> 15G 15G 0 0 /

Secure VM

The Secure VM is the VM that hosts all critical Nest services:

In addition to some admin-only services:

It runs NixOS 23.05. it's configured with 20 CPU cores and 8 GiB of RAM. Configuration files can be found in the secure-vm directory.

Networking

The Nest server has 3 IPv4 addresses:

  • 37.27.51.33 (Secure VM)
  • 37.27.51.34 (Nest VM)
  • 37.27.51.35 (Proxmox)

as well as an IPv6 subnet, 2a01:4f9:3081:399c::/64. IPv6 addresses are the subnet + the last digit of their IPv4 address, for simplicity. Exceptions are Proxmox (::2) and the Backup VM, which does not have an IPv4 address and only has its ::6 IPv6 address.

DNS is configured through hackclub/dns. A tailnet, coordinated through a Headscale instance on the Secure VM, is used to access Proxmox and the Secure VM. All other inbound traffic is blocked on these hosts, as seen in the firewall configurations below.

The network configuration files are as follows:

Proxmox network config

$ sysctl -w net.ipv4.ip_forward=1

$ sysctl -w net.ipv6.conf.all.forwarding=1

/etc/network/interfaces:

auto lo
iface lo inet loopback

auto enp5s0
iface enp5s0 inet static
  address 37.27.51.35/26
  gateway 37.27.51.1

iface enp5s0 inet6 static
  address 2a01:4f9:3081:399c::2/128
  gateway fe80::1

auto vmbr0
iface vmbr0 inet static
  address 37.27.51.35/32
  bridge_ports none
  bridge_stp off
  bridge_fd 0
  up ip route add 37.27.51.33/32 dev vmbr0
  up ip route add 37.27.51.34/32 dev vmbr0

iface vmbr0 inet6 static
  address 2a01:4f9:3081:399c::2/64
  bridge_ports none
  bridge_stp off
  bridge_fd 0
  up ip -6 route add 2a01:4f9:3081:399c::3/64 dev vmbr0
  up ip -6 route add 2a01:4f9:3081:399c::4/64 dev vmbr0

$ iptables -S:

-P INPUT ACCEPT
-P FORWARD ACCEPT
-P OUTPUT ACCEPT
-N ts-forward
-N ts-input
-N ufw-after-forward
-N ufw-after-input
-N ufw-after-logging-forward
-N ufw-after-logging-input
-N ufw-after-logging-output
-N ufw-after-output
-N ufw-before-forward
-N ufw-before-input
-N ufw-before-logging-forward
-N ufw-before-logging-input
-N ufw-before-logging-output
-N ufw-before-output
-N ufw-reject-forward
-N ufw-reject-input
-N ufw-reject-output
-N ufw-track-forward
-N ufw-track-input
-N ufw-track-output
-A INPUT -j ts-input
-A INPUT -j ufw-before-logging-input
-A INPUT -j ufw-before-input
-A INPUT -j ufw-after-input
-A INPUT -j ufw-after-logging-input
-A INPUT -j ufw-reject-input
-A INPUT -j ufw-track-input
-A INPUT -d 37.27.51.35/32 -i tailscale0 -p tcp -m multiport --dports 22,8006 -j ACCEPT
-A INPUT -d 37.27.51.35/32 -p tcp -m multiport --dports 22,8006 -m conntrack --ctstate NEW -j DROP
-A FORWARD -j ts-forward
-A FORWARD -j ufw-before-logging-forward
-A FORWARD -j ufw-before-forward
-A FORWARD -j ufw-after-forward
-A FORWARD -j ufw-after-logging-forward
-A FORWARD -j ufw-reject-forward
-A FORWARD -j ufw-track-forward
-A OUTPUT -j ufw-before-logging-output
-A OUTPUT -j ufw-before-output
-A OUTPUT -j ufw-after-output
-A OUTPUT -j ufw-after-logging-output
-A OUTPUT -j ufw-reject-output
-A OUTPUT -j ufw-track-output
-A ts-forward -i tailscale0 -j MARK --set-xmark 0x40000/0xff0000
-A ts-forward -m mark --mark 0x40000/0xff0000 -j ACCEPT
-A ts-forward -s 100.64.0.0/10 -o tailscale0 -j DROP
-A ts-forward -o tailscale0 -j ACCEPT
-A ts-input -s 100.64.0.20/32 -i lo -j ACCEPT
-A ts-input -s 100.115.92.0/23 ! -i tailscale0 -j RETURN
-A ts-input -s 100.64.0.0/10 ! -i tailscale0 -j DROP
-A ts-input -i tailscale0 -j ACCEPT
-A ts-input -p udp -m udp --dport 41641 -j ACCEPT

$ ip6tables -S:

-P INPUT ACCEPT
-P FORWARD ACCEPT
-P OUTPUT ACCEPT
-N ts-forward
-N ts-input
-N ufw6-after-forward
-N ufw6-after-input
-N ufw6-after-logging-forward
-N ufw6-after-logging-input
-N ufw6-after-logging-output
-N ufw6-after-output
-N ufw6-before-forward
-N ufw6-before-input
-N ufw6-before-logging-forward
-N ufw6-before-logging-input
-N ufw6-before-logging-output
-N ufw6-before-output
-N ufw6-reject-forward
-N ufw6-reject-input
-N ufw6-reject-output
-N ufw6-track-forward
-N ufw6-track-input
-N ufw6-track-output
-A INPUT -j ts-input
-A INPUT -j ufw6-before-logging-input
-A INPUT -j ufw6-before-input
-A INPUT -j ufw6-after-input
-A INPUT -j ufw6-after-logging-input
-A INPUT -j ufw6-reject-input
-A INPUT -j ufw6-track-input
-A FORWARD -j ts-forward
-A FORWARD -j ufw6-before-logging-forward
-A FORWARD -j ufw6-before-forward
-A FORWARD -j ufw6-after-forward
-A FORWARD -j ufw6-after-logging-forward
-A FORWARD -j ufw6-reject-forward
-A FORWARD -j ufw6-track-forward
-A OUTPUT -j ufw6-before-logging-output
-A OUTPUT -j ufw6-before-output
-A OUTPUT -j ufw6-after-output
-A OUTPUT -j ufw6-after-logging-output
-A OUTPUT -j ufw6-reject-output
-A OUTPUT -j ufw6-track-output
-A ts-forward -i tailscale0 -j MARK --set-xmark 0x40000/0xff0000
-A ts-forward -m mark --mark 0x40000/0xff0000 -j ACCEPT
-A ts-forward -o tailscale0 -j ACCEPT
-A ts-input -s fd7a:115c:a1e0::14/128 -i lo -j ACCEPT
-A ts-input -i tailscale0 -j ACCEPT
-A ts-input -p udp -m udp --dport 41641 -j ACCEPT

Nest VM network config

systemd-networkd is used on the Nest VM instead of the standard networking service.

/etc/systemd/network/ens18.network:

[Match]
Name=ens18

[Network]
DHCP=false
IPv6AcceptRA=false

[Address]
Address=37.27.51.34/32
Peer=37.27.51.35/32

[Route]
Destination=0.0.0.0/0
Gateway=37.27.51.35

[Address]
Address=2a01:4f9:3081:399c::4/64

[Route]
Destination=::/0
Gateway=2a01:4f9:3081:399c::2

# de2.g-load.eu (dn42)
[Route]
Destination=2a01:4f8:c2c:3b65::1/128
Gateway=2a01:4f9:3081:399c::2

# helsinki.dn42.immibis.com aka hcl.immibis.com (dn42)
[Route]
Destination=2a01:4f9:c010:b12b::1/128
Gateway=2a01:4f9:3081:399c::2

# sto.peer.highdef.network (dn42)
[Route]
Destination=2a0e:dc0:2:7fa1::1/128
Gateway=2a01:4f9:3081:399c::2

# de1.dn42.ni.sb (dn42)
[Route]
Destination=2603:c020:8012:a322::cd17/128
Gateway=2a01:4f9:3081:399c::2

# mikrus.catgirls.systems (dn42)
[Route]
Destination=2a01:4f9:4a:5029::103/128
Gateway=2a01:4f9:3081:399c::2

# Secure VM (IPv4)
[Route]
Destination=37.27.51.33/32
Gateway=37.27.51.35

The Nest VM is configured with fail2ban to combat spamming.

Secure VM network config

The network configuration for the Secure VM can be found in the networking.nix file.

Nest Services

With the exception of Gotify, Quetzal, Prometheus/node_exporter, Forgejo, and Uptime Kuma, all services run on the Secure VM in Docker containers, configured within a Docker Compose file contained within the /opt/docker directory on the Secure VM.

Authentik

All authentication and user management for Nest is done through Nest's identity provider, Authentik, which runs in a Docker Compose project on the Secure VM (/opt/docker/authentik). The Docker Compose file is downloaded directory from Authentik. Authentik is configured with 6 providers:

  • Headscale OIDC
  • LDAP
  • MediaWiki OIDC
  • Proxmox OIDC
  • Uptime Kuma
  • Provider for Forgejo

and their respective applications. The LDAP provider is bound to an LDAP outpost (ak-outpost-ldap), managed by Authentik through a local Docker connection.

A Slack OAuth source has been configured to allow logging into Authentik through the Hack Club Slack as well.

Authentik has a few groups configured:

  • authentik Admins (compatibility reasons)
  • admins (main Nest admins)
  • nest-users (allowed to access Nest VM)

LDAP

LDAP is configured on both the Secure VM and Nest VM for user management. For the Secure VM, LDAP configuration is in /etc/nixos/users.nix. For the Nest VM, SSSD is used - its configuration is at /etc/sssd/sssd.conf.

For SSH authentication, the SSH AuthorizedKeysCommand configuration option is used in combination with a short script.

LDAP Details:

  • Hostname: ldap://identity.hackclub.app (plaintext) or ldaps://identity.hackclub.app (StartTLS - you must install the certificate named identity.hackclub.app on the machine for it to work).
  • Base DN: dc=ldap,dc=secure,dc=vm,dc=hackclub,dc=app
  • Users Organizational Unit: ou=users
  • Groups Organizational Unit: ou=groups

DN42

// todo

Forgejo

Forgejo (https://forgejo.org/) is Nest's public Git server, available at https://git.hackclub.app. It's on the Nest VM inside /srv/git/forgejo, running as the git user.

Gotify

Gotify is setup on the Nest VM to handle notifications from services such as Proxmox, in a Docker container in /root/gotify. It is in the process of being moved to the Secure VM. It is available at https://gotify.hackclub.app

Headscale

Headscale is used for administering Nest. All Nest admins can connect to it, and it is the only way through which one can access Proxmox or the Secure VM directly, due to firewall configurations. It is configured in /opt/docker/headscale/compose.yml - contents are included in headscale.yml. Configuration is default, except for OIDC configuration added at the end:

oidc:
  only_start_if_oidc_is_available: false
  issuer: "https://identity.hackclub.app/application/o/headscale/"
  client_id:
  client_secret:
  scope: ["openid", "profile"]

Guides

https://guides.hackclub.app (MediaWiki) is used for all of Nest's documentation, with the exception of this document. It's maintained and written by Nest admins and the community, and contains guides and help for anyone using Nest. MediaWiki has its Docker compose configuration at /opt/docker/guides/compose.yml - contents are in compose.yml.

The Docker compose configuration uses a custom FrankenPHP image to host the PHP application, located in /opt/docker/guides/mediawiki. The Dockerfile contents are in Dockerfile, and the Caddyfile configuration is in Caddyfile.

MediaWiki's configuration file is also documented here at LocalSettings.php.

Quetzal

Quetzal handles Nest account creation and management from Slack for easy access to users. Its code is in the directory quetzal. Since it needs to run commands on the Nest VM, it runs there, under the nest-internal user and inside a tmux session named quetzal. This repo is cloned in /home/nest-internal/nest.

Quetzal's database runs on the Secure VM. The Docker compose configuration for it is in /opt/docker/quetzal/compose.yml - contents are in quetzal.yml.

Prometheus / node_exporter

Prometheus and node_exporter have been setup on the Nest VM (nest-internal user, /home/nest-internal) to monitor the Nest VM's statistics and resources as well as report metrics from Quetzal. All data is collected in Hack Club's Grafana instance.

Uptime Kuma

Uptime Kuma monitors Nest's services and infrastructure, and alerts in Slack (#nest-meta channel) when anything goes down. It's setup on a Docker container on the Nest VM (so that it can monitor Nest VM Docker containers), in /root/uptime.

At the moment, it is setup to monitor 6 services:

  • Authentik
  • Nest Postgres
  • sshd
  • Forgejo
  • bird-lg
  • bird-lgproxy

Vaultwarden

Vaultwarden has been setup on the Secure VM to manage shared passwords, secrets, and failsafes for the Nest Admins. Vaultwarden has its Docker compose configuration at /opt/docker/vaultwarden/compose.yml - contents are in compose.yml. Users are manually managed by Vaultwarden as it does not fully support LDAP/OIDC.

Nest CLI

The Nest CLI is a collection of tools for Nest users. At the moment, its core use is in allowing users to edit the global Caddyfile and add their own domains. It also includes a command for getting a random port that you can bind your services to.

Nest CLI is written in Ruby using Thor (with the help of some bash scripts), and is located in the cli directory. On the Nest VM, this Nest repo (including the Nest CLI) is cloned in the /usr/local/nest directory.

Nest CLI requires the /var/nest-cli directory to be created.

Failsafes

The Nest admins take great care to make sure that all Nest services are as secure as possible, which involves using proper authentication and firewalls when appropriate. However, this relies upon critical services such as Authentik and Headscale, meaning that Nest admins could be prevented from fixing issues if such services go down. For this reason, a series of failsafes has been created to ensure that force reboots are never required.

These failsafes are:

  • Access to root@proxmox and root@secure-vm through SSH keys and cryptograhically random passwords.
  • Shared TOTP token for root@pam on the Proxmox Web UI
  • A multi-use, non-expiring preauth token for Headscale

These failsafes should be stored safely and securely by all Nest admins (now through Vaultwarden), and they are only to be used in case of emergency to ensure proper security and auditing.