[EPIC]: Testground's Infrastructure as Code #147

Closed
17 of 30 tasks
Bidon15 opened this issue Jan 23, 2023 · 6 comments
Assignees
Labels
enhancement New feature or request infrastructure Everything related to K8s topic testground related to testground

Comments

@Bidon15
Member

Bidon15 commented Jan 23, 2023

Background

At the moment, the Testground infrastructure is set up with bash scripts. This solves the initial task of using EKS to execute test runs.
Nevertheless, it is not enough to reliably use Testground for:

  1. Infra module upgrades
  2. Metrics collection / aggregation
  3. Logs collection / aggregation
  4. Metrics / Logs storage
  5. CI/CD for big network tests

Hence, we need to redesign the infrastructure into layers, which will let us be more flexible with setups.

Layer 1 - Basic Infrastructure in EKS (AWS)

Goal: Set up the basics for the layers above

  1. Control Plane + node groups + security groups + OIDC
  2. EBS, EFS, Autoscaling
     • S3 -> to check
  3. ArgoCD

Terraform is how we are going to implement all 3 points.

Every item in the points above is treated as a Terraform module.
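
The Layer 1 items can be pictured as separate modules with dependencies between them. Below is a minimal Python sketch of that idea, not the actual Terraform code: the module names follow the list above, while the dependency edges and the `apply_order` helper are assumptions made purely for illustration. Terraform works out this ordering on its own from references between modules, so the sketch only shows why the pieces can live in separate modules.

```python
from graphlib import TopologicalSorter

# Hypothetical dependency map: module -> modules it waits for.
# Names mirror the Layer 1 items; the edges are assumptions, not taken from the issue.
LAYER1_MODULES = {
    "control_plane": set(),
    "security_groups": set(),
    "node_groups": {"control_plane", "security_groups"},
    "oidc": {"control_plane"},
    "ebs": {"oidc"},
    "efs": {"node_groups", "security_groups"},
    "autoscaling": {"node_groups", "oidc"},
    "argocd": {"node_groups"},
}


def apply_order(modules):
    """Return one valid order in which the modules could be applied."""
    return list(TopologicalSorter(modules).static_order())


order = apply_order(LAYER1_MODULES)
print(" -> ".join(order))
```

Several orders are valid for this graph; the only guarantee is that a module never appears before something it depends on.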

Layer 2 - Testground Applications (Mandatory)

Goal: make deployment easy, i.e. ship everything as a Helm chart

  1. Master Helm chart for Testground, which contains:
    1. Helm charts for each of the components that we currently have in functions.sh
    2. ArgoCD will take care of deployments
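
One way to picture the master chart is as a parent chart whose `Chart.yaml` lists one dependency per component. The Python sketch below builds such a document under stated assumptions: the component names `alpha` and `beta`, the version, the local `file://` chart paths, and the `umbrella_chart` helper are all hypothetical placeholders, since the actual contents of functions.sh are not listed here. The field names (`apiVersion`, `dependencies`, `repository`, `condition`) are those of Helm's v2 chart format.

```python
import json


def umbrella_chart(name, version, components):
    """Build Chart.yaml content for a parent chart with one subchart per component."""
    return {
        "apiVersion": "v2",
        "name": name,
        "version": version,
        "dependencies": [
            {
                "name": component,
                "version": version,
                # Assumed layout: each subchart lives under charts/<component>.
                "repository": f"file://charts/{component}",
                # Lets a values file switch a component on or off.
                "condition": f"{component}.enabled",
            }
            for component in components
        ],
    }


# Placeholder component names, not the real list from functions.sh.
chart = umbrella_chart("testground", "0.1.0", ["alpha", "beta"])
# Printed as JSON to avoid a third-party YAML dependency.
print(json.dumps(chart, indent=2))
```

With a layout like this, ArgoCD would only need to track the parent chart, and each component could still be toggled through its `enabled` value.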

Required Components (caveat: we still need to finalise which ones are DaemonSets, StatefulSets, or ReplicaSets, i.e. DS/STS/RS):


Goal: full visibility for the dev teams to analyse test runs

Components -> We need to finalise which ones below are DS/STS/RS:

@Bidon15 Bidon15 added enhancement New feature or request testground related to testground infrastructure Everything related to K8s topic labels Jan 23, 2023
@Bidon15 Bidon15 pinned this issue Jan 23, 2023
@Bidon15
Member Author

Bidon15 commented Jan 23, 2023

cc: @sysrex @jrmanes

@smuu
Member

smuu commented Feb 3, 2023

Why do we need to develop the Metrics Scraper in-house? Can't we use Prometheus for scraping?

@Bidon15
Member Author

Bidon15 commented Feb 20, 2023

Cosmos support: #179

@Bidon15
Member Author

Bidon15 commented Mar 10, 2023

Grooming 10/03/2023: Moved to Q2.
Probably half a day to solve the PoC with 1 availability zone.

@Bidon15
Member Author

Bidon15 commented May 12, 2023

Grooming 12/05/2023:

Putting it on the board. It will be closed by celestiaorg/devops#123

@Bidon15
Member Author

Bidon15 commented May 24, 2023

Closing as done in celestiaorg/devops#123

@Bidon15 Bidon15 closed this as completed May 24, 2023
Projects
Archived in project
Development

No branches or pull requests

4 participants