render-config: delete and re-create config when output file already exists #771

wmTJc9IK0Q · 2025-10-06T02:26:06Z

CLAassistant · 2025-10-06T02:26:12Z

All committers have signed the CLA.

Copilot

Pull Request Overview

This PR implements a mechanism to skip the render-config operation when the output file already exists, addressing issue #770. The change adds an early exit condition to prevent unnecessary processing when the target output file is already present.

* Adds file existence check before processing render-config
* Exits early with status 0 and informational message when output file exists


matrix-tools/internal/cmd/render-config/cmd.go

benbz

Thanks for this. My preference would be to delete the file if it exists and have it re-rendered so that we know it is up to date (e.g. any external secrets / appservice registrations could change at any time). My understanding is that you should have permission to do this, as the permissions come from the parent directory rather than the file itself (as this process should be running with the user id of the owner). Does that sound like a way forward that would work for you?

wmTJc9IK0Q · 2025-10-06T15:03:22Z

I considered that but I think there is an edge case there that could cause a config to be applied that maybe shouldn't be:

* helm upgrade starts, configmap and secrets are updated
* something fails during the upgrade, before the `k8s.element.io/synapse-config-hash` and `k8s.element.io/synapse-secret-hash` labels on the statefulset are updated, which would cause a rollout of a new pod.
* if the synapse pod is restarted in this scenario, it would get an updated config which could be wrong vs. continuing to work with the previous config

I think it's slightly safer to assume immutability of the config and only have the pod change its config when the hash labels are updated.
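
The edge case above can be modelled in a few lines. This is only a sketch under stated assumptions: the thread does not show how the chart computes the `k8s.element.io/synapse-config-hash` value, so the SHA-256 digest, the 12-character truncation, and the `configHash` and `configDrifted` helpers are all hypothetical.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
)

// configHash returns a short hex digest of the config content.
// Hypothetical: the chart's real hashing scheme is not shown in this thread.
func configHash(content []byte) string {
	sum := sha256.Sum256(content)
	return hex.EncodeToString(sum[:])[:12]
}

// configDrifted reports whether the config a restarting pod would read
// differs from the config the StatefulSet's hash label was computed from.
func configDrifted(label string, mounted []byte) bool {
	return label != configHash(mounted)
}

func main() {
	previous := []byte("server_name: example.org\n")
	updated := []byte("server_name: example.org\nenable_registration: true\n")

	// The label still reflects the previous config because the upgrade
	// failed before the StatefulSet was updated.
	label := configHash(previous)

	fmt.Println("drift with previous ConfigMap:", configDrifted(label, previous))
	fmt.Println("drift after partial upgrade:", configDrifted(label, updated))
}
```

In the partial-upgrade case the label is unchanged, so no rollout is triggered, yet a pod that restarts and re-renders would pick up the updated content.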

benbz · 2025-10-07T08:30:05Z

> I considered that but I think there is an edge case there that could cause a config to be applied that maybe shouldn't be:
>
> * helm upgrade starts, configmap and secrets are updated
> * something fails during the upgrade, before the `k8s.element.io/synapse-config-hash` and `k8s.element.io/synapse-secret-hash` labels on the statefulset are updated which would cause a rollout of a new pod.
> * if the synapse pod is restarted in this scenario, it would get an updated config which could be wrong vs. continuing to work with the previous config
>
> I think it's slightly safer to assume immutability of the config and only have the pod change its config with the hash labels update.

Because of the check config hook, the (chart-provided) Synapse config won't be updated until that job has passed and the config is known to be valid. I think the likelihood of the helm upgrade being interrupted between applying the updated ConfigMaps/Secrets and applying the updated StatefulSet is too small to worry about. For other components, yes, we don't have check config hooks to validate their config, but an admin should notice a failing helm upgrade command.

There is another case where the Pod could be restarted and get different config from when helm install/helm upgrade was last run: external Secrets change, then the Pod gets evicted from a Node. In that scenario the only thing I can see us being able to do to prevent that is to give the Pod permissions to create/update Secrets in the cluster, so that it can save off the config it started with as part of the render-config init-container. I dislike that as I don't think the applications should have permissions like that in general.

Given that we're left with at least one situation where the Pods could restart with different config to what they were originally started with, I would lean towards us being consistent and saying that config is always regenerated on startup regardless of whether it is a Pod being freshly (re)created or restarting.

wmTJc9IK0Q · 2025-10-07T17:44:04Z

@benbz I have updated the PR to delete. I am testing this on my own cluster now.

github-actions · 2025-10-08T07:41:48Z

dyff of changes in rendered templates of CI manifests

No changes in rendered templates

gaelgatelement · 2025-10-08T07:54:17Z

Thanks for the PR! This will be part of the next release through https://github.com/element-hq/ess-helm/pull/782/files

wmTJc9IK0Q requested a review from a team as a code owner October 6, 2025 02:26

Copilot AI review requested due to automatic review settings October 6, 2025 02:26

Copilot AI reviewed Oct 6, 2025

wmTJc9IK0Q mentioned this pull request Oct 6, 2025

Synapse pods fail on restart due to matrix-tools render-config permissions issue #770

Closed

Skip render-config when output exists

7e35be9

wmTJc9IK0Q force-pushed the restartfix branch from 5c09d3b to 7e35be9 Compare October 6, 2025 03:24

benbz reviewed Oct 6, 2025

Delete file instead of exiting

48ba4ee

wmTJc9IK0Q changed the title ~~Skip render-config when output exists~~ render-config: delete and re-create config when output file already exists Oct 7, 2025

Fix comment

944fb21

gaelgatelement approved these changes Oct 8, 2025

gaelgatelement merged commit c7170f7 into element-hq:main Oct 8, 2025
69 of 70 checks passed
