
Synchronous config refresh #31

Merged
pedromfcarvalho merged 8 commits into rancher:master from pedromfcarvalho:load-sync
Jan 14, 2026

Conversation

Contributor

@pedromfcarvalho pedromfcarvalho commented Nov 21, 2025

Rancher uses channelserver as a library to fetch KDM. It would be useful for Rancher if it could know when a refresh completed.

Currently, channelserver doesn't expose any way to know this; it only lets callers trigger refreshes (through Wait).

This PR proposes a public function to allow synchronous refreshes.

Potentially needed for rancher/rancher#53204

Comment on lines 91 to 98

```go
select {
case c.loadQueue <- struct{}{}:
	defer func() {
		<-c.loadQueue
	}()
case <-ctx.Done():
	return ctx.Err()
}
```
Member

Are you attempting to reinvent sync.Mutex with an optional channel write/read to ensure that there are no concurrent loads? This is kind of confusing; I would probably just replace this channel select write/deferred read with TryLock() / defer Unlock().

Contributor Author

Yes, the idea is to ensure no concurrent loads, to preserve the behavior with urls[index]. But sync.Mutex doesn't take context.Context into consideration, so if another load is in progress and the context is canceled, a mutex would delay the caller returning with a canceled context until the previous load completes.

Member

Sure, but the context on both sides is a long-running controller context, not a client request context that is likely to be cancelled on a timeout. I'd say just try to get the lock, and if it can't be taken, return an error and let the caller retry.

Also note that if rancher doesn't pass in a Wait and handles 100% of reloading, there shouldn't ever be multiple overlapping calls to this function in the first place.

Contributor Author

@pedromfcarvalho pedromfcarvalho Jan 12, 2026

Actually, we do call Refresh through a norman action with a client request context when the user wants to refresh outside the usual schedule.

But I've still changed to TryLock since a collision is unlikely even with on-demand refreshes. We might also change how this is handled anyway by just having the norman action queue the object that triggers the refresh, to avoid some other problems.

@brandond
Member

brandond commented Jan 6, 2026

Requested a couple changes. The whole urls[index] thing creates a lot of extra noise in here, and it honestly seems a little broken since the list is mutated every time the config is loaded - so even if you pass in more than one URL, all the URLs after the first successful one are dropped and never used. And I don't think there are actually any cases where callers supply more than a single URL anyway.

@pedromfcarvalho
Contributor Author

pedromfcarvalho commented Jan 6, 2026

The urls[index] part is confusing, but Rancher does use it: it passes both the remote URL and the fallback "url", which is a path in the local filesystem.

I suspect this was done so that once you manage to get the data from the remote server, you never go back to the fallback, since it could be out of date. It's still a bit broken, because when the Rancher pod fails this state is reset in the new pod, and the local fallback could still end up being used.

@pedromfcarvalho pedromfcarvalho changed the title [DNM] Synchronous config refresh Synchronous config refresh Jan 12, 2026
@pedromfcarvalho pedromfcarvalho marked this pull request as ready for review January 12, 2026 16:04
```diff
-	if index, err := c.loadConfig(ctx, subKey, channelServerVersion, appName, urls...); err != nil {
-		logrus.Fatalf("Failed to load initial config from %s: %v", urls[index].URL(), err)
+	if err := c.LoadConfig(ctx); err != nil {
+		logrus.Fatalf("Failed to load initial config for %s: %v", subKey, err)
```
Contributor Author

I've removed the index from the return value since urls is now altered inside LoadConfig. But we no longer print the URL here; hopefully that's not a big deal. If it was useful, I'll go back to returning the index for logging purposes.

@pedromfcarvalho
Contributor Author

I also changed all previous fmt.Errorf to use %w instead of %v.

@brandond
Member

cc @jiaqiluo @kinarashah for additional review

Copilot AI left a comment

Pull request overview

This PR introduces a public synchronous LoadConfig method to allow Rancher to know when configuration refreshes complete. Previously, the only way to trigger refreshes was through the Wait interface, which provided no feedback on completion.

Changes:

  • Converted private loadConfig to public LoadConfig method with synchronous semantics
  • Added configuration parameters as struct fields (subKey, channelServerVersion, appName, urls) to support the new public API
  • Introduced loadMutex to prevent concurrent configuration loads
  • Made wait parameter optional (nil check added) to support usage without automatic periodic refreshes


Comment on lines 97 to 100

```go
locked := c.loadMutex.TryLock()
if !locked {
	return errors.New("configuration is already being loaded")
}
```
Copilot AI Jan 12, 2026

Using TryLock() and returning an error when a load is already in progress creates a race condition vulnerability. If a caller wants to ensure they have the latest configuration, they may call LoadConfig() but receive an error even though a load is in progress. The caller has no way to wait for the in-progress load to complete and may proceed with stale configuration.

Consider either:

  1. Using Lock() instead of TryLock() to block until the load completes, ensuring callers always get up-to-date config
  2. Providing a separate method that indicates if a load is in progress, allowing callers to handle this case appropriately
Suggested change

```diff
-	locked := c.loadMutex.TryLock()
-	if !locked {
-		return errors.New("configuration is already being loaded")
-	}
+	c.loadMutex.Lock()
```

Contributor Author

This was discussed above: callers should retry or abort if they get an error here. Lock() would not handle context timeouts.

Copilot AI left a comment

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 1 comment.



Member

@jiaqiluo jiaqiluo left a comment

LGTM

@pedromfcarvalho pedromfcarvalho merged commit b93c8ff into rancher:master Jan 14, 2026
1 check passed


5 participants