Code errors in swad.py

There are some implementation errors in the update_and_evaluate function: https://github.com/khanrc/swad/blob/252190eb881169e736beccf51af5caaa5c5cf666/domainbed/swad.py#L83-L123

When `min_idx == 0` is satisfied, the final_model is initialized by converge_Q[0]. For the `self.n_tolerance > self.n_converge` branch, `Q = list(self.smooth_Q)[: converge_idx + 1]` ensures that Q[-1] == converge_Q[0], thus the following updating for-loop `for model in Q[start_idx + 1 :]:` leads to two errors:
(1) converge_Q[0] is updated to final_model twice
(2) the left-most weight in Q whose end_loss <= self.threshold is omitted due to the starting index of `start_idx + 1`

A refined version is below:
```python
converge_idx = self.n_tolerance - self.n_converge
Q = list(self.smooth_Q)[: converge_idx]  # excludes converge_Q[0]，Q only contains those segments before converge_Q[0]
start_idx = 0
for i in reversed(range(len(Q))):
    if Q[i].end_loss > self.threshold:
        start_idx = i + 1
        break
for model in Q[start_idx:]:
    self.final_model.update_parameters(
        model, start_step=model.start_step, end_step=model.end_step
    )
```

	def update_and_evaluate(self, segment_swa, val_acc, val_loss, prt_fn):
	if self.dead_valley:
	return

	frozen = copy.deepcopy(segment_swa.cpu())
	frozen.end_loss = val_loss
	self.converge_Q.append(frozen)
	self.smooth_Q.append(frozen)

	if not self.is_converged:
	if len(self.converge_Q) < self.n_converge:
	return

	min_idx = np.argmin([model.end_loss for model in self.converge_Q])
	untilmin_segment_swa = self.converge_Q[min_idx] # until-min segment swa.
	if min_idx == 0:
	self.converge_step = self.converge_Q[0].end_step
	self.final_model = swa_utils.AveragedModel(untilmin_segment_swa)

	th_base = np.mean([model.end_loss for model in self.converge_Q])
	self.threshold = th_base * (1.0 + self.tolerance_ratio)

	if self.n_tolerance < self.n_converge:
	for i in range(self.n_converge - self.n_tolerance):
	model = self.converge_Q[1 + i]
	self.final_model.update_parameters(
	model, start_step=model.start_step, end_step=model.end_step
	)
	elif self.n_tolerance > self.n_converge:
	converge_idx = self.n_tolerance - self.n_converge
	Q = list(self.smooth_Q)[: converge_idx + 1]
	start_idx = 0
	for i in reversed(range(len(Q))):
	model = Q[i]
	if model.end_loss > self.threshold:
	start_idx = i + 1
	break
	for model in Q[start_idx + 1 :]:
	self.final_model.update_parameters(
	model, start_step=model.start_step, end_step=model.end_step
	)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Code errors in swad.py #28

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Code errors in swad.py #28

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions