nvflare.app_common.workflows.scaffold module

class Scaffold(*args, min_clients: int = 1000, num_rounds: int = 5, start_round: int = 0, persist_every_n_rounds: int = 1, **kwargs)[source]

Bases: BaseFedAvg

Controller for Scaffold Workflow. Note: This class is based on WFController. Implements [SCAFFOLD](https://proceedings.mlr.press/v119/karimireddy20a.html).

Provides the implementations for the run routine, controlling the main workflow:
  • def run(self)

The parent classes provide the default implementations for other routines.

Parameters:
  • min_clients (int, optional) – The minimum number of clients responses before Workflow starts to wait for wait_time_after_min_received. Note that the workflow will move forward when all available clients have responded regardless of this value. Defaults to 1000.

  • num_rounds (int, optional) – The total number of training rounds. Defaults to 5.

  • persistor_id (str, optional) – ID of the persistor component. Defaults to “persistor”.

  • ignore_result_error (bool, optional) – whether this controller can proceed if client result has errors. Defaults to False.

  • allow_empty_global_weights (bool, optional) – whether to allow empty global weights. Some pipelines can have empty global weights at first round, such that clients start training from scratch without any global info. Defaults to False.

  • task_check_period (float, optional) – interval for checking status of tasks. Defaults to 0.5.

  • persist_every_n_rounds (int, optional) – persist the global model every n rounds. Defaults to 1. If n is 0 then no persist.

The base controller for FedAvg Workflow. Note: This class is based on the WFController.

Implements [FederatedAveraging](https://arxiv.org/abs/1602.05629).

A model persistor can be configured via the persistor_id argument of the WFController. The model persistor is used to load the initial global model which is sent to a list of clients. Each client sends it’s updated weights after local training which is aggregated. Next, the global model is updated. The model_persistor will also save the model after training.

Provides the default implementations for the follow routines:
  • def sample_clients(self, min_clients)

  • def aggregate(self, results: List[FLModel], aggregate_fn=None) -> FLModel

  • def update_model(self, aggr_result)

The run routine needs to be implemented by the derived class:

  • def run(self)

Parameters:
  • min_clients (int, optional) – The minimum number of clients responses before Workflow starts to wait for wait_time_after_min_received. Note that the workflow will move forward when all available clients have responded regardless of this value. Defaults to 1000.

  • num_rounds (int, optional) – The total number of training rounds. Defaults to 5.

  • start_round (int, optional) – The starting round number.

  • persist_every_n_rounds (int, optional) – persist the global model every n rounds. Defaults to 1. If n is 0 then no persist.

initialize(fl_ctx)[source]

Called by the framework to initialize the Learner object. This is called before the Learner can train or validate. This is called only once.

run() None[source]

Main run routine for the controller workflow.

scaffold_aggregate_fn(results: List[FLModel]) FLModel[source]