Tutorials#

aucurriculum is designed to be flexible and extensible, allowing for the creation of custom scoring and pacing functions.

For each, a tutorial is provided below to demonstrate its implementation and configuration.

For the following tutorials, all Python files should be placed in the project root directory and all configuration files should be placed in the corresponding subdirectories of the conf/ directory.
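For example, after completing both tutorials, the project could look roughly like this (conf/config.yaml denotes the main configuration file referenced at the end of this page):

.
├── probability_score.py
├── discrete_convergence.py
└── conf/
    ├── config.yaml
    └── curriculum/
        ├── scoring/
        │   └── ProbabilityScore.yaml
        └── pacing/
            └── DiscreteConvergence.yaml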

Custom Scoring Functions#

To create a custom scoring function, inherit from AbstractScore and implement the run() method.

For example, the following model-based scoring function determines the difficulty of each sample by computing the probability (assuming higher probabilities indicate easier samples) of the most likely class (regardless of the true class):

probability_score.py#
import os

from autrainer.core.utils import Timer, set_device
from omegaconf import DictConfig
import torch
from torch.utils.data import DataLoader

from aucurriculum.curricula.scoring import AbstractScore


class ProbabilityScore(AbstractScore):
    def __init__(
        self,
        output_directory: str,
        results_dir: str,
        experiment_id: str,
        run_name: str,
        stop: str = "best",
        subset: str = "train",
    ) -> None:
        """Probability scoring function determining the difficulty of a sample
        based on the model's highest output probability (regardless of the true
        class).

        Args:
            output_directory: Directory where the scores will be stored.
            results_dir: The directory where the results are stored.
            experiment_id: The ID of the grid search experiment.
            run_name: Name or list of names of the runs to score. Runs can be
                single runs or aggregated runs.
            stop: Model state dict to load or to stop at in ["best", "last"].
                Defaults to "best".
            subset: Dataset subset to use for scoring in ["train", "dev",
                "test"]. Defaults to "train".
        """
        super().__init__(
            output_directory=output_directory,
            results_dir=results_dir,
            experiment_id=experiment_id,
            run_name=run_name,
            stop=stop,
            subset=subset,
            reverse_score=True,  # assume higher probabilities are easier
        )

    def run(
        self, config: DictConfig, run_config: DictConfig, run_name: str
    ) -> None:
        run_name, full_run_name = self.split_run_name(run_name)
        run_path = os.path.join(self.output_directory, full_run_name)
        data, model = self.prepare_data_and_model(run_config)
        dataset = self.get_dataset_subset(data, self.subset)
        batch_size = config.get("batch_size", run_config.get("batch_size", 32))
        loader = DataLoader(dataset, batch_size=batch_size)
        self.load_model_checkpoint(model, run_name)
        device = set_device(config.device)
        forward_timer = Timer(run_path, "model_forward")
        probabilities, labels = self.forward_pass(
            model=model,
            loader=loader,
            batch_size=batch_size,
            output_map_fn=self.score,
            tqdm_desc=run_name,
            disable_progress_bar=not config.get("progress_bar", False),
            device=device,
            timer=forward_timer,
        )
        forward_timer.save()
        df = self.create_dataframe(probabilities, labels, data)
        self.save_scores(df, run_path)

    def score(self, outputs: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
        """Compute the highest probability per sample regardless of the true
        class.

        Args:
            outputs: Batch of model outputs.
            y: Batch of labels.

        Returns:
            Batch of highest probability per sample.
        """
        return torch.softmax(outputs, dim=1).max(dim=1).values
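
Since the output_map_fn receives both the model outputs and the labels, the same skeleton supports other scoring criteria. As a hypothetical drop-in replacement for the score() method above (not part of the tutorial, shown only as a sketch), the difficulty could instead be based on the probability assigned to the true class:

    def score(self, outputs: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
        """Hypothetical variant: probability assigned to the true class.

        Assumes y holds integer class indices of shape (batch_size,).
        """
        probabilities = torch.softmax(outputs, dim=1)
        # pick the probability of the true class for each sample
        return probabilities.gather(1, y.long().unsqueeze(1)).squeeze(1)

With reverse_score=True passed to the base class, higher true-class probabilities would again be treated as easier samples.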

Next, create a ProbabilityScore.yaml configuration file for the scoring function in the conf/curriculum/scoring/ directory:

conf/curriculum/scoring/ProbabilityScore.yaml#
id: ProbabilityScore
type: ProbabilityScore
_target_: probability_score.ProbabilityScore
stop: best # "best" or "last"
subset: train # train, dev, test

run_name: ??? # has to be defined based on a finished run

The id should match the name of the configuration file. The _target_ should point to the custom scoring function class via a Python import path (here assuming that the probability_score.py file is in the root directory of the project).

The run_name should be a run name or list of run names from which to load the models for scoring.
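
For example, run_name can be set to a single run or to a list of runs (the run names below are placeholders; use the names of your own finished runs):

run_name: SomeFinishedRunName # placeholder: a single finished run

# or a list of runs:
run_name:
  - SomeFinishedRunName
  - AnotherFinishedRunName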

Custom Pacing Functions#

To create a custom pacing function, inherit from AbstractPace and implement the get_dataset_size() method.

For example, the following pacing function determines the dataset size at each iteration based on the convergence of the model, adding a new discrete bucket of samples when the tracking metric does not improve for a specified number of iterations:

discrete_convergence.py#
from typing import TYPE_CHECKING

from aucurriculum.curricula.pacing import AbstractPace


if TYPE_CHECKING:
    from autrainer.training import ModularTaskTrainer


class DiscreteConvergence(AbstractPace):
    def __init__(
        self,
        initial_size: float,
        final_iteration: float,
        total_iterations: int,
        dataset_size: int,
        patience: int = 1,
        min_improvement: float = 0.0,
        buckets: int = 10,
    ) -> None:
        """Discrete convergence pacing function adding a new bucket of training
        data every time the validation performance of the tracking metric does
        not improve by at least `min_improvement` for `patience` iterations.

        Args:
            initial_size: The initial fraction of the dataset to start training
                with.
            final_iteration: The fraction of training iterations at which the
                dataset size will be the full dataset size. If not all buckets
                are introduced by this iteration, the remaining buckets will be
                added immediately.
            total_iterations: The total number of training iterations.
            dataset_size: The size of the dataset.
            patience: The number of iterations to wait before adding a new
                bucket of training data. Defaults to 1.
            min_improvement: The minimum improvement in the tracking metric to
                consider as an improvement. Defaults to 0.0.
            buckets: The number of buckets to divide the remaining dataset size
                into. Defaults to 10.
        """
        super().__init__(
            initial_size, final_iteration, total_iterations, dataset_size
        )
        if patience < 1:
            raise ValueError(f"patience {patience} must be a positive integer")
        self.patience = patience
        if min_improvement < 0:
            raise ValueError(
                f"min_improvement {min_improvement} must be a non-negative float"
            )
        self.min_improvement = min_improvement
        if buckets < 1:
            raise ValueError(f"buckets {buckets} must be a positive integer")
        self.bucket_size = int((1 - initial_size) * dataset_size / buckets)
        self.current_size = int(initial_size * dataset_size)
        self.current_wait = 0

    def get_dataset_size(self, iteration: int) -> int:
        if self.total_iterations * self.final_iteration <= iteration:
            return self.dataset_size
        return self.current_size

    def convergence_criterion(self, metric: float) -> None:
        if self.metric_fn.compare(
            metric, self.current_best + self.min_improvement
        ):
            self.current_best = metric
            self.current_wait = 0
            return

        self.current_wait += 1
        if self.current_wait >= self.patience:
            size = min(self.current_size + self.bucket_size, self.dataset_size)
            self.current_size = size
            self.current_wait = 0

    def cb_on_train_begin(self, trainer: "ModularTaskTrainer") -> None:
        self.metric_fn = trainer.data.tracking_metric
        self.current_best = self.metric_fn.starting_metric
        if self.metric_fn.suffix == "min":
            self.min_improvement = -self.min_improvement

    def cb_on_val_end(
        self, trainer: "ModularTaskTrainer", iteration: int, val_results: dict
    ) -> None:
        self.convergence_criterion(val_results[self.metric_fn.name])

Next, create a DiscreteConvergence.yaml configuration file for the pacing function in the conf/curriculum/pacing/ directory:

conf/curriculum/pacing/DiscreteConvergence.yaml#
id: DiscreteConvergence
_target_: discrete_convergence.DiscreteConvergence
initial_size: ???
final_iteration: ???

patience: 5
min_improvement: 0.1
buckets: 10

The id should match the name of the configuration file. The _target_ should point to the custom pacing function class via a Python import path (here assuming that the discrete_convergence.py file is in the root directory of the project). The patience controls the number of iterations to wait without sufficient improvement before adding a new bucket of samples. The min_improvement specifies the minimum improvement in the tracking metric that still counts as progress and resets the patience counter. The buckets determines the number of discrete buckets the remaining dataset (beyond the initial fraction) is divided into.
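
As a rough illustration with hypothetical numbers (assuming the pacing function can be instantiated outside a training run for inspection): with 10,000 samples, an initial fraction of 0.1, and 10 buckets, training starts with 1,000 samples and each convergence event adds a bucket of 900 samples until the full dataset is reached:

from discrete_convergence import DiscreteConvergence

# hypothetical values for illustration only
pace = DiscreteConvergence(
    initial_size=0.1,
    final_iteration=0.8,
    total_iterations=50,
    dataset_size=10_000,
    patience=5,
    min_improvement=0.1,
    buckets=10,
)
print(pace.current_size)  # 1000 samples to start with
print(pace.bucket_size)  # 900 samples added per bucket
print(pace.get_dataset_size(40))  # 10000: full dataset from iteration 40 onwards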

Both initial_size and final_iteration serve as placeholders (indicated by ???) and are automatically passed to the pacing function configuration; they are specified in the main configuration file (e.g. conf/config.yaml).