CLI Reference#
autrainer provides a command line interface (CLI) to manage the entire training process, including configuration management, data preprocessing, model training, inference, and postprocessing.
In addition to the CLI, autrainer provides a CLI wrapper to manage configurations, data, training, inference, and postprocessing programmatically, offering the same functionality as the CLI.
autrainer#
usage: autrainer [-h] [-v] <command> ...
A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks.
Positional Arguments#
- <command>
Possible choices: create, list, show, fetch, preprocess, train, inference, postprocess, rm-failed, rm-states, group
Named Arguments#
- -v, --version
show program’s version number and exit
Configuration Management#
To manage configurations, autrainer create, autrainer list, and autrainer show allow for creating the project structure as well as discovering and saving the default configurations provided by autrainer.
Tip
Default configurations can be discovered through the CLI, the CLI wrapper, and the respective module documentation.
autrainer create#
usage: autrainer create [-h] [-e] [-a] [-f] [directories ...]
Create a new project with default configurations.
Positional Arguments#
- directories
- Configuration directories to create. One or more of:
augmentation
dataset
model
optimizer
plotting
preprocessing
scheduler
Named Arguments#
- -e, --empty
Create an empty project without any configuration directory.
- -a, --all
Create a project with all configuration directories.
- -f, --force
Force overwrite if the configuration directory already exists.
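For example, to create a new project containing only the model and dataset configuration directories, or a project with all configuration directories, the following autrainer create CLI commands can be used:
autrainer create model dataset
autrainer create --all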
autrainer list#
usage: autrainer list [-h] [-l] [-g] [-p P] directory
List local and global configurations.
Positional Arguments#
- directory
- The directory to list configurations from. Choose from:
augmentation
dataset
model
optimizer
plotting
preprocessing
scheduler
Named Arguments#
- -l, --local-only
List local configurations only.
- -g, --global-only
List global configurations only.
- -p, --pattern
Glob pattern to filter configurations.
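For example, to list all available model configurations, or only the global model configurations matching a glob pattern (the pattern shown here is purely illustrative), the following autrainer list CLI commands can be used:
autrainer list model
autrainer list model -g -p "Efficient*"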
autrainer show#
usage: autrainer show [-h] [-s] [-f] directory config
Show and save a global configuration.
Positional Arguments#
- directory
- The directory to list configurations from. Choose from:
augmentation
dataset
model
optimizer
plotting
preprocessing
scheduler
- config
The global configuration to show. Configurations can be discovered using the ‘autrainer list’ command.
Named Arguments#
- -s, --save
Save the global configuration to the local conf/ directory.
- -f, --force
Force overwrite of the local configuration if it exists, in combination with -s/--save.
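For example, to display a global model configuration and save it to the local conf/ directory (ModelConfig is a placeholder for a configuration name discovered via autrainer list), the following autrainer show CLI command can be used:
autrainer show model ModelConfig -s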
Preprocessing#
To avoid race conditions when using Launcher Plugins that may run multiple training jobs in parallel, autrainer fetch and autrainer preprocess allow for downloading and preprocessing of Datasets (and pretrained model states) before training.
Both commands are based on the main configuration file (e.g. conf/config.yaml), such that the specified models and datasets are fetched and preprocessed accordingly. If a model or dataset is already fetched or preprocessed, it will be skipped.
autrainer fetch#
usage: autrainer fetch [-h] [-l]
Fetch the datasets and models specified in a training configuration (Hydra). For more information on Hydra’s command line flags, see: https://hydra.cc/docs/advanced/hydra-command-line-flags/.
Named Arguments#
- -l, --cfg-launcher
Use the launcher specified in the configuration instead of the Hydra basic launcher. Defaults to False.
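For example, to fetch the models and datasets of the main configuration using the launcher specified in the configuration, the following autrainer fetch CLI command can be used:
autrainer fetch -l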
autrainer preprocess#
usage: autrainer preprocess [-h] [-l] [-n N] [-u F]
Launch a data preprocessing configuration (Hydra). For more information on Hydra’s command line flags, see: https://hydra.cc/docs/advanced/hydra-command-line-flags/.
Named Arguments#
- -l, --cfg-launcher
Use the launcher specified in the configuration instead of the Hydra basic launcher. Defaults to False.
- -n, --num-workers
Number of workers (threads) to use for preprocessing. Defaults to 1.
- -u, --update-frequency
Frequency of progress bar updates for each worker (thread). If 0, the progress bar will be disabled. Defaults to 1.
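For example, to preprocess the datasets of the main configuration with 4 worker threads and a progress bar update frequency of 10, the following autrainer preprocess CLI command can be used:
autrainer preprocess -n 4 -u 10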
Training#
Training is managed by autrainer train, which starts the training process
based on the main configuration file (e.g. conf/config.yaml).
autrainer train#
usage: autrainer train [-h]
Launch a training configuration (Hydra). For more information on Hydra’s command line flags, see: https://hydra.cc/docs/advanced/hydra-command-line-flags/.
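For example, to start training with the main configuration, or with an alternative configuration file via Hydra’s standard --config-name flag (MyConfig is a placeholder for a configuration in the conf/ directory), the following autrainer train CLI commands can be used:
autrainer train
autrainer train --config-name MyConfig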
Inference#
autrainer inference allows for the (sliding window) inference of audio data using a trained model.
Both local paths and Hugging Face Hub links are supported for the model. Hugging Face Hub links are automatically downloaded and cached in the torch cache directory.
The following syntax is supported for Hugging Face Hub links: hf:repo_id[@revision][:subdir]#local_dir. This syntax consists of the following components:
- hf: The Hugging Face Hub prefix indicating that the model is fetched from the Hugging Face Hub.
- repo_id: The repository ID of the model, consisting of the user name and the model card name separated by a slash (e.g. autrainer/example).
- revision (optional): The revision as a commit hash, branch name, or tag name (e.g. main). If not specified, the latest revision is used.
- subdir (optional): The subdirectory of the repository containing the model directory (e.g. AudioModel). If not specified, the model directory is automatically inferred. If multiple models are present in the repo_id, subdir must be specified, as the correct model cannot be automatically inferred.
- local_dir (optional): The local directory to which the model is downloaded (e.g. .hf_local). If not specified, the model is placed in the torch hub cache directory.
For example, to download the model from the repository autrainer/example at the revision main from the subdirectory AudioModel and save it to the local directory .hf_local, the following autrainer inference CLI command can be used:
autrainer inference hf:autrainer/example@main:AudioModel#.hf_local input/ output/ -d cuda:0
Tip
To access private repositories, the environment variable HF_HOME should point to the Hugging Face User Access Token. To use a custom endpoint (e.g. for a self-hosted hub), the environment variable HF_ENDPOINT should point to the desired endpoint URL.
To use a local model path, the following autrainer inference CLI command can be used:
autrainer inference /path/to/AudioModel input/ output/ -d cuda:0
autrainer inference#
usage: autrainer inference [-h] [-c C] [-d D] [-e E] [-r] [-emb] [-u F] [-p P] [-w W] [-s S] [-m M] [-sr SR] model input output
Perform inference on a trained model.
Positional Arguments#
- model
Local path to model directory or Hugging Face link of the format: hf:repo_id[@revision][:subdir]#local_dir. Should contain at least one state subdirectory, the model.yaml, file_handler.yaml, target_transform.yaml, and inference_transform.yaml files.
- input
Path to input directory. Should contain audio files of the specified extension.
- output
Path to output directory. Output includes a YAML file with predictions and a CSV file with model outputs.
Named Arguments#
- -c, --checkpoint
Checkpoint to use for evaluation. Defaults to ‘_best’ (on validation set).
- -d, --device
CUDA-enabled device to use for processing. Defaults to ‘cpu’.
- -e, --extension
Type of file to look for in the input directory. Defaults to ‘wav’.
- -r, --recursive
Recursively search for files in the input directory. Defaults to False.
- -emb, --embeddings
Extract embeddings from the model in addition to predictions. For each file, a .pt file with embeddings will be saved. Defaults to False.
- -u, --update-frequency
Frequency of progress bar updates. If 0, the progress bar will be disabled. Defaults to 1.
- -p, --preprocess-cfg
Preprocessing configuration to apply to input. Can be a path to a YAML file or the name of the preprocessing configuration in the local or autrainer ‘conf/preprocessing’ directory. If ‘default’, the default preprocessing configuration used during training will be applied. If ‘None’, no preprocessing will be applied. Defaults to ‘default’.
- -w, --window-length
Window length for sliding window inference in seconds. If None, the entire input will be processed at once. Defaults to None.
- -s, --stride-length
Stride length for sliding window inference in seconds. If None, the entire input will be processed at once. Defaults to None.
- -m, --min-length
Minimum length of audio file to process in seconds. Files shorter than the minimum length are padded with zeros. Sample rate has to be specified for padding. If None, no minimum length is enforced. Defaults to None.
- -sr, --sample-rate
Sample rate of audio files in Hz. Has to be specified for sliding window inference. Defaults to None.
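For example, to perform sliding window inference with a window length of 1 second and a stride of 0.5 seconds on 16 kHz audio files using a local model, the following autrainer inference CLI command can be used:
autrainer inference /path/to/AudioModel input/ output/ -w 1 -s 0.5 -sr 16000 -d cuda:0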
Postprocessing#
Postprocessing allows for the summarization, visualization, and aggregation of the training results using autrainer postprocess. Several cleanup utilities are provided by autrainer rm-failed and autrainer rm-states. Manual grouping of the training results can be done using autrainer group.
autrainer postprocess#
usage: autrainer postprocess [-h] [-m N] [-a A [A ...]] results_dir experiment_id
Postprocess grid search results.
Positional Arguments#
- results_dir
Path to grid search results directory.
- experiment_id
ID of experiment to postprocess.
Named Arguments#
- -m, --max-runs
Maximum number of best runs to plot.
- -a, --aggregate
- Configurations to aggregate. One or more of:
augmentation
batch_size
dataset
iterations
learning_rate
model
optimizer
scheduler
seed
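For example, to postprocess an experiment, plot the 3 best runs, and aggregate the results over seeds and learning rates (results/ and my_experiment are placeholders for the results directory and experiment ID), the following autrainer postprocess CLI command can be used:
autrainer postprocess results/ my_experiment -m 3 -a seed learning_rate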
autrainer rm-failed#
usage: autrainer rm-failed [-h] [-f] results_dir experiment_id
Delete failed runs from an experiment.
Positional Arguments#
- results_dir
Path to grid search results directory.
- experiment_id
ID of experiment to postprocess.
Named Arguments#
- -f, --force
Force deletion of failed runs without confirmation. Defaults to False.
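For example, to delete all failed runs of an experiment without a confirmation prompt (results/ and my_experiment are placeholders for the results directory and experiment ID), the following autrainer rm-failed CLI command can be used:
autrainer rm-failed results/ my_experiment -f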
autrainer rm-states#
usage: autrainer rm-states [-h] [-b] [-r R [R ...]] [-i I [I ...]] results_dir experiment_id
Delete states (.pt files) from an experiment.
Positional Arguments#
- results_dir
Path to grid search results directory.
- experiment_id
ID of experiment to postprocess.
Named Arguments#
- -b, --keep-best
Keep best states. Defaults to True.
- -r, --keep-runs
Runs to keep.
- -i, --keep-iterations
Iterations to keep.
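For example, to delete the saved model states of an experiment while keeping the best states (the default behavior; results/ and my_experiment are placeholders for the results directory and experiment ID), the following autrainer rm-states CLI command can be used:
autrainer rm-states results/ my_experiment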
autrainer group#
usage: autrainer group [-h]
Launch a manual grouping of multiple grid search results (Hydra). For more information on Hydra’s command line flags, see: https://hydra.cc/docs/advanced/hydra-command-line-flags/.