RAdam Warmup

class pytorch_warmup.radam.RAdamWarmup(optimizer, last_step=-1)[source]

RAdam warmup schedule.

This warmup scheme is described in On the adequacy of untuned warmup for adaptive optimization.

Parameters:
  • optimizer (Optimizer) – an Adam optimizer

  • last_step (int) – The index of last step. (Default: -1)