v0.1.0 Lachesis

@jesus-villalba jesus-villalba released this 15 Aug 23:49
· 162 commits to master since this release
937c95c
  • Make Hyperion pip installable
  • Adds installation instructions
  • Configuration files and command-line arguments are now handled with jsonargparse instead of argparse. This allows us to use yaml files and override the values in the yaml file from the command line.
  • First version using nn.DistributedDataParallel instead of nn.DataParallel
  • Supports FairScale Sharded DataParallel; so far we have not observed significant memory gains with it in our models
  • Added SpineNet, Spine2Net and TSE-Spine2Net x-vector architectures from our IS21 paper
  • Added SpecAugment PyTorch layer
  • Added numpy speed augmentation class
  • Fixed make_voxceleb2cat.pl: all speakers extracted from the same video were getting the same spkid, because the script did not take into account that more than one speaker could be extracted from each video
  • New recipes:
    • Recipe for classifying adversarial attack algorithms and threat models from our IS21 paper (voxceleb/adv.v2)
    • Recipes for adversarial attacks against speaker verification renamed to voxceleb/adv.v1 and adv.v1.1; these recipes have been updated and cleaned up
    • Recipe for SRE19-AV Audio part with AHC diarization (sre19-av-a/v2.1)
    • Recipe for Chime5 speaker verification setup chime5/v1
    • Recipes for SRE19-AV Face Recognition using pretrained RetinaFace face detector and ArcFace embeddings from InsightFace MX-Net repository (sre19-av-v/v0.1) and Insightface-Pytorch (sre19-av-v/v0.2)
    • Added VOiCES challenge recipe
    • Adds SRE20-CTS recipe v1
    • Added Spine2Net results in voxceleb/v1.1 recipe