Merlin

docs
egs
1. build_your_own_voice
  1. 构建一种新语言合成模型的一般流程
2. fls_blizz ard2017
  1. 以Blizzard数据集构建一个语音合成模型，作为benchmark
3. hybrid_synthesis
  1. 混合语言合成模型，目前还在开发之中，有很多bug
4. mandarin_voice
  1. conf
    1. logging_config.conf
  2. scripts
    1. prepare_config_files.sh
      1. 准备当前流程的时域配置参数文件
      2. 准备当前流程的声学配置参数文件
    2. prepare_config_files_for_synthesis.sh
      1. 准备当前流程的语音合成时时域配置参数文件
      2. 准备当前流程的语音合成时时域配置参数文件
    3. prepare_labels_from_txt.sh
      1. 将文本转换为lab文件
      2. 1. 从txt文本使用前端工具生成scheme文件
      3. 2. 从scheme文件生成 utt文件
      4. 3. 将 festival utt转换为lab文件
      5. 4. state_align或phone_align，规范化lab
    4. remove_intermediate_files.sh
      1. 移除中间产生的文件
    5. run_phone_aligner.sh
      1. 执行音素对齐(phone align)
    6. run_state_aligner.sh
    7. submit.sh
    8. test_nan.sh
  3. 01_setup.sh
  4. 02_prepare_labels.sh
  5. 03_prepare_acoustic_features.sh
  6. 04_prepare_conf_files.sh
  7. 05_train_duration_model.sh
  8. 06_train_acoustic_model.sh
  9. 07_run_merlin.sh
  10. run_demo.sh
  11. run_mandarin_voice.sh
5. nick_hurricane
  1. Nick语料库的语音合成方法，但是数据没有公开
6. slt_arctic
  1. CMU指导的均衡语音库ARCTIC合成的语音流程
7. speaker_adaptation
  1. 发音人适应模型训练流程，
    1. 1. 构建一个多发音人的平均语音模型AVM
    2. 2. 为需要适应的发音人构建一个单独的模型
    3. 3. 将平均模型AVM适应到适应者
8. voice_conversion
  1. 将源发声着的发音特色转移到目标发音者，使用CMU ARCTIC 数据集(包含了7个发音人，每个人都念相同的文本)
misc
1. questions 问题集
  1. questions-mandarin.hed
    1. 普通话标准问题集
  2. questions-radio_dnn_416.hed
    1. 普通话问题集，包含416条问题
  3. questions-unilex_dnn_600.hed
    1. 普通话问题集，包含600条问题
2. recipes
  1. glstm
  2. dnn
  3. general_config
  4. lstm
  5. lstm_variants
  6. MGE
  7. acoustic_demo.conf
  8. duration_demo.conf
3. scripts
  1. alignment
  2. frontend
  3. hybrid_voice
  4. vocoder
  5. voice_conversion
src
test
tools
目录结构都与mandarin_voice一致，流程化不同的语音合成步骤