Music theory
- melody
- harmony
- rhythm
Dataset
- Lakh MIDI Dataset 包含 17 万多个 MIDI 文件。
https://colinraffel.com/projects/lmd/#get
- MAESTRO Dataset 古典钢琴音乐数据集。
https://magenta.tensorflow.org/datasets/maestro
- FMA (Free Music Archive) 包含各种流派的音频文件。
https://freemusicarchive.org/search/?quicksearch=&search-genre=Classical
- MusicNet 古典音乐数据集,带有乐器、音高等标注。
- Million Song Dataset 大规模音乐元数据集。
- RWC (Real World Computing) Music Database 包含流行音乐、古典音乐等多种类型。
- NSynth Dataset 由 Google 发布的单音音色数据集。
- MUSDB18 多轨音乐分离数据集。
- Nottingham Database 民谣曲调数据集。
- Bach Chorales Dataset 巴赫合唱曲数据集。
Tranditional Music Generation
- Rule-based
- Markov Chain
- Hidden Markov Model
Neural Network Music Generation
- LSTM
- GRU
- Transformer
- VAE
- GAN
- RL
Music assistant tools
Related Work
Appendix
Mixed precision training
stochastic_rounding is a technique that can be used to improve the accuracy of the model when using mixed precision training. It is a rounding technique that rounds to the nearest even integer, which can help reduce the bias introduced by rounding errors.
It is used in Adamw optimizer and encoder in the transformer model.
Parameters update in the optimizer is done in FP32, while the model weights are stored in FP16. This can lead to a loss of precision in the model weights, which can affect the accuracy of the model. stochastic_rounding can help reduce this loss of precision by rounding the model weights to the nearest even integer.