Abstract

This thesis presents a computational method for extracting beat positions from audio signals. A dynamic Bayesian network is used to model beat periods of various lengths and to align the predicted beat positions to the best global solution. The proposed beat tracking system is based on temporal convolutional networks, which capture the sequential structure of the audio input.

