Jesse Engel    @jesseengel
SOTA Piano transcription with a generic spectrogram2midi transformer. Audio classification often encodes a lot of domain specific priors (e.g. separate conv stacks for separate labels), but we found a domain agnostic architecture can do just as well.







 







Jesse Engel

TensorFlow

Sayak Paul

DeepMind

Aleksa Gordić

Sergey Levine