Stas Bekman    @StasBekman
Additionally, DeepSpeed released a super-fast CUDA-kernel-based DeepSpeed-Inference for BERT, GPT-2, and GPT-Neo https://t.co/tvybS0xMX5 and https://t.co/qgd2QUB7Yl







 







Stas Bekman

Tobias Pfaff

DeepGraphLibrary

FOX College Football

Runa Capital

Jane Wang