Google Introduces New Architecture To Reduce Cost Of Transformers

Over the last few years, transformer models such as BERT, XLNet, RoBERTa, and GPT-3 have been used extensively in natural language processing (NLP) and underpin many of the field's recent advances.

