Notebook Fourteen

Transformers

Andrea Leone
University of Trento
January 2022


BERT



RoBERTa



DistilBERT



SqueezeBERT



Nyströmformer [experiment]




Results


Transformers score board

model         accuracy    precision   recall      es

BERT          .34837799   .11612599   .3           5
BERT          .87447108   .87162724   .87438515   10
BERT          .92806770   .92677159   .92359018   15
BERT          .93229901   .93357080   .92739973   20

RoBERTa       .34837799   .11612599   .3           5
RoBERTa       .68928067    —           —          10
RoBERTa       .80253878   .80112364   .79699835   15
RoBERTa       .85190409   .85917919   .83747284   20

DistilBERT    .92242595   .92319025   .91704217    5
DistilBERT    .94781382   .94554739   .94635791   10
DistilBERT    .92806770   .92449340   .92972718   15
DistilBERT    .92383638   .92122829   .92547669   20

SqueezeBERT   .90409026   .90895179   .89725193    5
SqueezeBERT   .93229901   .93061917   .93270826   10
SqueezeBERT   .95345557   .95080893   .95318207   15
SqueezeBERT   .94499294   .94611277   .94207616   20
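
The score board does not state how precision and recall are aggregated across classes. The sketch below assumes macro averaging (the unweighted mean of the per-class scores) and shows one way the three metric columns could be computed from true and predicted labels. The function name `macro_scores` and the toy labels are hypothetical and are not taken from the notebook.

```python
def macro_scores(y_true, y_pred):
    """Return (accuracy, macro precision, macro recall) for two label lists.

    Assumption: scores are macro-averaged over every label that appears
    in either list; a class with no predictions gets precision 0.
    """
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))

    # Accuracy: fraction of positions where prediction equals truth.
    accuracy = sum(t == p for t, p in pairs) / len(pairs)

    precisions, recalls = [], []
    for label in labels:
        true_pos = sum(t == label and p == label for t, p in pairs)
        predicted = sum(p == label for _, p in pairs)  # times the label was predicted
        actual = sum(t == label for t, _ in pairs)     # times the label is the truth
        precisions.append(true_pos / predicted if predicted else 0.0)
        recalls.append(true_pos / actual if actual else 0.0)

    macro_precision = sum(precisions) / len(labels)
    macro_recall = sum(recalls) / len(labels)
    return accuracy, macro_precision, macro_recall


# Toy example: three classes, a model that always predicts class 0.
accuracy, precision, recall = macro_scores([0, 0, 1, 2], [0, 0, 0, 0])
print(accuracy, precision, recall)
```

Under this assumption, a model that always predicts the same class on a three-class problem has macro precision equal to accuracy / 3 and macro recall equal to 1/3. The BERT and RoBERTa rows with es = 5 (accuracy .34837799, precision .11612599, recall .3) would be consistent with that pattern, although the notebook itself does not confirm the number of classes or the averaging mode.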