[PAD]
[UNK]
[CLS]
[SEP]
this
is
a
small
bert
model
vocab
file
and
only
twenty
line
for
the
whole
text
##a