You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
mindspore-ci-bot e4451a1a49 !2464 [Dataset] code review & add citation 5 years ago
..
CMakeLists.txt remove graphengine changes 5 years ago
basic_tokenizer_op.cc BasicTokenizer not case fold on preserverd words 5 years ago
basic_tokenizer_op.h BasicTokenizer not case fold on preserverd words 5 years ago
bert_tokenizer_op.cc Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
bert_tokenizer_op.h !2306 [Dataset] Code review & improve quality 5 years ago
case_fold_op.cc Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
case_fold_op.h Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
jieba_tokenizer_op.cc Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
jieba_tokenizer_op.h Clean up work for text python package 5 years ago
lookup_op.cc Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
lookup_op.h phase I of Vocab rework 5 years ago
ngram_op.cc Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
ngram_op.h Implemented n-gram for dataset TensorOp 5 years ago
normalize_utf8_op.cc Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
normalize_utf8_op.h Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
regex_replace_op.cc Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
regex_replace_op.h Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
regex_tokenizer_op.cc Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
regex_tokenizer_op.h Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
to_number_op.cc remove graphengine changes 5 years ago
to_number_op.h remove graphengine changes 5 years ago
truncate_sequence_pair_op.cc Truncate Pair 5 years ago
truncate_sequence_pair_op.h Truncate Pair 5 years ago
unicode_char_tokenizer_op.cc Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
unicode_char_tokenizer_op.h Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
unicode_script_tokenizer_op.cc Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
unicode_script_tokenizer_op.h Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
whitespace_tokenizer_op.cc Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
whitespace_tokenizer_op.h Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp 5 years ago
wordpiece_tokenizer_op.cc !2464 [Dataset] code review & add citation 5 years ago
wordpiece_tokenizer_op.h change output of WordpieceTokenizer and BertTokenizer to 1-D string tensors 5 years ago