Cathy Wong
e0aaf847aa
dataset API docstring: Update/add text examples
5 years ago
Cathy Wong
e971a3aed9
dataset API docstring: Update datasets.py and config.py
5 years ago
Cathy Wong
f7adf648e9
dataset API docstring: Update datasets, samplers, graphdata and text
5 years ago
nhussain
92e99ff224
change map calls
5 years ago
mindspore-ci-bot
c45f79d36b
!5384 [MD]-Api changes
Merge pull request !5384 from nhussain/api_changes
5 years ago
Zirui Wu
bde9f18f5a
update lookup api to take in a type
ci
add test case
address some of the review cmts
address review cmts
5 years ago
nhussain
3bac9d3713
switch input columns and operation
change ImagefolderDV2 name
change ds.transforms.vision to ds.vision
change batch api to match map api more closely
compose op changes
test_pylint
remove compose op from vision, move to transform module, refactor map and batch to use column_order
5 years ago
Cathy Wong
4d4c11b133
dataset API docstring cleanup: Standard product terms NumPy, Python
5 years ago
Lixia Chen
27d5281641
Change epoch count to 1 for python testcases
5 years ago
Cathy Wong
7f6782be2a
dataset fixes: Update OneHot API docs; fixup Python UTs
5 years ago
nhussain
eb9a611041
remove old defaults
5 years ago
ms_yan
501f549bc9
modify comment for api
5 years ago
shenwei41
b2c7f597d1
Modify format
5 years ago
guansongsong
0bf6ae913a
fix python api doc for mindspore.dataset
5 years ago
ms_yan
7888f78bb8
repair api format problem in totype, totensor, slice
5 years ago
mindspore-ci-bot
21a5f06e93
!3328 Add testcases
Merge pull request !3328 from shenwei41/sw_master
5 years ago
shenwei41
4eaa396ca7
Add testcases
5 years ago
Zirui Wu
40ea84ae81
fix from_file validator minor error, 1 line change
add test case
5 years ago
xulei2020
18b519ae0f
add sentence piece
5 years ago
mindspore-ci-bot
a84092e89d
!3102 [MD] Add additional parameter checks in RandomCropWithBBoxOp and RandomResizeWithBbox
Merge pull request !3102 from nhussain/random_crop_fixes
5 years ago
nhussain
5871ba8a05
more validator fixes
5 years ago
YangLuo
4136892a3e
add SlidingWindow Op
5 years ago
Zirui Wu
63185cb20f
fix some validators errors
address review cmts
addr review cmts
5 years ago
nhussain
05b2a57d2a
fix validation errors, and fix try catch error tests
5 years ago
Jesse Lee
7c1bc5192b
Initial Drop of CacheOp Phase I
6 years ago
mindspore-ci-bot
6284c42a76
!2941 MD tokenizer support output offsets
Merge pull request !2941 from xiefangqi/md_add_offsets_feature
5 years ago
xiefangqi
47060631e5
add offsets feature to tokenizer
5 years ago
Zirui Wu
7b15e5a742
rework on lookup
add test case
fix ci
address review cmts
ci
addr review cmt
fix typo
address review cmts
add 2 more test cases
cpplint fix
addr cpplint
addr ci
fix tst case err
fix doc str
5 years ago
nhussain
6c37ea3be0
fix validators
fixed random_apply tests
fix engine validation
5 years ago
qianlong
94581f1c43
del JiebaMode and NormalizeForm from python api doc
5 years ago
YangLuo
36d1613f9a
!2464 [Dataset] code review & add citation
5 years ago
qianlong
d9f4549d13
add comment for dataset.text
5 years ago
mindspore-ci-bot
1ea38eb60c
!2375 Add Python Tokenizer
Merge pull request !2375 from h.farahat/python_tokenizer
5 years ago
mindspore-ci-bot
886dfe6fd7
!2419 Rectification and modification of dataset api documentation comments
Merge pull request !2419 from qianlong21st/temp
5 years ago
qianlong
cb01a99b08
fix dataset.text api doc
5 years ago
qianlong
980ddd32a2
change output of WordpieceTokenizer and BertTokenizer to 1-D string tensors
5 years ago
hesham
e981c67acd
Python Tokenizer
!38 Synchronize with latest Ascend software suite 17 Jun 2020
Merge pull request !38 from yanghaoran/master
5 years ago
hesham
674415f7be
Cleanup work for Concate, Mask, Slice, PadEnd and TruncatePair
5 years ago
peilinwang
1e36b0649f
remove graphengine changes
concat op
Truncate Pair
concat_op
remove graph engine changes
ToNumberOp implementation almost done
ToNumberOp complete
ci fix
merge conflicts
5 years ago
hesham
11826fb256
DuplicateOp
5 years ago
Zirui Wu
b6e9504b31
phase I of Vocab rework
phase II vocab rework
added more test cases
fix api doc string
address review cmts and fix CI
address ci complaints
fix review cmts
ci
5 years ago
Zirui Wu
8f2674850b
address API doc style and content
addr review cmts
revert
ci
private class
ci
fix ci
address ci
address review cmts
fix style err
5 years ago
hesham
b9495a9ccc
Truncate Pair
5 years ago
qianlong
4f16f036be
Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp
add CaseFold, NormalizeUTF8
add RegexReplace
add RegexTokenizer
add BasicTokenizer
add WordpieceTokenizer
add BertTokenizer
6 years ago
Zirui Wu
880ce5ea26
implemented from_dataset
fix compile error
more tests
address CI complaints
fix ci
address review comments
address review cmts
5 years ago
mindspore-ci-bot
c82a8bf483
!1678 modify print
Merge pull request !1678 from jinyaohui/print
5 years ago
Zirui Wu
dbf9936ec4
Implemented n-gram for dataset TensorOp
5 years ago
jinyaohui
5e43edc474
clean pylint
5 years ago
hesham
6c21e556c4
Clean up work for text python package
6 years ago
Zirui Wu
25ab2ef303
Implemented lookup and vocab
6 years ago