ms_yan
cf82aa9035
init remove storage op in c++
init remove storage op test case c++
remove source c++ files
5 years ago
islam_amin
edc42c5b85
addressed comments from reviews
5 years ago
Danish Farid
c1e6ece563
my files added back in
another commit
comm2
comm3
comm4
comm5
comm6
5 years ago
peilinwang
1e36b0649f
remove graphengine changes
remove graphengine changes
concat op
Truncate Pair
concat_op
remove graph engine changes
ToNumberOp implementation almost done
ToNumberOp complete
ci fix
ci fix
ci fix
ci fix
ci fix
ci fix
ci fix
ci fix
ci fix
ci fix
merge conflicts
5 years ago
hesham
11826fb256
DuplicateOp
5 years ago
mindspore-ci-bot
90bb9320aa
!2314 Implemented RandomHorizontalFlipWithBBox and BoundingBoxAugment C++ Ops
Merge pull request !2314 from imaaamin/object_ops_pr
5 years ago
mindspore-ci-bot
a83baafbf6
!2253 Add ConcatOp to Dataset
Merge pull request !2253 from nhussain/concat_op
5 years ago
mindspore-ci-bot
3784220056
!2317 rework on dataset.text.vocab to support any user special_tokens
Merge pull request !2317 from ZiruiWu/vocab_rework
5 years ago
nhussain
5515016dba
remove graphengine changes
remove graphengine changes
concat op
Truncate Pair
concat_op
remove graph engine changes
5 years ago
islam_amin
0af156d169
added two ops for object detection, random horizontal flip with bbox and bounding box augment, along with their unit tests
5 years ago
xunxue
87aa9c8f7a
PadEndOp
5 years ago
Zirui Wu
b6e9504b31
phase I of Vocab rework
phase II vocab rework
added more test cases
fix api doc string
address review cmts and fix CI
address ci complaints
fix review cmts
ci
5 years ago
mindspore-ci-bot
5850b99161
!2211 Add TruncatePair Op to dataset
Merge pull request !2211 from h.farahat/pair_truncate
5 years ago
ms_yan
8d1dae46ac
Throw error when load config failed
5 years ago
hesham
b9495a9ccc
Truncate Pair
5 years ago
mindspore-ci-bot
ea829e89c3
!2198 Add MaskOp to dataset
Merge pull request !2198 from h.farahat/mask_op
5 years ago
qianlong
4f16f036be
Add WhitespaceTokenizer and UnicodeScriptTokenizer for nlp
add CaseFold, NormalizeUTF8
add RegexReplace
add RegexTokenizer
add BasicTokenizer
add WordpieceTokenizer
add BertTokenizer
5 years ago
hesham
f2462bb00d
Mask Op
5 years ago
mindspore-ci-bot
625f2421b5
!1977 random walk in gnn node2vec
Merge pull request !1977 from JonathanY/randomwalk
5 years ago
mindspore-ci-bot
3adbde56b2
!2162 SliceOp
Merge pull request !2162 from h.farahat/slice_op
5 years ago
liyong
d4f8f57c7e
add split in minddataset
5 years ago
hesham
e2012a1de9
Slice Op
5 years ago
mindspore-ci-bot
2cab44a637
!2028 FillOp for BERT
Merge pull request !2028 from nhussain/fillOp_learning
5 years ago
nhussain
dd9bf09f0a
added FillOp for #119 - special Ops
5 years ago
Peilin Wang
848e07d022
initial commit, start of BucketBatchByLengthOp
c implementation done, just need to call batch/pad
added python api and validator
added pybind/de_pipeline stuff, fixed some compile errors, figure out how null py::function works
added tiny bit of doc
integrated with static batch methods
fixed some bugs
some more bug fixes and cleanup
ci fix
fix ci
ci fix
fix ci
added test_cases and debugged
addressed code review comments
addressed code review comments
ci fix
ci fix
addressed code review comments
addressed code review comments
5 years ago
Jonathan Yan
87d2c27c7f
random walk v1
5 years ago
mindspore-ci-bot
a8cf83acf2
!1932 Add CLUE dataset
Merge pull request !1932 from jiangzhiwen/dataset/clue
5 years ago
jiangzhiwen
e0e167a000
add CLUE dataset
5 years ago
mindspore-ci-bot
c661087b99
!2029 Create TensorRow class that supports a row id
Merge pull request !2029 from lixiachen/lixia/tensorrow
5 years ago
Lixia Chen
6ae88c3987
Create TensorRow class that supports a row id.
5 years ago
mindspore-ci-bot
1b5d9a1194
!1981 Implemented from_dataset, a way to build a vocab from existing dataset
Merge pull request !1981 from ZiruiWu/from_dataset_dev
5 years ago
Zirui Wu
880ce5ea26
implemented from_dataset
fix compile error
more tests
address CI complaints
fix ci
address review comments
address review cmts
5 years ago
Alexey Shevlyakov
4e333a2f22
MindData profiler infrastructure
5 years ago
mindspore-ci-bot
b3da41bd7a
!1913 CocoDataset implementation
Merge pull request !1913 from xiefangqi/xfq_support_coco
5 years ago
xiefangqi
097cbcc0cb
CocoDataset implementation
5 years ago
heleiwang
3ece8dd090
1. support get_all_edges, get_nodes_from_edge, get_sampled_neighbors, get_neg_sampled_neighbors and graph_info API
2. mod cora and citeseer conversion script
5 years ago
liyong
feff8899ac
support padding samples
5 years ago
mindspore-ci-bot
5c21616293
!1807 Implemented Ngram TensorOp for dataset
Merge pull request !1807 from ZiruiWu/ngram_dev
5 years ago
Zirui Wu
dbf9936ec4
Implemented n-gram for dataset TensorOp
5 years ago
Jamie Nisbet
51bc0c0460
consistent design for num_samples
updates
more update
more work
more fixin
post rebase updates
clang formatting
code review recovery
ci fixes
updates
update
update
update
5 years ago
xiefangqi
5e4728c50f
add voc support split
5 years ago
anthony
ba2eaea7e1
Changed uniform_aug_op py::list to std::vector with some cleanup and enhancement.
5 years ago
mindspore-ci-bot
458436186c
!1365 Clean up work for text python sub-package
Merge pull request !1365 from h.farahat/text_namespace
5 years ago
hesham
6c21e556c4
Clean up work for text python package
5 years ago
heleiwang
599a449e0b
Support processing GNN data
5 years ago
mindspore-ci-bot
58e6d7d950
!1341 Added lookup and vocab to mindspore.dataset.text
Merge pull request !1341 from ZiruiWu/vocab_and_lookup
5 years ago
mindspore-ci-bot
2e3d55ed87
!1281 Implementation of SplitOp
Merge pull request !1281 from Peilin/splitOp
5 years ago
Peilin Wang
71e8bb1960
general split case done, chaining sampler (basic case) is working
implementation 99% complete
everything implemented and tested except for repeatable shuffling
tested most basic/typical split usecases
cleanup
some more cleanup
fix CI
more ci fix
more ci fixes
more ci fix
more ci fix
more ci fix
added more tests, fixed some bugs
some more clean up and test cases
added shard/shuffle before split warning/error
addressed code review comments and ci
fixed ci
5 years ago
Zirui Wu
25ab2ef303
Implemented lookup and vocab
5 years ago
qianlong
451c20a6f5
Add UnicodeCharTokenizer for nlp
5 years ago