peilin-wang
9468b49e28
bucket_batch_sizes must be strictly positive, 0 is not a valid batch size
5 years ago
ms_yan
7fa0d9e7e4
add paramter check for numpyslices and num_shards
5 years ago
liyong
68e2097897
fix split erroer message
5 years ago
heleiwang
0d52888fc5
fix misspell and check parameters
5 years ago
YangLuo
4e3bfcf4c9
!2306 [Dataset] Code review & improve quality
5 years ago
mindspore-ci-bot
78a8bc302d
!2380 Fix CocoDataset issue
Merge pull request !2380 from xiefangqi/xfq_fix_coco_issue_01
5 years ago
ms_yan
cf82aa9035
init remove storage op in c++
init remove storage op test case c++
remove source c++ files
5 years ago
xiefangqi
5703a10b8b
fix CocoDataset issue
5 years ago
ms_yan
fdafc690f8
repair problem in NumpySlicesDataset
5 years ago
panfengfeng
c745b6bd9d
回退 'Pull Request !2189 : dataset: repair some bug in NumpySlicesDataset'
5 years ago
mindspore-ci-bot
11a256536d
!2189 dataset: repair some bug in NumpySlicesDataset
Merge pull request !2189 from ms_yan/numpy_slice_repair
5 years ago
Yanjun Peng
e95220d89d
fix generatordataset para check
5 years ago
ms_yan
d162a7a5ab
repair problem in NumpySlicesDataset
5 years ago
mindspore-ci-bot
625f2421b5
!1977 random walk in gnn node2vec
Merge pull request !1977 from JonathanY/randomwalk
5 years ago
mindspore-ci-bot
670a512a9a
!2145 Replace np.sum in check_numpyslicesdataset() to correct deprecated warning
Merge pull request !2145 from cathwong/ckw_dataset_npsum_slice_warn
5 years ago
Cathy Wong
201008d49b
Replace np.sum to correct deprecated warning
5 years ago
Peilin Wang
848e07d022
initial commit, start of BucketBatchByLengthOp
c implementation done, just need to call batch/pad
added python api and validator
added pybind/de_pipeline stuff, fixed some compile errors, figure out how null py::function works
added tiny bit of doc
integrated with static batch methods
fixed some bugs
some more bug fixes and cleanup
ci fix
fix ci
ci fix
fix ci
added test_cases and debugged
addressed code review comments
addressed code review comments
ci fix
ci fix
addressed code review comments
addressed code review comments
5 years ago
Jonathan Yan
87d2c27c7f
random walk v1
5 years ago
jiangzhiwen
e0e167a000
add CLUE dataset
5 years ago
ms_yan
bc22c172b8
add TensorDataset and its ut
5 years ago
mindspore-ci-bot
b3da41bd7a
!1913 CocoDataset implementation
Merge pull request !1913 from xiefangqi/xfq_support_coco
5 years ago
xiefangqi
097cbcc0cb
CocoDataset implementation
5 years ago
heleiwang
3ece8dd090
1. support get_all_edges, get_nodes_from_edge, get_sampled_neighbors, get_neg_sampled_neighbors and graph_info API
2. mod cora and citeseer conversion script
5 years ago
mindspore-ci-bot
5eb95599f6
!1874 Updates to string tensor
Merge pull request !1874 from h.farahat/string_tensor_updates
5 years ago
hesham
f837ddc956
- Bug when empty strings sent to Python
- Support accepting Numpy of str as input
- Support batching strings
- Core logic of batch&pad is static
- Make Pad a utility function
5 years ago
liyong
feff8899ac
support padding samples
5 years ago
Jamie Nisbet
51bc0c0460
consistent design for num_samples
updates
more update
more work
more fixin
post rebase updates
clang formatting
code review recovery
ci fixes
updates
update
update
update
5 years ago
heleiwang
0b87620b73
fix input check
5 years ago
heleiwang
f28f883cac
fix comments error and modify parameter check
5 years ago
mindspore-ci-bot
6f733ec113
!1308 Stage 2 of adding support for string Tensor
Merge pull request !1308 from h.farahat/string_tensor2
5 years ago
heleiwang
599a449e0b
Support processing GNN data
6 years ago
hesham
df361d1d26
Change mem layout of string tensor
add support for MindRecord and TFRecord
----
optimize tensorshape
optimize tensorshape and FlatIndex
TFRecord and MindRecord support for string tensor
Modify mem layout
Add new constructor
Add method Allocate
Change some GetMutableBuffer usages to AllocateBuffer
5 years ago
Peilin Wang
71e8bb1960
general split case done, chaining sampler (basic case) is working
implementation 99% complete
everything and tested except for repeatable shuffling
tested most basic/typical split usecases
cleanup
some more cleanup
fix CI
more ci fix
more ci fixes
more ci fix
more ci fix
more ci fix
added more tests, fixed some bugs
some more clean up and test cases
added shard/shuffle before split warning/error
addressed code review comments and ci
fixed ci
5 years ago
ms_yan
33f3aee9da
repair format problem in concat op
5 years ago
mindspore-ci-bot
c680cfbf27
!1157 dataset: add concat operation for dataset
Merge pull request !1157 from ms_yan/concat_dataset
5 years ago
mindspore-ci-bot
ab031ee9ea
!1126 VOCDataset support object detection function
Merge pull request !1126 from xiefangqi/voc_support_detection
5 years ago
xiefangqi
c937bad53f
minddata support voc
5 years ago
ms_yan
c0fa7b4b19
init commit of concat dataset
change to use __add__ operation instead ds.concat
5 years ago
jonyguo
be2e7531ca
fix: MindDataset parameter shard_id & num_shards check
5 years ago
liyong
aa3f89e74f
mindrecord support read file list
5 years ago
mindspore-ci-bot
6c79c00a3f
!979 dataset: repair parameter check in rename Op
Merge pull request !979 from ms_yan/rename_columns
5 years ago
Zirui Wu
c2d364a573
batch with padding implemented
support for 1 specific dimension to be None, added validator
fix various CI complains
another round of CI fixes
ci
refactor parts of the code
code refactor
ci fix
comments added, fix bugs
address review comments
address review comments
review cmts
added simple perf test script
update pad code
perf imprv
6 years ago
ms_yan
d95a2e9ccf
repair num of input columns and output columns check
5 years ago
panfengfeng
5e7cef7e3d
fix generatordataset check shuffle parameter
6 years ago
Junhan Hu
c5a8ffe4f4
Add schema support for GeneratorDataset
6 years ago
yanghaitao
203b05d458
num_shards and sampler is not supported if source does not have __getitem__
6 years ago
yanghaitao
c87e174661
checkout python_multiprocess type for map
6 years ago
yanghaitao
60f2663326
Check TfFileDataset num_shards and shard_id
6 years ago
ms_yan
763aa1067e
Add Error catch for giving too many input parameters
6 years ago
mindspore-ci-bot
eda63a559a
!477 Fix VOC dataset test cases
Merge pull request !477 from xiefangqi/xfq_fix_voc
6 years ago