You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.

mindspore.dataset.rst 5.2 kB

4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
4 years ago
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175
  1. mindspore.dataset
  2. =================
  3. 该模块提供了加载和处理各种通用数据集的API,如MNIST、CIFAR-10、CIFAR-100、VOC、COCO、ImageNet、CelebA、CLUE等,
  4. 也支持加载业界标准格式的数据集,包括MindRecord、TFRecord、Manifest等。此外,用户还可以使用此模块定义和加载自己的数据集。
  5. 该模块还提供了在加载时进行数据采样的API,如SequentialSample、RandomSampler、DistributedSampler等。
  6. 大多数数据集可以通过指定参数 `cache` 启用缓存服务,以提升整体数据处理效率。
  7. 请注意Windows平台上还不支持缓存服务,因此在Windows上加载和处理数据时,请勿使用。更多介绍和限制,
  8. 请参考 `Single-Node Tensor Cache <https://www.mindspore.cn/docs/programming_guide/zh-CN/master/cache.html>`_。
  9. 在API示例中,常用的模块导入方法如下:
  10. .. code-block::
  11. import mindspore.dataset as ds
  12. from mindspore.dataset.transforms import c_transforms
  13. 常用数据集术语说明如下:
  14. - Dataset,所有数据集的基类,提供了数据处理方法来帮助预处理数据。
  15. - SourceDataset,一个抽象类,表示数据集管道的来源,从文件和数据库等数据源生成数据。
  16. - MappableDataset,一个抽象类,表示支持随机访问的源数据集。
  17. - Iterator,用于枚举元素的数据集迭代器的基类。
  18. 视觉
  19. -----
  20. .. mscnautosummary::
  21. :toctree: dataset
  22. :nosignatures:
  23. :template: classtemplate_inherited.rst
  24. mindspore.dataset.Caltech101Dataset
  25. mindspore.dataset.Caltech256Dataset
  26. mindspore.dataset.CelebADataset
  27. mindspore.dataset.Cifar10Dataset
  28. mindspore.dataset.Cifar100Dataset
  29. mindspore.dataset.CityscapesDataset
  30. mindspore.dataset.CocoDataset
  31. mindspore.dataset.DIV2KDataset
  32. mindspore.dataset.EMnistDataset
  33. mindspore.dataset.FakeImageDataset
  34. mindspore.dataset.FashionMnistDataset
  35. mindspore.dataset.FlickrDataset
  36. mindspore.dataset.Flowers102Dataset
  37. mindspore.dataset.ImageFolderDataset
  38. mindspore.dataset.KMnistDataset
  39. mindspore.dataset.ManifestDataset
  40. mindspore.dataset.MnistDataset
  41. mindspore.dataset.PhotoTourDataset
  42. mindspore.dataset.Places365Dataset
  43. mindspore.dataset.QMnistDataset
  44. mindspore.dataset.SBDataset
  45. mindspore.dataset.SBUDataset
  46. mindspore.dataset.SemeionDataset
  47. mindspore.dataset.STL10Dataset
  48. mindspore.dataset.SVHNDataset
  49. mindspore.dataset.USPSDataset
  50. mindspore.dataset.VOCDataset
  51. mindspore.dataset.WIDERFaceDataset
  52. 文本
  53. ----
  54. .. mscnautosummary::
  55. :toctree: dataset
  56. :nosignatures:
  57. :template: classtemplate_inherited.rst
  58. mindspore.dataset.AGNewsDataset
  59. mindspore.dataset.AmazonReviewDataset
  60. mindspore.dataset.CLUEDataset
  61. mindspore.dataset.CoNLL2000Dataset
  62. mindspore.dataset.CSVDataset
  63. mindspore.dataset.DBpediaDataset
  64. mindspore.dataset.EnWik9Dataset
  65. mindspore.dataset.IMDBDataset
  66. mindspore.dataset.IWSLT2016Dataset
  67. mindspore.dataset.IWSLT2017Dataset
  68. mindspore.dataset.PennTreebankDataset
  69. mindspore.dataset.SogouNewsDataset
  70. mindspore.dataset.TextFileDataset
  71. mindspore.dataset.UDPOSDataset
  72. mindspore.dataset.WikiTextDataset
  73. mindspore.dataset.YahooAnswersDataset
  74. mindspore.dataset.YelpReviewDataset
  75. 音频
  76. ------
  77. .. mscnautosummary::
  78. :toctree: dataset
  79. :nosignatures:
  80. :template: classtemplate_inherited.rst
  81. mindspore.dataset.LJSpeechDataset
  82. mindspore.dataset.SpeechCommandsDataset
  83. mindspore.dataset.TedliumDataset
  84. mindspore.dataset.YesNoDataset
  85. 标准格式
  86. --------
  87. .. mscnautosummary::
  88. :toctree: dataset
  89. :nosignatures:
  90. :template: classtemplate_inherited.rst
  91. mindspore.dataset.CSVDataset
  92. mindspore.dataset.MindDataset
  93. mindspore.dataset.OBSMindDataset
  94. mindspore.dataset.TFRecordDataset
  95. 用户自定义
  96. ----------
  97. .. mscnautosummary::
  98. :toctree: dataset
  99. :nosignatures:
  100. :template: classtemplate_inherited.rst
  101. mindspore.dataset.GeneratorDataset
  102. mindspore.dataset.NumpySlicesDataset
  103. mindspore.dataset.PaddedDataset
  104. mindspore.dataset.RandomDataset
  105. ---
  106. .. mscnautosummary::
  107. :toctree: dataset
  108. mindspore.dataset.GraphData
  109. 采样器
  110. -------
  111. .. mscnautosummary::
  112. :toctree: dataset
  113. mindspore.dataset.DistributedSampler
  114. mindspore.dataset.PKSampler
  115. mindspore.dataset.RandomSampler
  116. mindspore.dataset.SequentialSampler
  117. mindspore.dataset.SubsetRandomSampler
  118. mindspore.dataset.SubsetSampler
  119. mindspore.dataset.WeightedRandomSampler
  120. 其他
  121. -----
  122. .. mscnautosummary::
  123. :toctree: dataset
  124. :nosignatures:
  125. :template: classtemplate_inherited.rst
  126. mindspore.dataset.BatchInfo
  127. mindspore.dataset.DatasetCache
  128. mindspore.dataset.DSCallback
  129. mindspore.dataset.SamplingStrategy
  130. mindspore.dataset.Schema
  131. mindspore.dataset.Shuffle
  132. mindspore.dataset.WaitedDSCallback
  133. mindspore.dataset.OutputFormat
  134. mindspore.dataset.compare
  135. mindspore.dataset.deserialize
  136. mindspore.dataset.serialize
  137. mindspore.dataset.show
  138. mindspore.dataset.sync_wait_for_dataset
  139. mindspore.dataset.utils.imshow_det_bbox
  140. mindspore.dataset.zip