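mindspore.dataset
=================

This module provides APIs for loading and processing a variety of common datasets, such as MNIST, CIFAR-10, CIFAR-100, VOC, COCO, ImageNet, CelebA, and CLUE.
It also supports loading datasets in industry-standard formats, including MindRecord, TFRecord, and Manifest. In addition, users can use this module to define and load their own datasets.

The module also provides APIs for sampling data at load time, such as SequentialSampler, RandomSampler, and DistributedSampler.

Most datasets can enable the cache service by specifying the `cache` parameter, which improves overall data processing efficiency.
Note that the cache service is not yet supported on Windows, so do not use it when loading and processing data on Windows. For more details and limitations,
see `Single-Node Tensor Cache <https://www.mindspore.cn/docs/programming_guide/zh-CN/master/cache.html>`_.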

The import statements commonly used in the API examples are as follows:

.. code-block::

    import mindspore.dataset as ds
    from mindspore.dataset.transforms import c_transforms
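
Common dataset terminology is described below:

- Dataset: the base class of all datasets. It provides data processing methods that help preprocess data.
- SourceDataset: an abstract class that represents the source of a dataset pipeline. It produces data from sources such as files and databases.
- MappableDataset: an abstract class that represents a source dataset supporting random access.
- Iterator: the base class of dataset iterators, used to enumerate elements.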

Vision
------

.. mscnautosummary::
    :toctree: dataset
    :nosignatures:
    :template: classtemplate_inherited.rst

    mindspore.dataset.Caltech101Dataset
    mindspore.dataset.Caltech256Dataset
    mindspore.dataset.CelebADataset
    mindspore.dataset.Cifar10Dataset
    mindspore.dataset.Cifar100Dataset
    mindspore.dataset.CityscapesDataset
    mindspore.dataset.CocoDataset
    mindspore.dataset.DIV2KDataset
    mindspore.dataset.EMnistDataset
    mindspore.dataset.FakeImageDataset
    mindspore.dataset.FashionMnistDataset
    mindspore.dataset.FlickrDataset
    mindspore.dataset.Flowers102Dataset
    mindspore.dataset.ImageFolderDataset
    mindspore.dataset.KMnistDataset
    mindspore.dataset.ManifestDataset
    mindspore.dataset.MnistDataset
    mindspore.dataset.PhotoTourDataset
    mindspore.dataset.Places365Dataset
    mindspore.dataset.QMnistDataset
    mindspore.dataset.SBDataset
    mindspore.dataset.SBUDataset
    mindspore.dataset.SemeionDataset
    mindspore.dataset.STL10Dataset
    mindspore.dataset.SVHNDataset
    mindspore.dataset.USPSDataset
    mindspore.dataset.VOCDataset
    mindspore.dataset.WIDERFaceDataset

Text
----

.. mscnautosummary::
    :toctree: dataset
    :nosignatures:
    :template: classtemplate_inherited.rst

    mindspore.dataset.AGNewsDataset
    mindspore.dataset.AmazonReviewDataset
    mindspore.dataset.CLUEDataset
    mindspore.dataset.CoNLL2000Dataset
    mindspore.dataset.CSVDataset
    mindspore.dataset.DBpediaDataset
    mindspore.dataset.EnWik9Dataset
    mindspore.dataset.IMDBDataset
    mindspore.dataset.IWSLT2016Dataset
    mindspore.dataset.IWSLT2017Dataset
    mindspore.dataset.PennTreebankDataset
    mindspore.dataset.SogouNewsDataset
    mindspore.dataset.TextFileDataset
    mindspore.dataset.UDPOSDataset
    mindspore.dataset.WikiTextDataset
    mindspore.dataset.YahooAnswersDataset
    mindspore.dataset.YelpReviewDataset

Audio
-----

.. mscnautosummary::
    :toctree: dataset
    :nosignatures:
    :template: classtemplate_inherited.rst

    mindspore.dataset.LJSpeechDataset
    mindspore.dataset.SpeechCommandsDataset
    mindspore.dataset.TedliumDataset
    mindspore.dataset.YesNoDataset

Standard Format
---------------

.. mscnautosummary::
    :toctree: dataset
    :nosignatures:
    :template: classtemplate_inherited.rst

    mindspore.dataset.CSVDataset
    mindspore.dataset.MindDataset
    mindspore.dataset.TFRecordDataset

User Defined
------------

.. mscnautosummary::
    :toctree: dataset
    :nosignatures:
    :template: classtemplate_inherited.rst

    mindspore.dataset.GeneratorDataset
    mindspore.dataset.NumpySlicesDataset
    mindspore.dataset.PaddedDataset
    mindspore.dataset.RandomDataset

Graph
-----

.. mscnautosummary::
    :toctree: dataset

    mindspore.dataset.GraphData

Sampler
-------

.. mscnautosummary::
    :toctree: dataset

    mindspore.dataset.DistributedSampler
    mindspore.dataset.PKSampler
    mindspore.dataset.RandomSampler
    mindspore.dataset.SequentialSampler
    mindspore.dataset.SubsetRandomSampler
    mindspore.dataset.SubsetSampler
    mindspore.dataset.WeightedRandomSampler
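
To give a feel for what the sampler names suggest, the following plain-Python sketch computes the row indices that a sequential, a random, and a sharded (distributed) sampling strategy might select. It is a conceptual illustration only and is not MindSpore's implementation. The function names are hypothetical, and the strided sharding scheme is an assumption chosen for simplicity.

```python
import random


def sequential_indices(num_rows, start_index=0, num_samples=None):
    """Return row indices in storage order, optionally limited in count."""
    stop = num_rows if num_samples is None else min(num_rows, start_index + num_samples)
    return list(range(start_index, stop))


def random_indices(num_rows, num_samples=None, seed=None):
    """Return row indices in a shuffled order, without replacement."""
    rng = random.Random(seed)
    indices = list(range(num_rows))
    rng.shuffle(indices)
    return indices if num_samples is None else indices[:num_samples]


def sharded_indices(num_rows, num_shards, shard_id):
    """Return the indices for one shard using a strided split (assumed scheme)."""
    return list(range(shard_id, num_rows, num_shards))


print(sequential_indices(5))
print(random_indices(5, seed=0))
print(sharded_indices(10, num_shards=3, shard_id=1))
```

In this sketch each shard sees a disjoint subset of rows, and the shards together cover the whole dataset, which is the property a distributed sampler is generally expected to provide across training devices.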

Others
------

.. mscnautosummary::
    :toctree: dataset
    :nosignatures:
    :template: classtemplate_inherited.rst

    mindspore.dataset.BatchInfo
    mindspore.dataset.DatasetCache
    mindspore.dataset.DSCallback
    mindspore.dataset.SamplingStrategy
    mindspore.dataset.Schema
    mindspore.dataset.Shuffle
    mindspore.dataset.WaitedDSCallback
    mindspore.dataset.OutputFormat
    mindspore.dataset.compare
    mindspore.dataset.deserialize
    mindspore.dataset.serialize
    mindspore.dataset.show
    mindspore.dataset.utils.imshow_det_bbox
    mindspore.dataset.zip