  1. # MindSpore 1.1.1 Release Notes
  2. ## MindSpore
  3. ### API Change
  4. #### Backwards Incompatible Change
  5. ##### Python API
  6. ###### `ops.AvgPool`, `ops.MaxPool`, `ops.MaxPoolWithArgmax` change attr name from 'ksize', 'padding' to 'kernel_size', 'pad_mode' ([!11350](https://gitee.com/mindspore/mindspore/pulls/11350))
Previously, the kernel size and pad mode attrs of the pooling ops were named "ksize" and "padding", which was confusing and inconsistent with the convolution ops. They have been renamed to "kernel_size" and "pad_mode".
  8. <table>
  9. <tr>
  10. <td style="text-align:center"> 1.1.0 </td> <td style="text-align:center"> 1.1.1 </td>
  11. </tr>
  12. <tr>
  13. <td>
  14. ```python
  15. >>> import mindspore.ops as ops
  16. >>>
  17. >>> avg_pool = ops.AvgPool(ksize=2, padding='same')
  18. >>> max_pool = ops.MaxPool(ksize=2, padding='same')
  19. >>> max_pool_with_argmax = ops.MaxPoolWithArgmax(ksize=2, padding='same')
  20. ```
  21. </td>
  22. <td>
  23. ```python
  24. >>> import mindspore.ops as ops
  25. >>>
  26. >>> avg_pool = ops.AvgPool(kernel_size=2, pad_mode='same')
  27. >>> max_pool = ops.MaxPool(kernel_size=2, pad_mode='same')
  28. >>> max_pool_with_argmax = ops.MaxPoolWithArgmax(kernel_size=2, pad_mode='same')
  29. ```
  30. </td>
  31. </tr>
  32. </table>
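When migrating a larger code base, the rename can be applied mechanically. The following is a minimal sketch, assuming the keyword arguments are collected in a plain dict before the op is constructed; `migrate_pool_kwargs` is a hypothetical helper and not part of MindSpore.

```python
# Hypothetical helper (not part of MindSpore): map 1.1.0 pooling
# keyword names to their 1.1.1 equivalents.
RENAMED_POOL_ATTRS = {"ksize": "kernel_size", "padding": "pad_mode"}


def migrate_pool_kwargs(kwargs):
    """Return a copy of kwargs with the old attr names replaced."""
    migrated = {}
    for name, value in kwargs.items():
        new_name = RENAMED_POOL_ATTRS.get(name, name)
        if new_name in migrated:
            raise ValueError(f"both old and new names given for {new_name!r}")
        migrated[new_name] = value
    return migrated


old_kwargs = {"ksize": 2, "strides": 1, "padding": "same"}
new_kwargs = migrate_pool_kwargs(old_kwargs)
print(new_kwargs)
```

The resulting dict can then be unpacked into the op constructor, for example `ops.MaxPool(**new_kwargs)`.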
  33. # MindSpore 1.1.0 Release Notes
  34. ## MindSpore
  35. ### Major Features and Improvements
  36. #### NewModels
- [STABLE] GNMT v2: similar to the model described in Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation, which is mainly used for corpus translation, on WMT English-German dataset.(Ascend)
  38. - [STABLE] MaskRCNN: a conceptually simple, flexible, and general framework for object instance segmentation on COCO2017 dataset.(Ascend)
  39. - [STABLE] YOLOv4: a state-of-the-art detector which is faster and more accurate than all available alternative detectors on MS COCO dataset.(Ascend)
- [STABLE] Openpose: proposes a bottom-up human pose estimation algorithm using Part Affinity Fields on COCO2017 dataset.(Ascend)
- [STABLE] CNN-CTC: proposes three major contributions to address scene text recognition (STR) on MJSynth and SynthText datasets.(Ascend)
  42. - [STABLE] CenterFace: a practical anchor-free face detection and alignment method for edge devices on WiderFace dataset.(Ascend)
  43. - [STABLE] ShuffleNetV2: a much faster and more accurate network than the previous networks on ImageNet 2012 dataset.(GPU)
  44. - [STABLE] EfficientNet-B0: a new scaling method that uniformly scales all dimensions of depth/width/resolution using a simple yet highly effective compound coefficient on ImageNet 2012 dataset.(GPU)
- [BETA] SSD-GhostNet: based on a Ghost module structure which generates more features from cheap operations on Oxford-IIIT Pet dataset.(Ascend)
  46. - [BETA] DS-CNN: Depthwise separable convolutional neural network on Speech commands dataset.(Ascend)
  47. - [BETA] DeepPotentialH2O: A neural network model for molecular dynamics simulations. (Ascend)
  48. - [BETA] GOMO: A classical numerical method called GOMO for ocean simulation. (GPU)
  49. #### FrontEnd
  50. - [STABLE] Refactor the MINDIR to support 310 inference(Ascend).
  51. - [STABLE] The execution backend of sparse operations in optimizer can be set through 'target'. (Ascend/GPU/CPU)
- [STABLE] Support saving a specified network to a checkpoint and filtering parameters according to prefix when loading a checkpoint. (Ascend/GPU/CPU)
- [STABLE] Allow users to choose whether to load parameters into the network strictly.(Ascend/GPU/CPU)
- [STABLE] Before training in graph mode, broadcast the parameters on device 0 to the other devices so that all devices start from the same network initialization parameter values. (Ascend/GPU)
- [STABLE] Support if by if of control flow subgraph. (Ascend/GPU)
- [STABLE] Support checking whether a tensor is in a list. (Ascend/GPU/CPU)
  57. - [STABLE] Support to get a value by using the corresponding key in a dictionary in the network; Support to get keys and values of a dictionary in the network. (Ascend/GPU/CPU)
  58. - [STABLE] Support Tensor in enumerate. (Ascend/GPU/CPU)
  59. - [STABLE] Support multilevel index assignment. (Ascend/GPU/CPU)
  60. - [STABLE] Support the 'expand_as','view','abs','mean' method of Tensor. (Ascend/GPU/CPU)
  61. - [STABLE] Support ResizeBilinear operation transfer ratio. (Ascend)
  62. - [STABLE] nn.Matmul supports matrix-vector product and batched matrix multiply. (Ascend/GPU)
  63. - [STABLE] nn.Dense supports input tensor whose dimension can be greater than 2. (Ascend/GPU)
  64. - [BETA] Support higher order differentiation for partial operators.(CPU/GPU/Ascend)
  65. - [STABLE] Support Tensor Augassign.(Ascend/GPU)
  66. - [BETA] Support 22 numpy native interfaces.
  67. #### Auto Parallel
  68. - [STABLE] Support parallel optimizer with weight shard. (Ascend/GPU)
  69. - [STABLE] Support distributed operators: element-wise series, UnsortedSegmentSum, UnsortedSegmentMin, Split, BroadcastTo and Unique etc. (Ascend/GPU)
  70. - [STABLE] Support distributed model prediction. (Ascend/GPU)
  71. - [STABLE] Support auto mixed precision level "O2" in auto and semi auto parallel mode. (Ascend/GPU)
  72. - [STABLE] Add MultiFieldEmbeddingLookup high-level interface. (Ascend/GPU)
  73. #### Executor
  74. - [STABLE] ResNet50 performance optimize. (GPU)
  75. - [STABLE] Support modelzoo net in PyNative mode(Ascend 29, GPU 23, CPU 2).(Ascend/GPU/CPU)
  76. - [STABLE] Support PyNative mode on CPU.(CPU)
  77. - [STABLE] Optimize performance in PyNative mode.(Ascend/GPU/CPU)
- [STABLE] Support Safe Optimized Memory Allocation Solver (SOMAS) on Ascend to improve memory reuse; the batch size of the Bert large model (128 sequence length) is increased from 160 to 208.(Ascend)
- [BETA] Support second order differentiation in PyNative mode.(Ascend/GPU)
- [DEMO] Add distributed training in PyNative mode.(Ascend/GPU)
  81. #### MDP
  82. - [STABLE] Add new operators for Ascend and GPU: IGamma, LGamma, DiGamma;
  83. - [STABLE] Add new distributions for Ascend and GPU: LogNormal, and Logistic;
  84. - [BETA] Add new distributions for Ascend only: Gumbel, Cauchy, Gamma, Beta, and Poisson; Add Categorical distribution for GPU;
  85. - [STABLE] Add new bijectors for Ascend and GPU: GumbelCDF, Invert;
  86. - [STABLE] Add Bayesian layer realized by local reparameterization method for Ascend and GPU;
  87. - [STABLE] Add Anomaly Detection Toolbox based on VAE for Ascend and GPU.
  88. #### DataSet
  89. - [STABLE] Support single node multi-p distributed cache data sharing
  90. - [STABLE] Support GPU profiling with data processing
  91. - [STABLE] Support YOLOV3 dynamic shape in sink mode with dataset
  92. - [STABLE] Support unique processing in the data processing pipeline
  93. - [STABLE] Python layer parameter verification error information unified
  94. ### API Change
  95. #### Backwards Incompatible Change
  96. ##### Python API
  97. ###### Delete shape and dtype of class Initializer ([!7373](https://gitee.com/mindspore/mindspore/pulls/7373/files))
  98. Delete shape and dtype attributes of Initializer class.
  99. ###### Modify the return type of initializer ([!7373](https://gitee.com/mindspore/mindspore/pulls/7373/files))
Previously, the initializer function could return a string, a number, an instance of class Tensor, or an instance of a subclass of class Initializer.
After the modification, the initializer function returns an instance of class MetaTensor, class Tensor, or a subclass of class Initializer.
Note that a MetaTensor cannot be used to initialize parameters, so we recommend using a str, a number, or a subclass of Initializer for parameter initialization rather than the initializer function.
  103. <table>
  104. <tr>
  105. <td style="text-align:center"> 1.0.1 </td> <td style="text-align:center"> 1.1.0 </td>
  106. </tr>
  107. <tr>
  108. <td>
```python
>>> import mindspore.nn as nn
>>> from mindspore.common.initializer import initializer
>>> from mindspore import dtype as mstype
>>>
>>> def conv3x3(in_channels, out_channels):
...     weight = initializer('XavierUniform', shape=(3, 2, 32, 32), dtype=mstype.float32)
...     return nn.Conv2d(in_channels, out_channels, kernel_size=3, weight_init=weight, has_bias=False, pad_mode="same")
```
  118. </td>
  119. <td>
```python
>>> import mindspore.nn as nn
>>> from mindspore.common.initializer import XavierUniform
>>>
>>> # 1) using a string
>>> def conv3x3(in_channels, out_channels):
...     return nn.Conv2d(in_channels, out_channels, kernel_size=3, weight_init='XavierUniform', has_bias=False, pad_mode="same")
>>>
>>> # 2) using a subclass of class Initializer
>>> def conv3x3(in_channels, out_channels):
...     return nn.Conv2d(in_channels, out_channels, kernel_size=3, weight_init=XavierUniform(), has_bias=False, pad_mode="same")
```
  132. </td>
  133. </tr>
  134. </table>
  135. Advantages:
  136. After modification, we can use the same instance of Initializer to initialize parameters of different shapes, which was not allowed before.
  137. <table>
  138. <tr>
  139. <td style="text-align:center"> 1.0.1 </td> <td style="text-align:center"> 1.1.0 </td>
  140. </tr>
  141. <tr>
  142. <td>
  143. ```python
  144. >>> import mindspore.nn as nn
  145. >>> from mindspore.common import initializer
  146. >>> from mindspore.common.initializer import XavierUniform
  147. >>>
  148. >>> weight_init_1 = XavierUniform(gain=1.1)
  149. >>> conv1 = nn.Conv2d(3, 6, weight_init=weight_init_1)
  150. >>> weight_init_2 = XavierUniform(gain=1.1)
  151. >>> conv2 = nn.Conv2d(6, 10, weight_init=weight_init_2)
  152. ```
  153. </td>
  154. <td>
  155. ```python
  156. >>> import mindspore.nn as nn
  157. >>> from mindspore.common import initializer
  158. >>> from mindspore.common.initializer import XavierUniform
  159. >>>
  160. >>> weight_init = XavierUniform(gain=1.1)
  161. >>> conv1 = nn.Conv2d(3, 6, weight_init=weight_init)
  162. >>> conv2 = nn.Conv2d(6, 10, weight_init=weight_init)
  163. ```
  164. </td>
  165. </tr>
  166. </table>
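One way to see why a shape-free initializer can be reused: the shape is supplied only when a parameter is actually created. The sketch below is plain Python, not MindSpore code; `XavierUniformSketch` and its `bound` method are hypothetical names, and the fan-in/fan-out formula is the standard Xavier uniform rule applied to an assumed weight shape of (out_channels, in_channels, kernel_h, kernel_w).

```python
import math


class XavierUniformSketch:
    """Hypothetical stand-in: stores only the gain, never a shape."""

    def __init__(self, gain=1.0):
        self.gain = gain

    def bound(self, shape):
        """Sampling bound for a conv weight of the given 4-D shape."""
        out_channels, in_channels, kernel_h, kernel_w = shape
        receptive_field = kernel_h * kernel_w
        fan_in = in_channels * receptive_field
        fan_out = out_channels * receptive_field
        return self.gain * math.sqrt(6.0 / (fan_in + fan_out))


weight_init = XavierUniformSketch(gain=1.1)
# The same instance serves two layers with different weight shapes
# (3x3 kernels are an assumption made for this illustration).
bound_conv1 = weight_init.bound((6, 3, 3, 3))
bound_conv2 = weight_init.bound((10, 6, 3, 3))
print(bound_conv1, bound_conv2)
```

Because the instance holds no shape or dtype, nothing ties it to a single layer.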
  167. ###### Modify get_seed function ([!7429](https://gitee.com/mindspore/mindspore/pulls/7429/files))
The implementation of the get_seed function has been modified.
Previously, if no seed was set, a default seed value was used, so parameters initialized by the normal function were the same on every run.
After the modification, if no seed is set, the seed is generated randomly, so the initialized parameters change with the random seed.
To fix the initial values of the parameters, we suggest setting a seed.
  172. ```python
  173. >>> from mindspore.common import set_seed
  174. >>> set_seed(1)
  175. ```
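The behavioral change can be pictured with Python's standard `random` module. This is an analogy under stated assumptions, not MindSpore's implementation: `resolve_seed` is a hypothetical function that returns the user-supplied seed if there is one and otherwise draws a fresh one.

```python
import random


def resolve_seed(user_seed=None):
    """Hypothetical model of the 1.1.0 behavior: fall back to a random seed."""
    if user_seed is not None:
        return user_seed
    return random.SystemRandom().randrange(1, 2**31)


def init_values(seed, count=3):
    """Draw `count` normally distributed values from a generator seeded with `seed`."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(count)]


# With a fixed seed, repeated initialization is reproducible.
first = init_values(resolve_seed(1))
second = init_values(resolve_seed(1))
print(first == second)

# Without a seed, each call resolves to a freshly drawn seed.
print(resolve_seed(), resolve_seed())
```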
###### `nn.LinSpace` ([!9494](https://gitee.com/mindspore/mindspore/pulls/9494)) has been removed and `ops.LinSpace` ([!8920](https://gitee.com/mindspore/mindspore/pulls/8920)) has been modified
Previously, the `nn.LinSpace` interface only supported passing values as constructor arguments. For convenience, the latest version provides an enhanced `ops.LinSpace` interface that supports passing the values as inputs, so `nn.LinSpace` is no longer needed.
  178. <table>
  179. <tr>
  180. <td style="text-align:center"> 1.0.1 </td> <td style="text-align:center"> 1.1.0 </td>
  181. </tr>
  182. <tr>
  183. <td>
  184. ```python
  185. >>> from mindspore import nn
  186. >>>
  187. >>> start = 1
  188. >>> stop = 10
  189. >>> num = 5
  190. >>> linspace = nn.LinSpace(start, stop, num)
  191. >>> output = linspace()
  192. ```
  193. </td>
  194. <td>
  195. ```python
  196. >>> import mindspore
  197. >>> from mindspore import Tensor
  198. >>> from mindspore import ops
  199. >>>
  200. >>> linspace = ops.LinSpace()
  201. >>> start = Tensor(1, mindspore.float32)
  202. >>> stop = Tensor(10, mindspore.float32)
  203. >>> num = 5
  204. >>> output = linspace(start, stop, num)
  205. ```
  206. </td>
  207. </tr>
  208. </table>
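For reference, the values such a call is expected to produce can be computed by hand. The sketch below is plain Python, not the MindSpore operator; `linspace_values` is a hypothetical helper that assumes both endpoints are included, as in NumPy's `linspace`.

```python
def linspace_values(start, stop, num):
    """Return `num` evenly spaced values from start to stop, inclusive."""
    if num < 1:
        raise ValueError("num must be at least 1")
    if num == 1:
        return [float(start)]
    step = (stop - start) / (num - 1)
    return [start + i * step for i in range(num)]


print(linspace_values(1, 10, 5))  # [1.0, 3.25, 5.5, 7.75, 10.0]
```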
  209. ###### Parts of `Optimizer` add target interface ([!6760](https://gitee.com/mindspore/mindspore/pulls/6760/files))
The usage of the sparse optimizer has changed.
The target interface is used to set the execution backend of the sparse operator.
Setting the backend through the add_prim_attr interface is no longer allowed.
  213. The following optimizers add the target interface: Adam, FTRL, LazyAdam, ProximalAdagrad
  214. <table>
  215. <tr>
  216. <td style="text-align:center"> 1.0.1 </td> <td style="text-align:center"> 1.1.0 </td>
  217. </tr>
  218. <tr>
  219. <td>
  220. ```python
  221. >>> from mindspore.nn import Adam
  222. >>>
  223. >>> net = LeNet5()
  224. >>> optimizer = Adam(filter(lambda x: x.requires_grad, net.get_parameters()))
  225. >>> optimizer.sparse_opt.add_prim_attr("primitive_target", "CPU")
  226. ```
  227. </td>
  228. <td>
  229. ```python
  230. >>> from mindspore.nn import Adam
  231. >>>
  232. >>> net = LeNet5()
  233. >>> optimizer = Adam(filter(lambda x: x.requires_grad, net.get_parameters()))
  234. >>> optimizer.target = 'CPU'
  235. ```
  236. </td>
  237. </tr>
  238. </table>
  239. ###### `export` Modify the input parameters and export's file name ([!7385](https://gitee.com/mindspore/mindspore/pulls/7385), [!9057](https://gitee.com/mindspore/mindspore/pulls/9057/files))
  240. Export the MindSpore prediction model to a file in the specified format.
The parameters include: `net`, `*inputs`, `file_name`, `file_format`, `**kwargs`.
Input parameters can be passed according to the specific export requirements.
The file name extension is added automatically based on the format.
  244. <table>
  245. <tr>
  246. <td style="text-align:center"> 1.0.1 </td> <td style="text-align:center"> 1.1.0 </td>
  247. </tr>
  248. <tr>
  249. <td>
```python
>>> import numpy as np
>>> import mindspore
>>> from mindspore import Tensor
>>> from mindspore.train.quant import quant
>>>
>>> network = LeNetQuant()  # user-defined network
>>> inputs = Tensor(np.ones([1, 1, 32, 32]), mindspore.float32)
>>> quant.export(network, inputs, file_name="lenet_quant.mindir", file_format='MINDIR')
lenet_quant.mindir
```
  258. </td>
  259. <td>
```python
>>> import numpy as np
>>> import mindspore
>>> from mindspore import Tensor, export
>>>
>>> network = LeNetQuant()  # user-defined network
>>> inputs = Tensor(np.ones([1, 1, 32, 32]), mindspore.float32)
>>> export(network, inputs, file_name="lenet_quant", file_format='MINDIR', quant_mode='AUTO')
lenet_quant.mindir
```
  268. </td>
  269. </tr>
  270. </table>
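The naming rule can be sketched as follows. `exported_file_name` is a hypothetical helper, and the extension table is an assumption based on the format names; only the `MINDIR` case is confirmed by the example above.

```python
# Assumed mapping from export format to file name extension; only the
# MINDIR entry is confirmed by the release notes example.
FORMAT_EXTENSIONS = {"MINDIR": ".mindir", "AIR": ".air", "ONNX": ".onnx"}


def exported_file_name(file_name, file_format):
    """Hypothetical helper: append the extension implied by file_format."""
    try:
        extension = FORMAT_EXTENSIONS[file_format]
    except KeyError:
        raise ValueError(f"unsupported file_format: {file_format!r}") from None
    return file_name + extension


print(exported_file_name("lenet_quant", "MINDIR"))  # lenet_quant.mindir
```

Under this rule, callers pass the bare name (`lenet_quant`) rather than a name that already carries the extension.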
###### `Dense`, `Conv2dBnAct`, `DenseBnAct`, `DenseQuant` support setting the activation attribute as an instance of a class derived from `nn.Cell` or `Primitive` ([!7581](https://gitee.com/mindspore/mindspore/pulls/7581))
activation (Union[str, Cell, Primitive]): activation function applied to the output of the fully connected layer
  273. <table>
  274. <tr>
  275. <td style="text-align:center"> 1.0.1 </td> <td style="text-align:center"> 1.1.0 </td>
  276. </tr>
  277. <tr>
  278. <td>
  279. ```python
  280. >>> import mindspore.nn as nn
  281. >>>
  282. >>> dense = nn.Dense(1, 1, activation='relu')
  283. ```
  284. </td>
  285. <td>
  286. ```python
  287. >>> import mindspore.nn as nn
  288. >>> import mindspore.ops as ops
  289. >>>
  290. >>> dense = nn.Dense(1, 1, activation=nn.ReLU())
  291. >>> dense = nn.Dense(1, 1, activation=ops.ReLU())
  292. ```
  293. </td>
  294. </tr>
  295. </table>
###### `tensor.dim()`, `tensor.size()` have been renamed to `tensor.ndim`, `tensor.size` ([!10175](https://gitee.com/mindspore/mindspore/pulls/10175))
  297. Previously, tensor.size() and tensor.dim() were used for checking the total number of elements/dimensions in the tensor.
  298. However, from a user's perspective, tensor.size and tensor.ndim (methods -> properties) are better choices, since they follow the numpy naming convention.
  299. <table>
  300. <tr>
  301. <td style="text-align:center"> 1.0.1 </td> <td style="text-align:center"> 1.1.0 </td>
  302. </tr>
  303. <tr>
  304. <td>
  305. ```python
  306. >>> from mindspore import Tensor
  307. >>>
  308. >>> Tensor((1,2,3)).size()
  309. >>> Tensor((1,2,3)).dim()
  310. ```
  311. </td>
  312. <td>
  313. ```python
  314. >>> from mindspore import Tensor
  315. >>>
  316. >>> Tensor((1,2,3)).size
  317. >>> Tensor((1,2,3)).ndim
  318. ```
  319. </td>
  320. </tr>
  321. </table>
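What the two properties report can be derived from the shape alone. A minimal sketch in plain Python (no MindSpore required); `ShapeInfo` is a hypothetical class used only to show the method-to-property change.

```python
import math


class ShapeInfo:
    """Hypothetical stand-in exposing NumPy-style `size` and `ndim` properties."""

    def __init__(self, shape):
        self.shape = tuple(shape)

    @property
    def size(self):
        """Total number of elements."""
        return math.prod(self.shape)

    @property
    def ndim(self):
        """Number of dimensions."""
        return len(self.shape)


info = ShapeInfo((3,))  # shape of Tensor((1, 2, 3))
print(info.size, info.ndim)  # 3 1
```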
  322. ###### `EmbeddingLookup` add a config in the interface: sparse ([!8202](https://gitee.com/mindspore/mindspore/pulls/8202))
  323. sparse (bool): Using sparse mode. When 'target' is set to 'CPU', 'sparse' has to be true. Default: True.
  324. <table>
  325. <tr>
  326. <td style="text-align:center"> 1.0.1 </td> <td style="text-align:center"> 1.1.0 </td>
  327. </tr>
  328. <tr>
  329. <td>
```python
>>> import numpy as np
>>> import mindspore
>>> from mindspore import Tensor
>>> from mindspore.nn import EmbeddingLookup
>>>
>>> input_indices = Tensor(np.array([[1, 0], [3, 2]]), mindspore.int32)
>>> result = EmbeddingLookup(4, 2)(input_indices)
>>> print(result.shape)
(2, 2, 2)
```
  338. </td>
  339. <td>
```python
>>> import numpy as np
>>> import mindspore
>>> from mindspore import Tensor
>>> from mindspore.nn import EmbeddingLookup
>>>
>>> input_indices = Tensor(np.array([[1, 0], [3, 2]]), mindspore.int32)
>>> # `sparse` is a constructor argument of the cell, not a call argument
>>> result = EmbeddingLookup(4, 2, sparse=False)(input_indices)
>>> print(result.shape)
(2, 2, 2)
```
  348. </td>
  349. </tr>
  350. </table>
  351. ###### `nn.probability.bijector` change types of attributes from (int, float) to (float, list, numpy.ndarray, Tensor) ([!8191](https://gitee.com/mindspore/mindspore/pulls/8191))
  352. Attributes Type change: (int, float) -> (float, list, numpy.ndarray, Tensor).
  353. Int type is not supported anymore. Parameters of all bijectors should be type float, list, numpy.ndarray or Tensor.
  354. <table>
  355. <tr>
  356. <td style="text-align:center"> 1.0.1 </td> <td style="text-align:center"> 1.1.0 </td>
  357. </tr>
  358. <tr>
  359. <td>
  360. ```python
  361. >>> import mindspore.nn.probability.bijector as msb
  362. >>>
  363. >>> power = 2
  364. >>> bijector = msb.PowerTransform(power=power)
  365. ```
  366. </td>
  367. <td>
  368. ```python
  369. >>> import mindspore.nn.probability.bijector as msb
  370. >>>
  371. >>> power = 2.0
  372. >>> bijector = msb.PowerTransform(power=power)
  373. ```
  374. </td>
  375. </tr>
  376. </table>
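Code that still holds integer parameters can convert them before constructing a bijector. A minimal sketch; `as_float_param` is a hypothetical helper, not a MindSpore API, and it handles only Python scalars and lists.

```python
def as_float_param(value):
    """Hypothetical helper: turn ints (or lists of numbers) into floats."""
    if isinstance(value, bool):
        raise TypeError("bool is not a valid bijector parameter")
    if isinstance(value, (int, float)):
        return float(value)
    if isinstance(value, list):
        return [as_float_param(item) for item in value]
    raise TypeError(f"unsupported parameter type: {type(value).__name__}")


power = as_float_param(2)
print(power)  # 2.0
```

A call such as `msb.PowerTransform(power=power)` then receives a float.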
###### `nn.probability.bijector.GumbelCDF` removes an attribute from the interface: dtype ([!8191](https://gitee.com/mindspore/mindspore/pulls/8191))
  378. dtype is removed from GumbelCDF and is no longer an argument of the class.
  379. <table>
  380. <tr>
  381. <td style="text-align:center"> 1.0.1 </td> <td style="text-align:center"> 1.1.0 </td>
  382. </tr>
  383. <tr>
  384. <td>
  385. ```python
  386. >>> import mindspore.nn.probability.bijector as msb
  387. >>> from mindspore import dtype as mstype
  388. >>>
  389. >>> bijector = msb.GumbelCDF(loc=0.0, scale=1.0, dtype=mstype.float32)
  390. ```
  391. </td>
  392. <td>
  393. ```python
  394. >>> import mindspore.nn.probability.bijector as msb
  395. >>>
  396. >>> bijector = msb.GumbelCDF(loc=0.0, scale=1.0)
  397. ```
  398. </td>
  399. </tr>
  400. </table>
  401. ###### `nn.layer.combined.Conv2dBnAct`, `nn.layer.combined.DenseBnAct` move from nn.layer.quant to nn.layer.combined ([!8187](https://gitee.com/mindspore/mindspore/pulls/8187))
Previously, Conv2dBnAct and DenseBnAct were in nn.layer.quant. Since they are not quant cells, they have been moved to nn.layer.combined. If you import Conv2dBnAct and DenseBnAct from mindspore.nn, your code does not need any change.
  403. <table>
  404. <tr>
  405. <td style="text-align:center"> 1.0.1 </td> <td style="text-align:center"> 1.1.0 </td>
  406. </tr>
  407. <tr>
  408. <td>
  409. ```python
  410. >>> from mindspore.nn.layer.quant import Conv2dBnAct, DenseBnAct
  411. ```
  412. </td>
  413. <td>
  414. ```python
  415. >>> from mindspore.nn import Conv2dBnAct, DenseBnAct
  416. ```
  417. </td>
  418. </tr>
  419. </table>
###### `nn.layer.conv.Conv2D`, `nn.layer.quant.Conv2dBnFoldQuant`, `nn.layer.quant.Conv2dBnWithoutFoldQuant` change weight shape when group > 1 on the Ascend platform ([!9723](https://gitee.com/mindspore/mindspore/pulls/9723))
On the Ascend platform, if group > 1, the weight shape of Conv2D changes from [in_channels//group, out_channels, kernel_size, kernel_size] to [out_channels, in_channels//group, kernel_size, kernel_size]. Checkpoints saved previously from networks that use Conv2D with group > 1, such as MobileNet, cannot be used directly anymore; the first and second axes of the weight must be transposed first.
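The axis swap itself is simple. The sketch below uses nested Python lists in place of a real weight tensor; `swap_first_two_axes` and `shape_of` are hypothetical helpers, and loading and saving the checkpoint is left out. With NumPy the same operation would be `weight.transpose(1, 0, 2, 3)`.

```python
def swap_first_two_axes(weight):
    """Swap axes 0 and 1 of a nested-list tensor, leaving inner axes intact."""
    return [list(group) for group in zip(*weight)]


def shape_of(nested):
    """Shape of a regular (non-ragged) nested list."""
    shape = []
    while isinstance(nested, list):
        shape.append(len(nested))
        nested = nested[0]
    return tuple(shape)


# Old layout: [in_channels // group, out_channels, kernel_size, kernel_size]
old_weight = [[[[0.0] * 3 for _ in range(3)] for _ in range(4)] for _ in range(2)]
new_weight = swap_first_two_axes(old_weight)
print(shape_of(old_weight), "->", shape_of(new_weight))  # (2, 4, 3, 3) -> (4, 2, 3, 3)
```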
  422. ### Bug fixes
  423. #### FrontEnd
  424. - [STABLE] Fix the problem of the cse optimization in the situation of control flow. (Ascend/GPU)
  425. #### Auto Parallel
  426. - [STABLE] Resolve the restriction: input and output layouts of Reshape are restricted in tensor redistribution. (Ascend/GPU)
  427. - [STABLE] Resolve the restriction: output strategy should be data parallel in model evaluation. (Ascend/GPU)
  428. #### Executor
  429. - [STABLE] Fix fusion operator compilation cache. (Ascend)
  430. - [STABLE] Fix compilation error of dynamic shape operator. (Ascend)
- [STABLE] Fix the bug that PyNative mode cannot insert transdata for a node's output when the node should be split in the backend opt.(Ascend)
- [STABLE] Fix the bug that TensorMove and memcpy_async are merged into one after the backend cse pass (Ascend)
  433. #### DataSet
  434. - [STABLE] Fix cache server hang on RequestFreeTag. (Ascend/GPU/CPU)
- [STABLE] Fix hang when using pyfunc multi-processing. (Ascend/GPU/CPU)
- [STABLE] Fix core dump caused by adding multiple parent nodes to a tree node. (Ascend/GPU/CPU)
  437. ## MindSpore Lite
  438. ### Major Features and Improvements
  439. #### Converter and runtime
  440. 1. Support dynamic shape in MindSpore Lite Converter.
  441. 2. Optimize sub-graph mechanism by dynamically splitting the entire graph into multiple subgraphs based on the operator supported, backend hardware and user configuration.
  442. 3. Support TensorList and TensorList operators such as TensorListFromTensor, TensorListGetItem and so on.
  443. 4. Support BatchMatMul fusion and LSTM fusion in MindSpore Lite Converter.
5. Support converting models and running inference on the Windows operating system.
  445. 6. Support Model(.ms) visualization on Netron.
7. Support TensorFlow model in MindSpore Lite Converter
  447. 8. Add 86 converter parsers.
  448. 9. Convert aware training model without user’s awareness
  449. 10. Support scalar tensor in MindSpore Lite Converter and Runtime
  450. 11. Support NPU backend on HUAWEI Kirin SoC.[BETA]
  451. 12. Merge timeprofiler into benchmark
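The sub-graph mechanism in item 2 can be pictured as grouping consecutive operators by the backend that will run them. The following is a simplified sketch, not the MindSpore Lite implementation: it assumes a linear operator sequence, and the `NPU_SUPPORTED` set and the fall-back-to-CPU rule are hypothetical.

```python
from itertools import groupby

# Hypothetical set of operators the accelerator backend can run.
NPU_SUPPORTED = {"Conv2D", "Relu", "MaxPool"}


def pick_backend(op_type):
    """Choose a backend for one operator, falling back to CPU."""
    return "NPU" if op_type in NPU_SUPPORTED else "CPU"


def split_into_subgraphs(op_types):
    """Group consecutive operators that share a backend into one subgraph."""
    return [
        (backend, list(ops))
        for backend, ops in groupby(op_types, key=pick_backend)
    ]


graph = ["Conv2D", "Relu", "TopK", "Conv2D", "MaxPool"]
for backend, ops in split_into_subgraphs(graph):
    print(backend, ops)
```

In this toy graph the unsupported `TopK` operator forces a split into three subgraphs; a real scheduler would also weigh hardware and user configuration, as the item describes.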
  452. #### CPU backend optimization
  453. 1. Add 50+ new operators, including new Op type(like Adder, Gru).
  454. 2. Enhanced performance on armv8.2 supported platform. For example, utilizing sdot instruction more efficiently.
3. Optimize all operators(fp32, fp16, int8) by implementing multi-thread, SIMD tech as much as possible. Model inference time can be reduced by at least 20% after these optimizations.
  456. 4. Extending to support operators for x86_64 platform based on SSE/AVX instruction set.
  457. #### OpenCL backend
1. Add new ops: add 10+ ops, total 58 ops;
2. Performance optimization: by memory layout optimization, Winograd Convolution selection strategy optimization, SIMT local size optimization and local cache optimization, GPU performance improves by up to 20+% vs MSLITE Version1.0;
3. Add online graph optimization: by fusing Convolution/Matmul/Fullconnection with add/mul/pad/reshape, improve performance by up to 50+% for some networks;
4. Add auto tuning: by online tuning in the graph compilation phase, optimize performance by up to 10%;
5. Add weight quant: support weight quant;
6. Add OpenCL kernel binary cache: improve initialization time.
  464. #### Post quantization
MindSpore Lite supports both weight quantization and full quantization. Currently, weights can be quantized into 1 ~ 16 bits according to user configuration. In internal testing, quantization of networks such as classification, detection, segmentation and transformer is well supported. To ensure high accuracy of quantized models, MindSpore Lite uses a pipeline quantization method. In the first phase, the weights and activation values are quantized using linear quantization methods, such as MIN-MAX. In the second phase, the quantization error is analyzed, and statistical methods are used to compensate for the loss caused by quantizing fp32 values to a fixed-point type such as Int8. The features of post-training quantization are:
1. Per-channel asymmetric quantization for weights, such as MAX_MIN and KMEANS.
2. Per-layer symmetric quantization for activation, such as KL and MAX_MIN.
3. Per-layer asymmetric quantization for activation, such as RemoveOutlier.
4. Accuracy loss compensation, such as BiasCorrection.
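The first phase can be illustrated with a small asymmetric MIN-MAX quantizer. This is a generic textbook sketch in plain Python, not MindSpore Lite's code; the function names and the rounding choice are assumptions.

```python
def min_max_quantize(values, num_bits=8):
    """Quantize floats to unsigned integers with an asymmetric MIN-MAX scheme."""
    q_max = (1 << num_bits) - 1
    lo, hi = min(values), max(values)
    # Guard against a constant input, where the range would be zero.
    scale = (hi - lo) / q_max if hi > lo else 1.0
    quantized = [round((v - lo) / scale) for v in values]
    return quantized, scale, lo


def dequantize(quantized, scale, lo):
    """Map quantized integers back to approximate float values."""
    return [q * scale + lo for q in quantized]


weights = [-1.0, -0.25, 0.1, 0.5, 1.0]
q, scale, lo = min_max_quantize(weights, num_bits=8)
restored = dequantize(q, scale, lo)
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(q[0], q[-1], max_error)
```

The round-trip error stays within about half a quantization step (`scale / 2`); this residual is the kind of error the second phase tries to compensate.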
  470. | mobilenet_v2 | ACC (ImageNet) |
  471. |---|---|
  472. | FP32 | 71.56% |
| A8W8 | 71.16% |
  474. | A8W8(without BiasCorrection) | 70.74% |
  475. | A8W7 | 71.06% |
  476. | A7W7 | 70.78% |
The above table uses the mobilenet_v2 model from the TF official website. Using MindSpore Lite quantization, the accuracy loss of A8W8 (8-bit activation value quantization and 8-bit weight quantization) decreases from 0.82% to 0.4% after accuracy loss compensation. For 7-bit quantization, the accuracy loss is still no more than 1%.
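The figures quoted here follow directly from the table; a quick check in Python:

```python
# Top-1 accuracy (%) on ImageNet, copied from the table above.
accuracy = {
    "FP32": 71.56,
    "A8W8": 71.16,
    "A8W8 (without BiasCorrection)": 70.74,
    "A8W7": 71.06,
    "A7W7": 70.78,
}

baseline = accuracy["FP32"]
# Accuracy loss of each quantized variant relative to the FP32 model.
loss = {
    name: round(baseline - value, 2)
    for name, value in accuracy.items()
    if name != "FP32"
}
for name, drop in loss.items():
    print(f"{name}: -{drop:.2f} points")
```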
  478. #### Training on Device
In the MindSpore 1.1 release, MindSpore Lite provides the following Training-on-Device (ToD) capabilities:
1. Learning from scratch and Transfer Learning strategies are supported.
2. MindSpore based models can be converted and used in training on the device. (Third-party models such as TensorFlow and PyTorch cannot be directly imported to the framework for now.)
3. Grad operations are supported for more than 30 operators such as Dense layers, Convolutions and Batch Normalizations. Momentum, SGD, and ADAM optimizers are supported.
4. Supports networks such as LeNet, AlexNet, ResNet, MobileNetV1/V2/V3, and EfficientNet, and provides complete model loading, conversion, and Python training scripts on the device side.
  484. The MindSpore Lite ToD framework is already in use in the newest Huawei Smart TV, providing a unique and personalized user experience as a family entertainment center.
  485. ### API Change
  486. #### API Incompatible Change
  487. ##### C++ API
- [Modify] Context now supports multi-context configuration.(Context.h)
- [Modify] Callback is moved from lite_session.h into ms_tensor.h.
- [Modify] GetInputsByName in lite_session.h is changed into GetInputsByTensorName
- [Add] add static LiteSession *CreateSession(const char *model_buf, size_t size, const lite::Context *context) in lite_session.h
  492. - [Add] add GetErrorInfo interface returning error message in errorcode.h
  493. - [Delete] Remove model_generated.h, ops_generated.h and headers of FlatBuffers library from interfaces
  494. ##### Java API
  495. - [Add] Implement JNI layer and add Java api for CPU and GPU backend
  496. #### Deprecations
  497. ##### C++ API
  498. Deprecate Interface GetOutputsByNodeName
  499. ### Bug fixes
  500. - [BUGFIX] Fix the bug in sub-graph segmentation
  501. - [BUGFIX] Fix the bug in Tensor getitem in which the ellipsis matches the wrong dim-size.
  502. - [BUGFIX] Fix the bug that activation modification after defining Dense will not take effect.
  503. ### Contributors
  504. zhouyifengCode, huqi, JulyAi, damon0626, chenbo116, rmdyh, davidmc, gray0v0, doitH, Gogery, zymaa, xinyunfan
  505. # MindSpore 1.0.0 Release Notes
  506. ## Major Features and Improvements
  507. ### MindSpore Training and Inference Framework
  508. #### Ascend 910
  509. - New models
  510. - DenseNet121: a dense convolutional neural network, which connects each layer to every other layer in a feed-forward fashion for object recognition on ImageNet dataset.
  511. - UNet2D-Medical: Unet Medical model for 2D image segmentation, Convolutional Networks for Biomedical Image Segmentation on ISBI Challenge database.
  512. - Frontend and user interface
  513. - Second-Order Optimization
  514. - Enable second-order optimization for Bert on Ascend 910, which can achieve a masked lm accuracy of 71.3% in 800 seconds using 8 Ascend 910 (Bert-Large @MLPerf v0.7 dataset).
  515. - New GNN model BGCF
  516. - Bayesian Graph Convolutional Filtering network, which naturally incorporates the uncertainty in the user-item interaction graph, shows excellent recommendation performance on the Amazon-Beauty dataset.
  517. - Add append interface for SequentialCell.
  518. - Add a level `auto` for AMP.
  519. - Executor and performance optimization
  520. - Support quantitative network (Resnet50 & YoloV3 & MobileNetV2).
  521. - Project ease-of-use optimization: compilation time optimization, CMakeLists regularization, and independent compilation and installation of cuDNN and CUDA.
  522. - Data processing, augmentation, and save format
  523. - Support returning string type from GeneratorDataset
  524. #### Other Hardware Support
  525. - GPU platform
  526. - Enable second-order optimization for ResNet50 on GPU, which achieves a 30% improvement in training time compared to SGD with Momentum (ResNet50 @ImageNet).
  527. #### User interfaces change log
  528. - Remove global object GradOperation in Autodiff([!5011](https://gitee.com/mindspore/mindspore/pulls/5011))
  529. - Remove useless attribute 'name' in Autodiff([!5172](https://gitee.com/mindspore/mindspore/pulls/5172))
  530. - Rectification distributed init([!5350](https://gitee.com/mindspore/mindspore/pulls/5350))
  531. - Move the setting of ParallelMode from train.parallel_utils to context([!5351](https://gitee.com/mindspore/mindspore/pulls/5351))
  532. - Modification of save_checkpoint([!5482](https://gitee.com/mindspore/mindspore/pulls/5482))
  533. - Wrap numpy random seed into an api([!5634](https://gitee.com/mindspore/mindspore/pulls/5634))
  534. - Delete enable_fused_layernorm in some modelzoo scripts([!5665](https://gitee.com/mindspore/mindspore/pulls/5665))
  535. - Move 'multi-subgraphs' interface to internal([!5696](https://gitee.com/mindspore/mindspore/pulls/5696))
  536. - Rename mirror_mean to gradient_mean([!5700](https://gitee.com/mindspore/mindspore/pulls/5700))
  537. - Remove default value of 'group' of DepthWiseConv2d([!5865](https://gitee.com/mindspore/mindspore/pulls/5865))
  538. - Modify interface for function and remove duplicated def([!5958](https://gitee.com/mindspore/mindspore/pulls/5958))
  539. - Unify Conv2d and DepthwiseConv2d([!5916](https://gitee.com/mindspore/mindspore/pulls/5916))
  540. - Modification of SoftmaxCrossEntropyWithLogits([!5502](https://gitee.com/mindspore/mindspore/pulls/5502))
  541. - Change API set_strategy() to shard()([!5991](https://gitee.com/mindspore/mindspore/pulls/5991))
  542. - Move batch_size from bert_net_cfg to cfg([!6233](https://gitee.com/mindspore/mindspore/pulls/6233))
  543. - Remove unused parameters from SummaryRecord __init__([!5548](https://gitee.com/mindspore/mindspore/pulls/5548))
  544. - Remove sens parameter of TrainOneStepWithLossScaleCell([!5753](https://gitee.com/mindspore/mindspore/pulls/5753))
  545. - Optimize the TrainOneStepCell for user's define([!6159](https://gitee.com/mindspore/mindspore/pulls/6159))
  546. - Delete seed0 and seed1 of nn.Dropout([!5735](https://gitee.com/mindspore/mindspore/pulls/5735))
  547. - Delete DataWrapper([!6101](https://gitee.com/mindspore/mindspore/pulls/6101))
  548. - LSTM API optimization([!6374](https://gitee.com/mindspore/mindspore/pulls/6374))
  549. - Merge P\C\F of ops([!5645](https://gitee.com/mindspore/mindspore/pulls/5645))
  550. - delete SoftmaxCrossEntropyExpand interface([!6607](https://gitee.com/mindspore/mindspore/pulls/6607))
  551. - Adjust GroupNorm interface([!6329](https://gitee.com/mindspore/mindspore/pulls/6329))
  552. - Modify init interface to internal interface([!6651](https://gitee.com/mindspore/mindspore/pulls/6651))
  553. - Log optimization([!5842](https://gitee.com/mindspore/mindspore/pulls/5842))
  554. - Remove useless API dataset.set_dataset_size([!5806](https://gitee.com/mindspore/mindspore/pulls/5806))
  555. - Some of Dataset API add usage parameter([!5605](https://gitee.com/mindspore/mindspore/pulls/5605))
  556. - Change the import path, such as from mindspore.dataset.transforms.vision to mindspore.dataset.vision.transforms([!5384](https://gitee.com/mindspore/mindspore/pulls/5384))
  557. - Rename ImageFolderDatasetV2 to ImageFolderDataset([!5384](https://gitee.com/mindspore/mindspore/pulls/5384))
  558. - Dataset.map parameter optimization([!5384](https://gitee.com/mindspore/mindspore/pulls/5384))
  559. - Add new api dataset.get_col_names([!5384](https://gitee.com/mindspore/mindspore/pulls/5384))
  561. - Remove useless API MindRecord finish([!5580](https://gitee.com/mindspore/mindspore/pulls/5580))
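Several entries in the change log above are plain renames. As a rough aid for auditing older scripts, the sketch below scans Python source text for a few of the old names taken from this list and reports the replacement. The helper `find_renamed_names` and the `RENAMES` table are hypothetical and not part of MindSpore; the table covers only four of the entries, and the helper reports matches without rewriting any code.

```python
import re

# Old name -> new name, copied from the change log entries above.
RENAMES = {
    "mindspore.dataset.transforms.vision": "mindspore.dataset.vision.transforms",
    "ImageFolderDatasetV2": "ImageFolderDataset",
    "mirror_mean": "gradient_mean",
    "set_strategy": "shard",
}


def find_renamed_names(source):
    """Return (line_number, old_name, new_name) for each old name found."""
    hits = []
    for lineno, line in enumerate(source.splitlines(), start=1):
        for old, new in RENAMES.items():
            # Word boundaries avoid matching inside longer identifiers.
            if re.search(r"\b" + re.escape(old) + r"\b", line):
                hits.append((lineno, old, new))
    return hits


old_script = (
    "import mindspore.dataset.transforms.vision as vision\n"
    "net.matmul.set_strategy(((1, 2), (2, 1)))\n"
)
for lineno, old, new in find_renamed_names(old_script):
    print(f"line {lineno}: replace '{old}' with '{new}'")
```

Entries that change behavior or remove parameters, rather than rename them, still need to be checked by hand against the linked pull requests.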
  562. ### MindSpore Lite
  563. - Converter
  564. - Add 6 TFLite op, 7 Caffe op, 1 ONNX op.
  565. - Add support for Windows.
  566. - Support parallel inference of multiple sessions to adapt to more scenarios
  567. - Support 8-bit weight-only quantization; most mainstream models have a small accuracy loss (less than 0.5%) compared with the non-quantized fp32 model.
  568. - CPU & GPU
  569. - Add 20 CPU ops, including FP32, int8/uint8, FP16 and int32 ops.
  570. - Add FP16 support for GPU; add 14 GPU ops, including FP32/FP16.
  571. - Add Buffer/Image2D transform op for GPU
  572. - Performance optimization for CPU ops, focusing on ARM32.
  573. - Performance optimization for GPU Convolution using winograd.
  574. - Tool & example
  575. - Add object detection Android Demo.
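The Winograd optimization listed under CPU & GPU above reduces the number of multiplications in a convolution. As a minimal sketch of the idea, and not the MindSpore Lite GPU kernel, the 1-D F(2,3) variant below produces two outputs of a 3-tap filter with four multiplications instead of six. The function names are hypothetical.

```python
def winograd_f23(d, g):
    """Two outputs of a 3-tap filter g over four inputs d, using 4 multiplies.

    The filter-side terms (g0 + g1 + g2) / 2 and (g0 - g1 + g2) / 2 depend only
    on the filter, so a real kernel would precompute them once.
    """
    d0, d1, d2, d3 = d
    g0, g1, g2 = g
    m1 = (d0 - d2) * g0
    m2 = (d1 + d2) * ((g0 + g1 + g2) / 2)
    m3 = (d2 - d1) * ((g0 - g1 + g2) / 2)
    m4 = (d1 - d3) * g2
    return [m1 + m2 + m3, m2 - m3 - m4]


def direct_f23(d, g):
    """Reference: the same two outputs computed directly with 6 multiplies."""
    return [
        d[0] * g[0] + d[1] * g[1] + d[2] * g[2],
        d[1] * g[0] + d[2] * g[1] + d[3] * g[2],
    ]


inputs = [1.0, 2.0, 3.0, 4.0]
taps = [0.5, 1.0, -2.0]
print(winograd_f23(inputs, taps), direct_f23(inputs, taps))
```

Production kernels typically apply the 2-D form of this transform to tiles of the feature map; the 1-D case is shown only because it is easy to verify by hand.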
  576. ## Bugfixes
  577. - Models
  578. - fix the constant folding problem in multiply.([!6092](https://gitee.com/mindspore/mindspore/pulls/6092))
  579. - move batch_size from bert_net_cfg to cfg in bert scripts.([!6233](https://gitee.com/mindspore/mindspore/pulls/6233))
  580. - modify the checkpoint file path.([!6137](https://gitee.com/mindspore/mindspore/pulls/6137))
  581. - Python API
  582. - fix semi auto parallel parameter of reshape has another user([!5722](https://gitee.com/mindspore/mindspore/pulls/5722))
  583. - raise ValueError when call hook function in graph mode([!5831](https://gitee.com/mindspore/mindspore/pulls/5831))
  584. - Executor
  585. - fix pynative mode to build temporary nn objects.([!6189](https://gitee.com/mindspore/mindspore/pulls/6189))
  586. - fix the accuracy problem of multiple inputs of multi-card communication operator broadcast.([!6522](https://gitee.com/mindspore/mindspore/pulls/5622))
  587. - fix the problem that the sample distribution interface categorical does not support graph mode.([!5772](https://gitee.com/mindspore/mindspore/pulls/5772))
  588. - fix the random seed failure problem of the polynomial downsampling distribution operator.([!5948](https://gitee.com/mindspore/mindspore/pulls/5948))
  589. - fix unnecessary address binding issues in GPU heterogeneous scenarios.([!6232](https://gitee.com/mindspore/mindspore/pulls/6232))
  590. - GPU platform
  591. - fix for kernel resource leak([!5315](https://gitee.com/mindspore/mindspore/pulls/5315))
  592. - fix for insufficient memory for continuous unit test running([!5617](https://gitee.com/mindspore/mindspore/pulls/5617))
  593. - fix for the memory leak in the sparse slicer([!5578](https://gitee.com/mindspore/mindspore/pulls/5578))
  594. - Data processing
  595. - fix hang when using pyfunc([!6346](https://gitee.com/mindspore/mindspore/pulls/6346))
  596. - fix GPU device queue not releasing the GIL during resource cleanup([!5964](https://gitee.com/mindspore/mindspore/pulls/5964))
  597. - fix hang if the script exits abnormally([!6441](https://gitee.com/mindspore/mindspore/pulls/6441))
  598. - Third party
  599. - Sqlite : Update sqlite to 3.32.2 to handle [CVE-2020-11656](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-11656), [CVE-2020-13871](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13871), [CVE-2020-11655](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-11655), [CVE-2020-9327](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-9327), [CVE-2020-13630](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13630), [CVE-2020-15358](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-15358), [CVE-2020-13631](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13631), [CVE-2020-13632](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13632), [CVE-2020-13434](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13434), and [CVE-2020-13435](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13435).
  600. - Libjpeg-turbo : Update libjpeg-turbo to 2.0.4 to handle [CVE-2020-13790](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13790).
  601. ## Contributors
  602. Thanks goes to these wonderful people:
  603. Adel, AGroupofProbiotocs, anthonyaje, anzhengqi, askmiao, baihuawei, baiyangfan, bai-yangfan, bingyaweng, BowenK, buxue, caifubi, CaoJian, caojian05, caozhou, Cathy, changzherui, chenfei, chengxianbin, chenhaozhe, chenjianping, chenzomi, chenzupeng, chujinjin, cj, cjh9368, Corleone, danish, Danish, dayschan, eric, Eric, fary86, fuzhiye, Gaoxiong, gengdongjie, gongdaguo, gukecai, guoqi, gzhcv, hangq, hanhuifeng2020, Harshvardhan, He, heleiwang, hexia, Hoai, HuangBingjian, huangdongrun, huanghui, huangxinjing, huzhifeng, hwjiaorui, Jesse, jianghui58, jiangzhiwen, Jiaqi, jin-xiulang, jinyaohui, jjfeing, John, Jonathan, jonyguo, jzg, kai00, kingfo, kingxian, kpy, kswang, laiyongqiang, leonwanghui, Li, liangchenghui, liangzelang, lichen_101010, lichenever, lihongkang, lilei, limingqi107, ling, linqingke, liubuyu, liuwenhao4, liuxiao78, liuxiao93, liuyang_655, liuzhongkai, Lixia, lixian, liyanliu, liyong, lizhenyu, luoyang, lvchangquan, lvliang, lz, mahdi, Mahdi, maning202007, Margaret_wangrui, mayang, mengyuanli, nhussain, ougongchang, panfengfeng, panyifeng, Payne, Peilin, peixu_ren, Pengyongrong, qianlong, r1chardf1d0, riemann_penn, root, Sheng, shenwei41, simson, Simson, Su, sunsuodong, tao_yunhao, tinazhang, VectorSL, , Wan, wandongdong, wangdongxu, wangmin, wangnan39@huawei.com, wangyue01, wangzhe, wanyiming, Wei, wenchunjiang, wilfChen, WilliamLian, wsc, wukesong, wuweikang, wuxuejian, Xiaoda, xiefangqi, xuanyue, xulei2020, Xun, xuyongfei, yanghaitao, yanghaitao1, yanghaoran, YangLuo, yangruoqi713, yankai, yanzhenxiang2020, yao_yf, yepei6, yeyunpeng, Yi, yoni, yoonlee666, yuchaojie, yujianfeng, yuximiao, zengzitao, Zhang, zhanghaibo5@huawei.com, zhanghuiyao, zhangyihui, zhangz0911gm, zhanke, zhanyuan, zhaodezan, zhaojichen, zhaoting, zhaozhenlong, zhengjun10, zhoufeng, zhousiyi, zhouyaqiang, Zichun, Zirui, Ziyan, zjun, ZPaC
  604. Contributions of any kind are welcome!
  605. # MindSpore 0.7.0-beta Release Notes
  606. ## Major Features and Improvements
  607. ### MindSpore Training and Inference Framework
  608. #### Ascend 910
  609. - New models
  610. - TinyBert: a smaller and faster version of BERT using transformer distillation for natural language understanding on GLUE benchmark.
  611. - SE-ResNet50: adds Squeeze-and-Excitation blocks (SE-Blocks) to the ResNet50 network to improve channel interdependencies for image classification on ImageNet 2012 dataset.
  612. - Inception V3: the third version of Inception convolutional architectures for image classification on ImageNet 2012 dataset.
  613. - Frontend and user interface
  614. - High-level packaging of the Embedding operator to support segmentation by field for Wide&Deep.
  615. - Load multi-node checkpoint into single-process to support host-device hybrid inference.
  616. - Support Concat/Tile/StridedSlice distributed operators.
  617. - Support cumulative gradient and batch training split.
  618. - Support variable parameter input for Cell object.
  619. - Parameter mixed calculation optimization for pynative mode.
  620. - Deep Probabilistic Programming
  621. - Support statistical distributions classes used to generate stochastic tensors.
  622. - Support probabilistic inference algorithms.
  623. - Support BNN layers used to construct BNN in Graph mode.
  624. - Support interfaces for the transformation between BNN and DNN in Graph mode.
  625. - Support uncertainty estimation to estimate epistemic uncertainty and aleatoric uncertainty.
  626. - User interfaces change log
  627. - change base class of parameter([!3473](https://gitee.com/mindspore/mindspore/pulls/3473))
  628. - change binary to mindir([!4258](https://gitee.com/mindspore/mindspore/pulls/4258))
  629. - change export from geir to air([!4269](https://gitee.com/mindspore/mindspore/pulls/4269))
  630. - Init parameter data by default([!3967](https://gitee.com/mindspore/mindspore/pulls/3967))
  631. - change IndexedSlices to RowTensor([!4031](https://gitee.com/mindspore/mindspore/pulls/4031))
  632. - Must set or change parallel mode before any Initializer created([!4801](https://gitee.com/mindspore/mindspore/pulls/4801))
  633. - Executor and performance optimization
  634. - MindSpore graph compilation process performance improved by 20%.
  635. - Decoupling C++ and Python modules to achieve separate compilation of core modules.
  636. - Data processing, augmentation, and save format
  637. - Support automatic data augmentation
  638. - Support GNN distributed cache in single node
  639. - Support ConcatDataset using distributed sampler
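Assuming the "cumulative gradient" item under Frontend and user interface above refers to gradient accumulation, the idea can be sketched in plain Python: a large batch is split into micro-batches, each micro-batch gradient is summed, and one update uses the combined gradient. This is an illustrative sketch for a one-parameter model, not MindSpore code; `grad_mse` and `accumulated_grad` are hypothetical names.

```python
def grad_mse(w, xs, ys):
    """Gradient of the mean squared error of y ~ w * x with respect to w."""
    return sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)


def accumulated_grad(w, xs, ys, micro_batch_size):
    """Accumulate micro-batch gradients into the full-batch gradient."""
    total = 0.0
    n = len(xs)
    for start in range(0, n, micro_batch_size):
        mx = xs[start:start + micro_batch_size]
        my = ys[start:start + micro_batch_size]
        # Weight each micro-batch mean gradient by its sample count.
        total += grad_mse(w, mx, my) * len(mx)
    return total / n


xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [2.1, 3.9, 6.2, 8.1, 9.8, 12.2]
w = 1.5
print(grad_mse(w, xs, ys), accumulated_grad(w, xs, ys, micro_batch_size=2))
```

Both values agree up to floating-point rounding, which is why accumulation allows a large effective batch size when only a micro-batch fits in device memory.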
  640. #### Other Hardware Support
  641. - GPU platform
  642. - New model supported: VGG16, ResNet101, DeepFM.
  643. - Support some distributed operators in ResNet50 and Wide&Deep.
  644. - Support automatic parallel for Wide&Deep.
  645. - Support function `funcs[i](*inputs)` (such as switch-case).
  646. - Support distributed training with parameter server.
  647. - Support GPU operator profiling.
  648. - Performance optimization of the distributed training with allreduce.
  649. - Performance optimization of the mixed precision training.
  650. - Performance optimization of the pynative mode.
  651. - Performance optimization of the convolution operator, batch normalization operator.
  652. - CPU platform
  653. - Support MobileNetV2 Re-Training: Re-train the network with different class number.
  654. ### MindSpore Lite
  655. - Converter
  656. - Support third-party models, including TFLite/Caffe/ONNX.
  657. - Add 93 TFLite op.
  658. - Add 24 Caffe op.
  659. - Add 62 ONNX op.
  660. - Add 11 optimization passes, including fusion and constant folding.
  661. - Support quantization-aware training and post-training quantization.
  662. - CPU
  663. - Add 100+ ops; support FP32, int8/uint8 and FP16 ops.
  664. - Support fast convolution algorithms: Sliding Window, Img2col + Gemm, Strassen, Winograd
  665. - Support assembly/NEON instructions.
  666. - Support CPU fp16 and sdot on ARM v8.2+.
  667. - GPU
  668. - Add 20+ ops for OpenCL.
  669. - Support image2D/buffer format.
  670. - Optimize online initialization time.
  671. - Add optimized convolution1X1/3X3/depthwise/convolution_transposed for OpenCL.
  672. - Tool & example
  673. - Add benchmark and TimeProfile tools.
  674. - Add image classification Android Demo.
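Of the fast convolution algorithms listed under CPU above, Img2col + Gemm is the simplest to show: every input patch is unrolled into a row, and the convolution becomes a matrix product with the flattened kernel. The sketch below is a single-channel, stride-1, no-padding illustration in plain Python, not the MindSpore Lite implementation; the function names are hypothetical.

```python
def im2col(image, kh, kw):
    """Unroll every kh x kw patch of a 2-D image into one row."""
    h, w = len(image), len(image[0])
    rows = []
    for i in range(h - kh + 1):
        for j in range(w - kw + 1):
            rows.append(
                [image[i + di][j + dj] for di in range(kh) for dj in range(kw)]
            )
    return rows


def conv2d_im2col(image, kernel):
    """Convolution (cross-correlation form) as patches times flat kernel."""
    kh, kw = len(kernel), len(kernel[0])
    flat_kernel = [v for row in kernel for v in row]
    out_w = len(image[0]) - kw + 1
    patches = im2col(image, kh, kw)
    # The "GEMM" step: one dot product per patch row.
    flat_out = [sum(p * k for p, k in zip(patch, flat_kernel)) for patch in patches]
    return [flat_out[r:r + out_w] for r in range(0, len(flat_out), out_w)]


image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
kernel = [[1, 0], [0, -1]]
print(conv2d_im2col(image, kernel))
```

The unrolled matrix duplicates overlapping pixels, so the approach trades extra memory for the ability to reuse a highly tuned matrix multiplication routine.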
  675. ## Bugfixes
  676. - Models
  677. - normalize the readme file([!5410](https://gitee.com/mindspore/mindspore/pulls/5410))
  678. - fix a sink_size bug for transformer([!5393](https://gitee.com/mindspore/mindspore/pulls/5393))
  679. - fix bool type optional for resnet50([!5363](https://gitee.com/mindspore/mindspore/pulls/5363))
  680. - Python API
  681. - improve interface '__bool__' for tensor([!4000](https://gitee.com/mindspore/mindspore/pulls/4000))
  682. - fix GPU-ResizeNearestNeighbor([!3760](https://gitee.com/mindspore/mindspore/pulls/3760))
  683. - fix topK multi dimension grad func([!3711](https://gitee.com/mindspore/mindspore/pulls/3711))
  684. - fix scatterop error msg([!3699](https://gitee.com/mindspore/mindspore/pulls/3699))
  685. - fix bug of cast dtype when using mixed precision in pynative mode([!3730](https://gitee.com/mindspore/mindspore/pulls/3730))
  686. - Executor
  687. - fix etsnet train error when UnsegmentSum's first input shape is (1,) ([!4573](https://gitee.com/mindspore/mindspore/pulls/4573))
  688. - fix bug of wrong results in while control flow because value references were not supported ([!4103](https://gitee.com/mindspore/mindspore/pulls/4103))
  689. - fix bug of the output tensor does not carry device data type ([!3774](https://gitee.com/mindspore/mindspore/pulls/3774))
  690. - fix bug where multiple attr values are eliminated in pynative mode ([!4225](https://gitee.com/mindspore/mindspore/pulls/4225))
  691. - fix bug of AssignAdd unable to work normally in multi-cases ([!5171](https://gitee.com/mindspore/mindspore/pulls/5171))
  692. - GPU platform
  693. - improve the environment variable checking for nvcc compiler path ([!5140](https://gitee.com/mindspore/mindspore/pulls/5140))
  694. - fix bug of error in cast operator conversion from fp16 to fp32 ([!4147](https://gitee.com/mindspore/mindspore/pulls/4147))
  695. - fix bug of the array out of bound in case of make_tuple operator ([!5219](https://gitee.com/mindspore/mindspore/pulls/5219))
  696. - Data processing and Pro
  697. - fix GeneratorDataset time out([!3624](https://gitee.com/mindspore/mindspore/pulls/3624))
  698. - fix concat operator get_dataset_size error([!4701](https://gitee.com/mindspore/mindspore/pulls/4701))
  699. - fixing python validator for Repeat Op([!4366](https://gitee.com/mindspore/mindspore/pulls/4366))
  700. - Third party
  701. - Sqlite : Update sqlite to 3.32.2 to handle [CVE-2020-11656](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-11656), [CVE-2020-13871](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13871), [CVE-2020-11655](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-11655), [CVE-2020-9327](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-9327), [CVE-2020-13630](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13630), [CVE-2020-15358](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-15358), [CVE-2020-13631](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13631), [CVE-2020-13632](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13632), [CVE-2020-13434](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13434), and [CVE-2020-13435](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13435).
  702. - Libjpeg-turbo : Update libjpeg-turbo to 2.0.4 to handle [CVE-2020-13790](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13790).
  703. ## Contributors
  704. Thanks goes to these wonderful people:
  705. Adel, Alexey, andy, andy_wangrui, anthonyaje, anzhengqi, askmiao, avakh, baihuawei, bingyaweng, BowenK, buxue, caifubi, CaoJian, caozhou, Cathy, changzherui, chenfei, chengxianbin, chenhaozhe, chenjianping, chentingting, chenzomi, chenzupeng, chujinjin, cjh9368, Corleone, cristoval, danish, dengyutao, eric, Eric, ervinzhang, etone-chan, fangzehua, fary86, fuzhiye, gengdongjie, genglishuai, Giancarlo, gongdaguo, gukecai, guohongzilong, GuoMengHao, hangq, hanhaocheng, hanhuifeng2020, hanjun996, Harshvardhan, He, heleiwang, hesham, hexia, Hoai, hongxing, huangdongrun, huanghui, huangxinjing, islam_amin, Jesse, jianghui58, jiangzhiwen, jin-xiulang, jinyaohui, jjfeing, John, Jonathan, jonyguo, kai00, kingfo, kpy, kswang, laiyongqiang, leilei_snow, leopz, Li, liangzelang, lianliguang, lichen_101010, lichenever, lihongkang, lilei, limingqi107, ling, lingyunli63, linqingke, lirongzhen1, liubuyu, liuwenhao4, liuxiao78, liuxiao93, liuzhongkai, Lixia, lixian, liyong, lizhenyu, looop5, luoyang, lvchangquan, lvliang, lvwenyuan, lyvette, mahdi, Mahdi, mamba_ni, maning202007, Margaret_wangrui, mayang, meixiaowei, meng_chunyang, ms_yan, nhussain, panbingao, panfengfeng, panyifeng, Payne, Peilin, peixu_ren, pengyongrong, Pengyongrong, qianlong, qujianwei, root, shenwei41, shibeiji, simson, songhonglei413, Su, sunsuodong, suteng, tao_yunhao, TFbunny, tinazhang, tom__chen, tony_liu2, tronzhang, VectorSL, wandongdong, wangdongxu, wanghua, wangmin, wangshaocong, wangzhe, wanyiming, Wei, wenchunjiang, wilfChen, WilliamLian, wsc, wukesong, wuweikang, wuxuejian, wuyongkang, xiefangqi, xuanyue, Xun, xutianchun, xuyongfei, yanghaitao, yangjie159, YangLuo, yangruoqi713, yangyongjie, yangzhenzhang, yankai, yao_yf, yelihua, yeyunpeng, Yi, yoni, yoonlee666, yuchaojie, yujianfeng, yuximiao, zhangxuetong, zhaizhiqiang, Zhang, zhangxinfeng3, zhangxuetong, zhangyihui, zhangz0911gm, zhanke, zhanyuan, zhaodezan, zhaoting, zhaozhenlong, zhengjun10, zhongligeng, zhoufeng, zhousiyi, zhouyaqiang, 
zhouyuanshen, Zichun, Zirui, zjun, zongha, ZPaC, lijiaqi, liangchenghui, wangminggui
  706. Contributions of any kind are welcome!
  707. # MindSpore 0.6.0-beta Release Notes
  708. ## Major Features and Improvements
  709. ### Ascend 910 Training and Inference Framework
  710. - New models
  711. - modelzoo is now organized into official, research and community directories.
  712. - Official is maintained with the newest APIs by the MindSpore team; MaskRCNN is added.
  713. - Research is uploaded by researchers for official review, and APIs may not be updated in time.
  714. - Community reprints the relevant links of partner research results.
  715. - Hub added on the same level as modelzoo, synchronous storage of materials needed for official hub web pages which will be launched soon.
  716. - Support pre-trained models: a few lines of code can download and load pre-trained models for inference or transfer learning.
  717. - Frontend and user interface
  718. - Supports user side operator compilation and graph execution error rendering.
  719. - Unify the definition of dynamic learning rate behavior in optimizers.
  720. - Support IndexSlice in sparse expression.
  721. - Support using the parent construct method during construct.
  722. - Support saving checkpoint files asynchronously.
  723. - Support implicit type conversion in pynative mode.
  724. - User interfaces change log
  725. - uniform learning rate behavior in optimizers([!2755](https://gitee.com/mindspore/mindspore/pulls/2755))
  726. - rename operator of sparse optimizer([!3217](https://gitee.com/mindspore/mindspore/pulls/3217))
  727. - move profiler module from mindinsight to mindspore([!3075](https://gitee.com/mindspore/mindspore/pulls/3075))
  728. - VOCDataset output change to multi-columns([!3093](https://gitee.com/mindspore/mindspore/pulls/3093))
  729. - GetDatasize feature([!3212](https://gitee.com/mindspore/mindspore/pulls/3212))
  730. - dataset: modify config api([!2936](https://gitee.com/mindspore/mindspore/pulls/2936))
  731. - Executor and performance optimization
  732. - Decouple C++ and Python to make the architecture more extensible.
  733. - Parameter Server for distributed deep learning supported.
  734. - Serving: a flexible service deployment framework for deep learning models.
  735. - Memory reuse is enhanced, and the batch size of Bert large model is increased from 96 to 160 on a single server.
  736. - Data processing, augmentation, and save format
  737. - Support MindRecord save operator after data processing
  738. - Support automatic fusion operator, such as decode/resize/crop
  739. - Support CSV dataset loading
  740. ### Other Hardware Support
  741. - GPU platform
  742. - New model supported: ResNext50, WarpCTC and GoogLeNet.
  743. - Support hyperparameter search and data-augmentation AutoML on GPU.
  744. - Support Resnet50 automatic parallel in GPU backend.
  745. ## Bugfixes
  746. - Models
  747. - Improved the performance and accuracy on ResNet50([!3456](https://gitee.com/mindspore/mindspore/pulls/3456))
  748. - Fixed the performance test case of bert([!3486](https://gitee.com/mindspore/mindspore/pulls/3486))
  749. - Python API
  750. - Fix assign used in while loop([!2720](https://gitee.com/mindspore/mindspore/pulls/2720))
  751. - Revert optimize the graph output of all nop node.([!2857](https://gitee.com/mindspore/mindspore/pulls/2857))
  752. - Print tensor as numpy.([!2859](https://gitee.com/mindspore/mindspore/pulls/2859))
  753. - Support weight decay for sparse optimizer([!2668](https://gitee.com/mindspore/mindspore/pulls/2668))
  754. - Fix BatchToSpaceND([!2741](https://gitee.com/mindspore/mindspore/pulls/2741))
  755. - Fix type check mistakes of InplaceAdd and InplaceSub ops([!2744](https://gitee.com/mindspore/mindspore/pulls/2744))
  756. - Change order param only equal to group param([!2748](https://gitee.com/mindspore/mindspore/pulls/2748))
  757. - Executor
  758. - The performance of graph with control flow is optimized([!2931](https://gitee.com/mindspore/mindspore/pulls/2931))
  759. - Fix bug of wrong number of tuple layers([!3390](https://gitee.com/mindspore/mindspore/pulls/3390))
  760. - Fix cpu multi graph memory exception([!3631](https://gitee.com/mindspore/mindspore/pulls/3631))
  761. - Enable data sync when calling operator without defining a cell([!3081](https://gitee.com/mindspore/mindspore/pulls/3081))
  762. - Fix ArgMaxWithValue error in pynative mode on GPU([!3082](https://gitee.com/mindspore/mindspore/pulls/3082))
  763. - Fix precision error with fp16 input on pynative mode([!3196](https://gitee.com/mindspore/mindspore/pulls/3196))
  764. - Data processing
  765. - Fix bug of RandomColor and RandomSharpness default parameter checking ([!2833](https://gitee.com/mindspore/mindspore/pulls/2833))
  766. - Fix process hang during training and eval ([!3469](https://gitee.com/mindspore/mindspore/pulls/3469))
  767. - Third party
  768. - Sqlite : Update sqlite to 3.32.2 to handle [CVE-2020-11656](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-11656), [CVE-2020-13871](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13871), [CVE-2020-11655](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-11655), [CVE-2020-9327](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-9327), [CVE-2020-13630](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13630), [CVE-2020-15358](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-15358), [CVE-2020-13631](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13631), [CVE-2020-13632](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13632), [CVE-2020-13434](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13434), and [CVE-2020-13435](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13435).
  769. - Libjpeg-turbo : Update libjpeg-turbo to 2.0.4 to handle [CVE-2020-13790](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13790).
  770. ## Contributors
  771. Thanks goes to these wonderful people:
  772. Alexey Shevlyakov, avakh, baihuawei, BowenK, buxue, caifubi, caojian05, Cathy Wong, changzherui, chenfei, chengxianbin, chenhaozhe, chenjianping, chentingting, chenzomi, chujinjin, Danish Farid, dayschan, dengwentao, dinghao, etone-chan, fangzehua, fary86, geekun, Giancarlo Colmenares, gong chen, gukecai, guohongzilong, hangangqiang, heleiwang, hesham, He Wei, hexia, hongxing, huangdongrun, huanghui, islam_amin, Jamie Nisbet, Jesse Lee, jiangjinsheng, jiangzhiwen, jinyaohui, jjfeing, jojobugfree, Jonathan Yan, jonyguo, Junhan Hu, Kang, kingfo, kouzhenzhong, kpy, kswang, laiyongqiang, leopz, liangzelang, lichenever, lihongkang, Li Hongzhang, lilei, limingqi107, lirongzhen1, liubuyu, liuchongming74, liuwenhao4, liuxiao, Lixia Chen, liyanliu, liyong, lizhenyu, lvliang, Mahdi, Margaret_wangrui, meixiaowei, ms_yan, nhussain, ougongchang, panfengfeng, panyifeng, peilinwang, Peilin Wang, pkuliuliu, qianlong, rick_sanchez, shibeiji, Shida He, shijianning, simson, sunsuodong, suteng, Tinazhang, Tron Zhang, unknown, VectorSL, wandongdong, wangcong, wangdongxu, wangdongxu6, wanghua, wangnan39, Wei Luning, wenchunjiang, wenkai, wilfChen, WilliamLian, wukesong, Xian Weizhao, Xiaoda Zhang, xiefangqi, xulei2020, xunxue, xutianchun, Yang, yanghaitao, yanghaitao1, yanghaoran, yangjie, yangjie159, YangLuo, Yanjun Peng, yankai, yanzhenxiang2020, yao_yf, Yi Huaijie, yoonlee666, yuchaojie, yujianfeng, zhangzhongpeng, zhangdengcheng, Zhang Qinghua, zhangyinxia, zhangz0911gm, zhaojichen, zhaoting, zhaozhenlong, zhoufeng, zhouneng, zhousiyi, Zirui Wu, Ziyan, zjun, ZPaC, lihongzhang, wangdongxu
  773. Contributions of any kind are welcome!
  774. # MindSpore 0.5.2-beta Release Notes
  775. ## Major Features and Improvements
  776. ### Ascend 910 Training and Inference Framework
  777. - New models
  778. - DenseNet121: a convolution based neural network for the task of image classification on ImageNet 2012 dataset.
  779. ## Bugfixes
  780. - Models
  781. - VGG16, AlexNet, GoogLeNet: optimize the networks for better performance. ([!5539](https://gitee.com/mindspore/mindspore/pulls/5539))
  782. - YOLOV3, fix yolov3_darknet53 dataset bug. ([!5658](https://gitee.com/mindspore/mindspore/pulls/5658))
  783. ## Contributors
  784. Thanks goes to these wonderful people:
  785. Alexey Shevlyakov, avakh, baihuawei, BowenK, buxue, caifubi, caojian05, Cathy Wong, changzherui, chenfei, chengxianbin, chenhaozhe, chenjianping, chentingting, chenzomi, chujinjin, Danish Farid, dayschan, dengwentao, dinghao, etone-chan, fangzehua, fary86, geekun, Giancarlo Colmenares, gong chen, gukecai, guohongzilong, hangangqiang, heleiwang, hesham, He Wei, hexia, hongxing, huangdongrun, huanghui, islam_amin, Jamie Nisbet, Jesse Lee, jiangjinsheng, jiangzhiwen, jinyaohui, jjfeing, jojobugfree, Jonathan Yan, jonyguo, Junhan Hu, Kang, kingfo, kouzhenzhong, kpy, kswang, laiyongqiang, leopz, liangzelang, lichenever, lihongkang, Li Hongzhang, lilei, limingqi107, lirongzhen1, liubuyu, liuchongming74, liuwenhao4, liuxiao, Lixia Chen, liyanliu, liyong, lizhenyu, lvliang, Mahdi, Margaret_wangrui, meixiaowei, ms_yan, nhussain, ougongchang, panfengfeng, panyifeng, peilinwang, Peilin Wang, pkuliuliu, qianlong, rick_sanchez, shibeiji, Shida He, shijianning, simson, sunsuodong, suteng, Tinazhang, Tron Zhang, unknown, VectorSL, wandongdong, wangcong, wangdongxu, wangdongxu6, wanghua, wangnan39, Wei Luning, wenchunjiang, wenkai, wilfChen, WilliamLian, wukesong, Xian Weizhao, Xiaoda Zhang, xiefangqi, xulei2020, xunxue, xutianchun, Yang, yanghaitao, yanghaitao1, yanghaoran, yangjie, yangjie159, YangLuo, Yanjun Peng, yankai, yanzhenxiang2020, yao_yf, Yi Huaijie, yoonlee666, yuchaojie, yujianfeng, zhangzhongpeng, zhangdengcheng, Zhang Qinghua, zhangyinxia, zhangz0911gm, zhaojichen, zhaoting, zhaozhenlong, zhoufeng, zhouneng, zhousiyi, Zirui Wu, Ziyan, zjun, ZPaC, lihongzhang, wangdongxu
  786. Contributions of any kind are welcome!
  787. # MindSpore 0.5.0-beta Release Notes
  788. ## Major Features and Improvements
  789. ### Ascend 910 Training and Inference Framework
  790. - New models
  791. - ResNext50: a simple, highly modularized network architecture using aggregated residual transformations for image classification on ImageNet 2012 dataset.
  792. - MASS: a pre-training method for sequence to sequence based language generation tasks on Text Summarization and Conversational Response Generation using News Crawls 2007-2017 dataset, Gigaword corpus and Cornell movie dialog corpus.
  793. - Transformer: a neural network architecture for language understanding on WMT 2014 English-German dataset.
  794. - GCN: Graph Convolutional Networks for node classification in a graph on the Cora and CiteSeer datasets.
  795. - GAT: an attention-based graph neural network for node classification on the Cora and CiteSeer datasets.
  796. - Frontend and user interface
  797. - Support tensor value and assignment of mixed tensor index in graph mode.
  798. - Support tensor comparison, len operator, constexpr syntax, value and assignment of tensor index in pynative mode.
  799. - Support converting MindSpore IR to pb format for model inference.
  800. - Support the print operator writing data directly to the hard disk.
  801. - Add the double recursive programming solution for very high speed parallel strategy search in automatic parallel.
  802. - User interfaces change log
  803. - Allow the learning rate of AdamWeightDecayDynamicLR and Lamb to be 0([!1826](https://gitee.com/mindspore/mindspore/pulls/1826))
  804. - Restrict the input parameters of the entire network to Tensor([!1967](https://gitee.com/mindspore/mindspore/pulls/1967))
  805. - Turn shape and dtype into attributes instead of interfaces([!1919](https://gitee.com/mindspore/mindspore/pulls/1919))
  806. - Delete multitypefungraph([!2116](https://gitee.com/mindspore/mindspore/pulls/2116))
  807. - Refactor the callback module in an encapsulated way, use _CallbackManager instead of _build_callbacks([!2236](https://gitee.com/mindspore/mindspore/pulls/2236))
  808. - Delete EmbeddingLookup([!2163](https://gitee.com/mindspore/mindspore/pulls/2163))
  809. - Add model_type to Checkpoint([!2517](https://gitee.com/mindspore/mindspore/pulls/2517))
  810. - Executor and performance optimization
  811. - Heterogeneous execution on CPU and Ascend devices is supported and has been verified with the Wide&Deep model.
  812. - Quantitative training of MobileNetV2, Lenet and Resnet50 on Ascend-910 is supported.
  813. - Support new fusion architecture, which can do fusion optimization across graphs and kernels to improve execution speed.
  814. - Data processing, augmentation, and save format
  815. - Support data processing pipeline performance profiling.
  816. - Support public dataset loading, such as CLUE and Coco.
  817. - Support more text processing, such as more tokenizers and vocab data.
  818. - Support MindRecord padded data.
  819. ### Other Hardware Support
  820. - GPU platform
  821. - New model supported: Bert / Wide&Deep.
  822. - Support setting max device memory.
  823. - CPU platform
  824. - New model supported: LSTM.
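The interface change that turns `shape` and `dtype` into attributes ([!1919](https://gitee.com/mindspore/mindspore/pulls/1919)) affects how calling code reads them. A minimal sketch of the attribute style in plain Python follows; `TensorLike` is a hypothetical stand-in used only for illustration and is not MindSpore's actual `Tensor` class.

```python
class TensorLike:
    """Hypothetical stand-in for a tensor; not MindSpore's Tensor class."""

    def __init__(self, rows, dtype="float32"):
        self._rows = rows
        self._dtype = dtype

    @property
    def shape(self):
        # Exposed as an attribute: read `t.shape`, not `t.shape()`.
        n_cols = len(self._rows[0]) if self._rows else 0
        return (len(self._rows), n_cols)

    @property
    def dtype(self):
        # Also an attribute rather than a method call.
        return self._dtype


t = TensorLike([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]])
print(t.shape)  # (2, 3)
print(t.dtype)  # float32
```

Code that previously called these as methods would need to drop the parentheses after such a change.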
  825. ## Bugfixes
  826. - Models
  827. - Bert, Move Bert from `example` to `model_zoo`, optimize network for better performance. ([!1902](https://gitee.com/mindspore/mindspore/pulls/1902))
  828. - VGG16, Move VGG16 from `example` to `model_zoo`, optimize network for better accuracy. ([!2645](https://gitee.com/mindspore/mindspore/pulls/2645))
  829. - Alexnet, modify parameter setting to improve accuracy ([!1364](https://gitee.com/mindspore/mindspore/pulls/2370))
  830. - Wide&Deep, Move Wide&Deep from `example` to `model_zoo`, optimize network for better performance. ([!2221](https://gitee.com/mindspore/mindspore/pulls/2221))
  831. - Python API
  832. - Fix bug in auto cast([!1766](https://gitee.com/mindspore/mindspore/pulls/1766))
  833. - Fix bug of register_backward_hook([!2148](https://gitee.com/mindspore/mindspore/pulls/2148))
  834. - Fix bug of tuple args in pynative mode([!1878](https://gitee.com/mindspore/mindspore/pulls/1878))
  835. - Fix bug of checking numbers of arguments and graph parameters([!1701](https://gitee.com/mindspore/mindspore/pulls/1701))
  836. - Executor
  837. - Fix bug of loading input data repeatedly in pynative mode([!1966](https://gitee.com/mindspore/mindspore/pulls/1966))
  838. - Fix bug of list cannot be used as input in pynative mode([!1765](https://gitee.com/mindspore/mindspore/pulls/1765))
  839. - Fix bug of kernel select ([!2103](https://gitee.com/mindspore/mindspore/pulls/2103))
  840. - Fix bug of pattern matching for batchnorm fusion in the case of auto mix precision.([!1851](https://gitee.com/mindspore/mindspore/pulls/1851))
  841. - Fix bug of generate hccl's kernel info.([!2393](https://gitee.com/mindspore/mindspore/pulls/2393))
  842. - GPU platform
  843. - Fix bug of summary feature invalid([!2173](https://gitee.com/mindspore/mindspore/pulls/2173))
  844. - Data processing
  845. - Fix bug of Cifar dataset reading([!2096](https://gitee.com/mindspore/mindspore/pulls/2096))
  846. - Fix bug of C++ behavior in RandomCropAndResize([!2026](https://gitee.com/mindspore/mindspore/pulls/2026))
  847. - Fix the bug of mindrecord shuffle([!2420](https://gitee.com/mindspore/mindspore/pulls/2420))
  848. - Third party
  849. - SQLite: Update SQLite to 3.32.2 to handle [CVE-2020-11656](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-11656), [CVE-2020-13871](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13871), [CVE-2020-11655](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-11655), [CVE-2020-9327](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-9327), [CVE-2020-13630](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13630), [CVE-2020-15358](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-15358), [CVE-2020-13631](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13631), [CVE-2020-13632](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13632), [CVE-2020-13434](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13434), and [CVE-2020-13435](https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-13435).
  850. ## Contributors
  851. Thanks goes to these wonderful people:
  852. Alexey Shevlyakov, avakh, baihuawei, BowenK, buxue, caifubi, caojian05, Cathy Wong, changzherui, chenfei, chengxianbin, chenhaozhe, chenjianping, chentingting, chenzomi, chujinjin, Danish Farid, dayschan, dengwentao, dinghao, etone-chan, fangzehua, fary86, geekun, Giancarlo Colmenares, gong chen, gukecai, guohongzilong, hangangqiang, heleiwang, hesham, He Wei, hexia, hongxing, huangdongrun, huanghui, islam_amin, Jamie Nisbet, Jesse Lee, jiangjinsheng, jiangzhiwen, jinyaohui, jjfeing, jojobugfree, Jonathan Yan, jonyguo, Junhan Hu, Kang, kingfo, kouzhenzhong, kpy, kswang, laiyongqiang, leopz, liangzelang, lichenever, lihongkang, Li Hongzhang, lilei, limingqi107, lirongzhen1, liubuyu, liuchongming74, liuwenhao4, liuxiao, Lixia Chen, liyanliu, liyong, lizhenyu, lvliang, Mahdi, Margaret_wangrui, meixiaowei, ms_yan, nhussain, ougongchang, panfengfeng, panyifeng, peilinwang, Peilin Wang, pkuliuliu, qianlong, rick_sanchez, shibeiji, Shida He, shijianning, simson, sunsuodong, suteng, Tinazhang, Tron Zhang, unknown, VectorSL, wandongdong, wangcong, wangdongxu, wangdongxu6, wanghua, wangnan39, Wei Luning, wenchunjiang, wenkai, wilfChen, WilliamLian, wukesong, Xian Weizhao, Xiaoda Zhang, xiefangqi, xulei2020, xunxue, xutianchun, Yang, yanghaitao, yanghaitao1, yanghaoran, yangjie, yangjie159, YangLuo, Yanjun Peng, yankai, yanzhenxiang2020, yao_yf, Yi Huaijie, yoonlee666, yuchaojie, yujianfeng, zhangzhongpeng, zhangdengcheng, Zhang Qinghua, zhangyinxia, zhangz0911gm, zhaojichen, zhaoting, zhaozhenlong, zhoufeng, zhouneng, zhousiyi, Zirui Wu, Ziyan, zjun, ZPaC, lihongzhang, wangdongxu
  853. Contributions of any kind are welcome!
  854. # MindSpore 0.3.1-alpha Release Notes
  855. ## Major Features and Improvements
  856. ### Ascend 910 Training and Inference Framework
  857. - Frontend and User Interface
  858. - Independent model init interface.
  859. - Data processing, augmentation, and save format
  860. - Support sample padding for minddataset.
  861. ## Bugfixes
  862. - Python API
  863. - Fix bugs in the lars optimizer([!1894](https://gitee.com/mindspore/mindspore/pulls/1894))
  864. - Data processing
  865. - Fix accuracy problem of RandomCropDecodeResize ([!2340](https://gitee.com/mindspore/mindspore/pulls/2340))
  866. # MindSpore 0.3.0-alpha Release Notes
  867. ## Major Features and Improvements
  868. ### Ascend 910 Training and Inference Framework
  869. - New models
  870. - DeepFM: a factorization-machine based neural network for CTR prediction on Criteo dataset.
  871. - DeepLabV3: significantly improves over previous DeepLab versions without DenseCRF post-processing and attains comparable performance with other state-of-the-art models on the PASCAL VOC 2007 semantic image segmentation benchmark.
  872. - Faster-RCNN: towards real-time object detection with region proposal networks on COCO 2017 dataset.
  873. - SSD: a single-stage object detection method on COCO 2017 dataset.
  874. - GoogLeNet: a deep convolutional neural network architecture codenamed Inception V1 for classification and detection on CIFAR-10 dataset.
  875. - Wide&Deep: jointly trained wide linear models and deep neural networks for recommender systems on Criteo dataset.
  876. - Frontend and User Interface
  877. - Complete numpy advanced indexing method. Supports value and assignment through tensor index.
  878. - Some optimizers support separating parameter groups. Different parameter groups can set different `learning_rate` and `weight_decay`.
  879. - Support setting submodule's logging level independently, e.g. you can set logging level of module `A` to warning and set logging level of module `B` to info.
  880. - Support weights to be compiled according to shape to solve the problem of large memory overhead.
  881. - Add operator implementations and syntax support in pynative mode for consistency with graph mode.
  882. - User interfaces change log
  883. - Support group params with separate learning rate and weight decay([!637](https://gitee.com/mindspore/mindspore/pulls/637))
  884. - Support weights to be compiled according to shape([!1015](https://gitee.com/mindspore/mindspore/pulls/1015))
  885. - Delete some context parameters([!1100](https://gitee.com/mindspore/mindspore/pulls/1100))
  886. - ImageSummary/ScalarSummary/TensorSummary/HistogramSummary([!1329](https://gitee.com/mindspore/mindspore/pulls/1329))([!1425](https://gitee.com/mindspore/mindspore/pulls/1425))
  887. - Executor and Performance Optimization
  888. - Support running evaluation during the training process, so that the accuracy of training can be easily obtained.
  889. - Enable second-order optimization for resnet50, which can achieve 75.9% accuracy in 45 epochs (Resnet50 @ImageNet).
  890. - Optimize pynative implementation and improve its execution performance.
  891. - Optimize summary record implementation and improve its performance.
  892. - Data processing, augmentation, and save format
  893. - Support simple text processing, such as tokenizer/buildvocab/lookup.
  894. - Support padding batch.
  895. - Support split or concat dataset.
  896. - Support MindDataset reading from file list.
  897. ### Other Hardware Support
  898. - GPU platform
  899. - New models supported: MobileNetV2, MobileNetV3.
  900. - Support mixed precision training.
  901. - Support device memory swapping.
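The parameter-group feature listed above lets different groups carry their own `learning_rate` and `weight_decay`. The idea can be sketched in plain Python as follows. This is an illustrative SGD step under assumed names (`sgd_step`, the `"params"`/`"lr"`/`"weight_decay"` dictionary keys, and the toy parameter records are all hypothetical), not MindSpore's optimizer implementation.

```python
def sgd_step(param_groups, default_lr=0.1, default_weight_decay=0.0):
    """Apply one SGD update in place; each group may override lr and weight decay."""
    for group in param_groups:
        lr = group.get("lr", default_lr)
        wd = group.get("weight_decay", default_weight_decay)
        for p in group["params"]:
            # L2 weight decay contributes wd * value to the gradient.
            p["value"] -= lr * (p["grad"] + wd * p["value"])


# Toy parameters: each record holds a scalar value and its gradient.
conv_w = {"value": 1.0, "grad": 0.5}
bias = {"value": 1.0, "grad": 0.5}

groups = [
    {"params": [conv_w], "lr": 0.01, "weight_decay": 0.1},  # custom settings
    {"params": [bias]},                                      # falls back to defaults
]
sgd_step(groups)
print(conv_w["value"], bias["value"])
```

With these inputs the first group is updated with the smaller learning rate plus weight decay, while the second uses the optimizer-wide defaults.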
  902. ## Bugfixes
  903. - Python API
  904. - Fix an exception in the broadcast input data type check([!712](https://gitee.com/mindspore/mindspore/pulls/712))
  905. - Fix issue of assignsub returning value 0([!1036](https://gitee.com/mindspore/mindspore/pulls/1036))
  906. - Fix issue Conv2dBackpropInput bprop should return 3 instead of 2 items([!1001](https://gitee.com/mindspore/mindspore/pulls/1001))
  907. - Fix sens shape error of TrainOneStepWithLossScaleCell([!1050](https://gitee.com/mindspore/mindspore/pulls/1050))
  908. - Fix BatchNormGrad operator([!1344](https://gitee.com/mindspore/mindspore/pulls/1344))
  909. - Executor
  910. - Fix dropout, topK and addn errors in PyNative mode ([!1285](https://gitee.com/mindspore/mindspore/pulls/1285), [!1138](https://gitee.com/mindspore/mindspore/pulls/1138), [!1033](https://gitee.com/mindspore/mindspore/pulls/1033)).
  911. - Fix memory leaks after execution in PyNative mode ([!1201](https://gitee.com/mindspore/mindspore/pulls/1201)).
  912. - Fix HCCL failure in some special scenes ([!1204](https://gitee.com/mindspore/mindspore/pulls/1204), [!1252](https://gitee.com/mindspore/mindspore/pulls/1252)).
  913. - Fix SSD network failure when Select cannot find kernel info([!1449](https://gitee.com/mindspore/mindspore/pulls/1449)).
  914. - Fix Topk operator selection strategy bug between aicore and aicpu([!1367](https://gitee.com/mindspore/mindspore/pulls/1367)).
  915. - Fix input memory size of 'assign' op unequal in control sink mode when assigning a data from one child graph to another child graph([!802](https://gitee.com/mindspore/mindspore/pulls/802)).
  916. - Fix allreduce ir inconsistency([!989](https://gitee.com/mindspore/mindspore/pulls/989)).
  917. - GPU platform
  918. - Fix summary for gradient collection ([!1364](https://gitee.com/mindspore/mindspore/pulls/1364))
  919. - Fix the slice operator ([!1489](https://gitee.com/mindspore/mindspore/pulls/1489))
  920. - Data processing
  921. - Fix memory problems of GeneratorDataset of sub-process ([!907](https://gitee.com/mindspore/mindspore/pulls/907))
  922. - Fix data fetch timeout when training LeNet on the cifar10 dataset([!1391](https://gitee.com/mindspore/mindspore/pulls/1391))
  923. ## Contributors
  924. Thanks goes to these wonderful people:
  925. Alexey Shevlyakov, Amir Lashkari, anthony, baihuawei, biffex, buxue, caifubi, candanzg, caojian05, Cathy Wong, changzherui, chenfei, chengxianbin, chenhaozhe, chenzomi, chujinjin, cristoval, dengwentao, eric, etone-chan, fary86, gaojing, gengdongjie, gongchen, guohongzilong, guozhijian, heleiwang, hesham, He Wei, Hoai Linh Tran, hongxing, huangdongrun, huanghui, Jamie Nisbet, Jesse Lee, jiangjinsheng, jiangzhiwen, jinyaohui, jjfeing, jonwe, jonyguo, Junhan Hu, Kang, kingfo, kswang, laiyongqiang, leopz, lichenever, lihongkang, limingqi107, liubuyu, liuliyan2, liuwenhao4, liuxiao, liuxiao, liyong, lizhenyu, lvliang, Margaret_wangrui, meixiaowei, ms_yan, Nat Sutyanyong, ougongchang, panfengfeng, panyifeng, Peilin Wang, peixu_ren, qianlong, rick_sanchez, seatea, sheng, shijianning, simson, sunsuodong, Tinazhang, VectorSL, wandongdong, wangcong, wanghua, wangnan39, Wei Luning, wenchunjiang, wilfChen, WilliamLian, wsc, wukesong, wuxuejian, Xiaoda Zhang, xiefangqi, xulei2020, Yang, yangjie159, yangruoqi713, yangyongjie, yangzhenzhang, Yanjun Peng, yanzhenxiang2020, yao_yf, Yi Huaijie, yoonlee666, yujianfeng, YuJianfeng, yvetteliu, zhangdengcheng, Zhang Qinghua, zhangz0911gm, zhaojichen, zhaoting, zhaozhenlong, zhoufeng, zhouneng, zhousiyi, zhouyuanshen, Zirui Wu, Ziyan, zjun, ZPaC, lihongzhang
  926. Contributions of any kind are welcome!
  927. # MindSpore 0.2.0-alpha Release Notes
  928. ## Major Features and Improvements
  929. ### Ascend 910 Training and Inference Framework
  930. - New models
  931. - MobileNetV2: Inverted Residuals and Linear Bottlenecks.
  932. - ResNet101: Deep Residual Learning for Image Recognition.
  933. - Frontend and User Interface
  934. - Support for all python comparison operators.
  935. - Support for math operators `**`, `//`, `%`. Support for other Python operators such as `and`, `or`, `not`, `is`, `is not`, `in`, `not in`.
  936. - Support for the gradients of function with variable arguments.
  937. - Support for tensor indexing assignment for certain indexing type.
  938. - Support for dynamic learning rate.
  939. - User interfaces change log
  940. - DepthwiseConv2dNative, DepthwiseConv2dNativeBackpropFilter, DepthwiseConv2dNativeBackpropInput([!424](https://gitee.com/mindspore/mindspore/pulls/424))
  941. - ReLU6, ReLU6Grad([!224](https://gitee.com/mindspore/mindspore/pulls/224))
  942. - GeneratorDataset([!183](https://gitee.com/mindspore/mindspore/pulls/183))
  943. - VOCDataset([!477](https://gitee.com/mindspore/mindspore/pulls/477))
  944. - MindDataset, PKSampler([!514](https://gitee.com/mindspore/mindspore/pulls/514))
  945. - map([!506](https://gitee.com/mindspore/mindspore/pulls/506))
  946. - Conv([!226](https://gitee.com/mindspore/mindspore/pulls/226))
  947. - Adam([!253](https://gitee.com/mindspore/mindspore/pulls/253))
  948. - _set_fusion_strategy_by_idx, _set_fusion_strategy_by_size([!189](https://gitee.com/mindspore/mindspore/pulls/189))
  949. - CheckpointConfig([!122](https://gitee.com/mindspore/mindspore/pulls/122))
  950. - Constant([!54](https://gitee.com/mindspore/mindspore/pulls/54))
  951. - Executor and Performance Optimization
  952. - Support parallel execution of data prefetching and forward/backward computing.
  953. - Support parallel execution of gradient aggregation and forward/backward computing in distributed training scenarios.
  954. - Support operator fusion optimization.
  955. - Optimize compilation process and improve the performance.
  956. - Data processing, augmentation, and save format
  957. - Support multi-process of GeneratorDataset/PyFunc for high performance
  958. - Support variable batchsize
  959. - Support new Dataset operators, such as filter, skip, take, TextLineDataset
  960. ### Other Hardware Support
  961. - GPU platform
  962. - Use dynamic memory pool by default on GPU.
  963. - Support parallel execution of computation and communication.
  964. - Support continuous address allocation by memory pool.
  965. - CPU platform
  966. - Support for Windows 10 OS.
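The `filter`, `skip`, and `take` dataset operators named above follow the usual semantics of lazy pipeline stages. The sketch below shows those semantics with plain Python iterators; the helper names `ds_filter`, `ds_skip`, and `ds_take` are hypothetical and do not reflect MindSpore's dataset API.

```python
from itertools import islice


def ds_filter(rows, predicate):
    """Keep only the rows for which predicate(row) is true."""
    return (row for row in rows if predicate(row))


def ds_skip(rows, count):
    """Drop the first `count` rows."""
    return islice(rows, count, None)


def ds_take(rows, count):
    """Yield at most the first `count` rows."""
    return islice(rows, count)


rows = range(10)
# Keep even numbers, drop the first of them, then take three.
pipeline = ds_take(ds_skip(ds_filter(rows, lambda x: x % 2 == 0), 1), 3)
result = list(pipeline)
print(result)  # [2, 4, 6]
```

Because each stage is lazy, rows are only produced when the final consumer iterates over the pipeline.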
  967. ## Bugfixes
  968. - Models
  969. - Fix mixed precision bug for VGG16 model ([!629](https://gitee.com/mindspore/mindspore/pulls/629)).
  970. - Python API
  971. - Fix ControlDepend operator bugs on CPU and GPU ([!396](https://gitee.com/mindspore/mindspore/pulls/396)).
  972. - Fix ArgMinWithValue operator bugs ([!338](https://gitee.com/mindspore/mindspore/pulls/338)).
  973. - Fix Dense operator bugs on PyNative mode ([!276](https://gitee.com/mindspore/mindspore/pulls/276)).
  974. - Fix MatMul operator bugs on PyNative mode ([!288](https://gitee.com/mindspore/mindspore/pulls/288)).
  975. - Executor
  976. - Fix operator selection bugs and make it general ([!300](https://gitee.com/mindspore/mindspore/pulls/300)).
  977. - Fix memory reuse bug for GetNext op ([!291](https://gitee.com/mindspore/mindspore/pulls/291)).
  978. - GPU platform
  979. - Fix memory allocation in multi-graph scenarios ([!444](https://gitee.com/mindspore/mindspore/pulls/444)).
  980. - Fix bias_add_grad under fp16 precision ([!598](https://gitee.com/mindspore/mindspore/pulls/598)).
  981. - Fix support for fp16 kernels on NVIDIA 1080Ti ([!571](https://gitee.com/mindspore/mindspore/pulls/571)).
  982. - Fix parsing of tuple type parameters ([!316](https://gitee.com/mindspore/mindspore/pulls/316)).
  983. - Data processing
  984. - Fix TypeError raised when pickling mindspore._c_dataengine.DEPipeline objects ([!434](https://gitee.com/mindspore/mindspore/pulls/434)).
  985. - Add TFRecord file verification([!406](https://gitee.com/mindspore/mindspore/pulls/406)).
  986. ## Contributors
  987. Thanks goes to these wonderful people:
  988. Alexey_Shevlyakov, Cathy, Chong, Hoai, Jonathan, Junhan, JunhanHu, Peilin, SanjayChan, StrawNoBerry, VectorSL, Wei, WeibiaoYu, Xiaoda, Yanjun, YuJianfeng, ZPaC, Zhang, ZhangQinghua, ZiruiWu, amongo, anthonyaje, anzhengqi, biffex, caifubi, candanzg, caojian05, casgj, cathwong, ch-l, chang, changzherui, chenfei, chengang, chenhaozhe, chenjianping, chentingting, chenzomi, chujinjin, dengwentao, dinghao, fanglei, fary86, flywind, gaojing, geekun, gengdongjie, ghzl, gong, gongchen, gukecai, guohongzilong, guozhijian, gziyan, h.farahat, hesham, huangdongrun, huanghui, jiangzhiwen, jinyaohui, jjfeing, jojobugfree, jonathan_yan, jonyguo, jzw, kingfo, kisnwang, laiyongqiang, leonwanghui, lianliguang, lichen, lichenever, limingqi107, liubuyu, liuxiao, liyong, liyong126, lizhenyu, lupengcheng, lvliang, maoweiyong, ms_yan, mxm, ougongchang, panfengfeng, panyifeng, pengyanjun, penn, qianlong, seatea, simson, suteng, thlinh, vlne-v1, wangchengke, wanghua, wangnan39, wangqiuliang, wenchunjiang, wenkai, wukesong, xiefangqi, xulei, yanghaitao, yanghaoran, yangjie159, yangzhenzhang, yankai10, yanzhenxiang2020, yao_yf, yoonlee666, zhangbuxue, zhangz0911gm, zhangzheng, zhaojichen, zhaoting, zhaozhenlong, zhongligeng, zhoufeng, zhousiyi, zjun, zyli2020, yuhuijun, limingqi107, lizhenyu, chenweifeng.
  989. Contributions of any kind are welcome!
  990. # MindSpore 0.1.0-alpha Release Notes
  991. ## Main Features
  992. ### Ascend 910 Training and Inference Framework
  993. - Recommended OS: Ubuntu 16.04 (or later) or EulerOS 2.5 or EulerOS 2.8
  994. - Python version: 3.7.5
  995. - Preset models
  996. - ResNet-50: residual structure-based convolutional neural network (CNN) for image classification, which is widely used.
  997. - AlexNet: classic CNN for image classification, achieving historical results in ImageNet LSVRC-2012.
  998. - LeNet: classic CNN for image classification, which was proposed by Yann LeCun.
  999. - VGG16: classic CNN for image classification, which was proposed by Oxford Visual Geometry Group.
  1000. - YoloV3: real-time object detection network.
  1001. - NEZHA: BERT-based Chinese pre-training network produced by Huawei Noah's Ark Laboratory.
  1002. - Execution modes
  1003. - Graph mode: provides graph optimization methods such as memory overcommitment, IR fusion, and buffer fusion to achieve optimal execution performance.
  1004. - PyNative mode: single-step execution mode, facilitating process debugging.
  1005. - Debugging capability and methods
  1006. - Save CheckPoints and Summary data during training.
  1007. - Support asynchronous printing.
  1008. - Dump the computing data.
  1009. - Support profiling analysis of the execution process performance.
  1010. - Distributed execution
  1011. - Support AllReduce, AllGather, and BroadCast collective communication.
  1012. - AllReduce data parallel: Each device obtains different training data, which accelerates the overall training process.
  1013. - Collective communication-based layerwise parallel: Models are divided and allocated to different devices to solve the problem of insufficient memory for large model processing and improve the training speed.
  1014. - Automatic parallel mode: The better data and model parallel mode can be predicted based on the cost model. It is recommended that this mode be used on ResNet series networks.
  1015. - Automatic differentiation
  1016. - Implement automatic differentiation based on Source to Source.
  1017. - Support distributed scenarios and automatic insertion of reverse communication operators.
  1018. - Data processing, augmentation, and save format
  1019. - Load common datasets such as ImageNet, MNIST, CIFAR-10, and CIFAR-100.
  1020. - Support common data loading pipeline operations, such as shuffle, repeat, batch, map, and sampler.
  1021. - Provide basic operator libraries to cover common CV scenarios.
  1022. - Allow users to customize Python data augmentation operators through the Pyfunc mechanism.
  1023. - Support the access of user-defined datasets through the GeneratorDataset mechanism.
  1024. - Provide the MindSpore data format, data aggregation and storage, random access example, data partition, efficient parallel read, user-defined index, and dataset search.
  1025. - Convert user datasets to the MindSpore data format.
  1026. - After data processing and augmentation, provide training applications in feed and graph modes.
  1027. - FP32/16 mixed precision computation, supporting automatic and manual configuration
  1028. - Provide common operators such as nn, math, and array, which can be customized.
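The AllReduce data-parallel mode described above gives each device a different slice of the training data and then combines the gradients. A minimal single-process simulation in plain Python follows; the model (`y = w * x`), the data shards, and the helper names `local_grad` and `all_reduce_mean` are assumptions made for illustration, and no real collective communication library is involved.

```python
def local_grad(w, batch):
    """Gradient of mean squared error for y = w * x on one device's batch."""
    return [sum(2 * (w * x - y) * x for x, y in batch) / len(batch)]


def all_reduce_mean(per_device_grads):
    """Average gradients element-wise; every device receives the same result."""
    n = len(per_device_grads)
    averaged = [sum(vals) / n for vals in zip(*per_device_grads)]
    return [list(averaged) for _ in range(n)]


# Two simulated devices, each holding a different shard of (x, y) pairs.
shards = [
    [(1.0, 2.0), (2.0, 4.0)],
    [(3.0, 6.0), (4.0, 8.0)],
]

w = 0.0
grads = [local_grad(w, shard) for shard in shards]
synced = all_reduce_mean(grads)

# Every device applies the same averaged gradient, so the weights stay in sync.
w -= 0.01 * synced[0][0]
print(w)
```

Since all devices apply an identical update, the replicas remain consistent without exchanging the weights themselves.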
  1029. ### Inference Deployment
  1030. - Deploy models in MindSpore format on the Ascend 310 platform for inference.
  1031. - Save models in ONNX format.
  1032. - Support saving models in LITE format and running models based on the lightweight inference framework.
  1033. - Recommended OS: Android 4.3 or later
  1034. - Supported network type: LeNet
  1035. - Provide the generalization operators generated by TVM and operators generated after specific networks are tuned.
  1036. ### Other Hardware Support
  1037. - GPU platform training
  1038. - Recommended OS: Ubuntu 16.04
  1039. - CUDA version: 9.2 or 10.1
  1040. - CuDNN version: 7.6 or later
  1041. - Python version: 3.7.5
  1042. - NCCL version: 2.4.8-1
  1043. - OpenMPI version: 3.1.5
  1044. - Supported models: AlexNet, LeNet, and LSTM
  1045. - Supported datasets: MNIST and CIFAR-10
  1046. - Support data parallel.
  1047. - CPU platform training
  1048. - Recommended OS: Ubuntu 16.04
  1049. - Python version: 3.7.5
  1050. - Supported model: LeNet
  1051. - Supported dataset: MNIST
  1052. - Provide only the stand-alone operation version.
  1053. ## Peripherals and Tools
  1054. - [MindSpore Official Website](https://www.mindspore.cn/)
  1055. - [MindInsight Visualization Debugging and Optimization](https://gitee.com/mindspore/mindinsight)
  1056. - [MindArmour Model Security Hardening Package](https://gitee.com/mindspore/mindarmour)
  1057. - [GraphEngine Computational Graph Engine](https://gitee.com/mindspore/graphengine)