i-robot
4f55a3f52f
!26275 [API] Tensor python float trans to ms float32
Merge pull request !26275 from kingxian/master
4 years ago
lianliguang
a6e5dcfb22
if parameter total larger than 1T split to another file
4 years ago
kingxian
545dc83fcb
fix: `cell` does not support deep copy, tensor default dtype -> float32
4 years ago
i-robot
1b8c2ff0e9
!26414 fault_recover_by_mirror_group_fix_opt_shard
Merge pull request !26414 from yao_yf/fault_recover_by_mirror_group_fix_opt_shard
4 years ago
i-robot
4f34c32c6d
!26235 AutoTune v1
Merge pull request !26235 from danishfarid/autotune
4 years ago
Rescue
da986710f0
Updated concat-zip check and UT.
4 years ago
hesham
40797c40da
AutoTune
4 years ago
i-robot
70363899e7
!24080 [assistant][ops] Add new audio operator ComputeDeltas
Merge pull request !24080 from Rainfor/compute_detlas
4 years ago
i-robot
c6821bde0a
!21648 [assistant][ops] Add new data loading operator YesNoDataset
Merge pull request !21648 from 杨旭华/YesNoDataset
4 years ago
vddong
f465cd0242
[feat][assistant][I3T96L]add new dataset operator YesNo
4 years ago
i-robot
f38df5c888
!26234 New dataset config option, auto_offload
Merge pull request !26234 from markuskunej/offload_config
4 years ago
yao_yf
188d39da83
slice_activation_in_recompute
slice recompute activation
4 years ago
i-robot
7559d5b798
!26494 [Auto parallel] Adjusting sharding propagation
Merge pull request !26494 from Xiaoda/102-adjusting-sharding-propagation
4 years ago
yao_yf
01dc4bbdf9
fix fault recover in optimizer shard
4 years ago
i-robot
3c39afad11
!24547 [MS][crowdfunding]New operator implementation, AutoAugment
Merge pull request !24547 from yangwm/autoaugment
4 years ago
markuskunej
29cf442521
Added dataset.config.set_auto_offload(), which will automatically attempt to offload dataset map operations if set to True.
4 years ago
i-robot
117df75afc
!26126 MD Profiling - Add Start Stop Python UT
Merge pull request !26126 from cathwong/ckw_ut_prof_startstop
4 years ago
zhuchuang
4c3931f5c9
[feat] [assistant] [I3CKEE] add new audio operator ComputeDeltas
4 years ago
Xiaoda Zhang
df67e74eaf
making sharding_propagation smooth, add a reshape justification:
1) when propagate sharding strategy from one op to another, try to find the strategy with zero communication cost;
2) if there is no such strategy, find the strategy with minimum communication cost, and raise a warning;
4 years ago
i-robot
9f52343a6a
!26350 add check for resizenearestneighbor parallel op
Merge pull request !26350 from yangzhenzhang/add-check-for-resize-op
4 years ago
despicablemme
09897ce7ec
[feat] [assistant] [I40GZP] add new data op AutoAugment
4 years ago
i-robot
707e7fbbf9
!26453 [Fallback] Add some fallback numpy test cases
Merge pull request !26453 from Margaret_wangrui/fallback_tests
4 years ago
Margaret_wangrui
c6582b9390
[Fallback] Add some fallback numpy test cases
4 years ago
Cathy Wong
b1094b9d81
MD Profiling - Add Start Stop Python UT
4 years ago
i-robot
4e88f2e5e8
!26345 add examples of function grad, jvp and vjp
Merge pull request !26345 from chenzhuo/jvp
4 years ago
i-robot
36517de69c
!24519 [assistant][ops] Add new audio operator SlidingWindowCmn.
Merge pull request !24519 from Rainfor/SlidingWindowCmn
4 years ago
i-robot
9f2e1edc00
!26279 neighborexchangev2 fix bug
Merge pull request !26279 from TuDouNi/neighborexchangev2_bug
4 years ago
wangkc123
706f9e2cbb
[feat] [assistant] [I3CKEH] add new audio operator SlidingWindowCmn
4 years ago
yangzhenzhang
ba99e4c505
add check for resize op
4 years ago
i-robot
58b69a05ee
!20576 [assistant][ops] add new dataset loading operator LJSpeechDataset
Merge pull request !20576 from 杨旭华/LJSpeechDataset
4 years ago
chenzhuo
d07f24462c
add examples of grad, jvp and vjp
4 years ago
ttudu
33ac1de062
fix bug
4 years ago
liyong
9663884db3
fix occasional failed in random sampler
4 years ago
i-robot
5af8572cd7
!25488 add the set_dump API
Merge pull request !25488 from wenkai/wk1027setdump1
4 years ago
i-robot
0113b3eee1
!26236 Add MD Profiler save()
Merge pull request !26236 from Mohammad Motallebi/add_profiler_save
4 years ago
mohammad
5c8ab5f60c
add MD Profiler Save()
4 years ago
wenkai
d2eca80d40
add the set_dump API, see I4AUIR for details
4 years ago
i-robot
9dfe08b805
!25909 [Fallback] Add global Py param when processing global namespace info
Merge pull request !25909 from huangbingjian/update_global_params
4 years ago
i-robot
9522ee9686
!25703 [assistant][ops] Add math operator LpNorm
Merge pull request !25703 from 孟权令/LpNorm
4 years ago
vddong
6fe2d35ae2
[feat][assistant][I3T96D] add new dataset loading operator LJSpeechDataset
4 years ago
i-robot
e154dca4b1
!22193 [assistant][ops] Add new loader operator DBpedia
Merge pull request !22193 from 杨旭华/DBpedia
4 years ago
i-robot
e773f0621e
!25918 add Function jvp and vjp
Merge pull request !25918 from chenzhuo/jvp
4 years ago
i-robot
5869e02fa1
!26135 Enable tuple input of optimizer as variable
Merge pull request !26135 from LiangZhibo/broaden3
4 years ago
Carry955
0d26c38693
[feat][assistant][I40GXT] add new loader DBpedia
4 years ago
i-robot
7a73bae5c3
!26036 add output strategy for matmul operator
Merge pull request !26036 from yangzhenzhang/add-output-strategy-for-op-init
4 years ago
i-robot
e03544685b
!26049 [Auto parallel] Support reshape in sharding propagation
Merge pull request !26049 from Xiaoda/99-ad-reshape-adjusting-in-sharding-propagation
4 years ago
chenzhuo
f36edec3ff
add function jvp
4 years ago
i-robot
1792d5f8a4
!26131 Offload Hardware Acceleration Computing
Merge pull request !26131 from alashkari/hw_accel_move_copy
4 years ago
i-robot
b910870ecc
!19986 [assistant][ops] Add new dataset loading operator AGNEWS
Merge pull request !19986 from 杨旭华/AGNEWS
4 years ago
Xiaoda Zhang
a772767265
support reshape in sharding propagation:
1) using 'swc index of strategy_cost_' as reshape's selected strategy;
2) when encountering reshape in BFS, select the 'swc index' with zero communication cost;
3) when encountering a reshape that is already visited, check whether there exists communication between reshape and current operator. It is OK if communication happens between two configured operators;
4) currently, two consecutive reshapes are not supported;
5) adjusting BFS structure in graph_costmodel.cc;
6) adjusting some code in step_auto_parallel.cc to avoid cyclomatic complexity.
4 years ago