Yingda Chen
a82dbb8f97
Merge pull request #33 from modelscope/codegeex_code_translation
CodeGeex code translation and generation
ut failed due to a known run.py environment setup issue that is being fixed. nothing to do with the change itself.
3 years ago
shuaigezhu
028551cd62
add code_generation files
3 years ago
shuaigezhu
c9064caa58
add code_generation
3 years ago
pengzhendong
eb2ef3a1cf
[lint] fix lint
3 years ago
pengzhendong
2605824dea
[tests] add unittest
3 years ago
shuaigezhu
1ab8a1f764
updated
3 years ago
shuaigezhu
f171552ee3
updated
3 years ago
pengzhendong
2e30caf1e6
[pipelines] add wenetruntime
3 years ago
shuaigezhu
db0f25a594
init
3 years ago
leyuan.hjy
1a127ed997
fix outputs none
fix 输出为空的情况
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10725840
3 years ago
mulin.lyh
90a5efa1c2
[to #46106568 ]feat: parallel run ci case
3 years ago
shuying.shu
10926a06d4
fix bug for downloading hugging face pretrained model http error
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10750369
3 years ago
bin.xue
3798677395
[to #42322933 ] bug fix: deadlock when setting the thread number up to 90
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10743508
* fix: load model directly from .pth
3 years ago
james.wjg
541e460f8b
add support for cv/language_guided_video_summarization
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10636269
3 years ago
yingda.chen
4e4faa9a30
specifiy file encoding when open text for read
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10708723
3 years ago
hemu.zp
0f0fdcae6f
[to #42322933 ] Fix bug for mplug evaluation
修复了 mplug evaluation 使用了错误的 metrics 的问题,将部分中文处理代码独立到 utils 中,为 mplug 添加 trainer
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10612875
3 years ago
bin.xue
e1dd996460
fix: failed to update sc_config_file concurrently
3 years ago
yichang.zyc
5f1b9a6218
add default config and fix proprocess detokenizer
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10603232
3 years ago
mulin.lyh
21383fa25e
Merge branch library_api_tag_ci into master
Title: api tagging for pipeline/train/evaluate
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10588387
3 years ago
jiangyu.xzy
cc76d900bc
add model name to baseModel. use model name as tag
3 years ago
干劲
943478de63
Update
3 years ago
干劲
f5c31b3319
Add miss init
3 years ago
yichang.zyc
9187103e3a
[to #42322933 ]兼容新增clip huge模型
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10585552
* compatiable with vit huge, and set clip base default mm-ebed pipeline
3 years ago
xiaodongdeng.dxd
64868bf2ad
[to #42322933 ]多模态预训练模型OFA增加支持6b模型的feature
多模态预训练模型OFA增加支持6b模型的feature
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10574571
3 years ago
yuanzheng.yuanzhen
ce08cfbea8
[to #42322933 ]Add licenses
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10580553
* Add licenses
3 years ago
yichang.zyc
e2d35fbb14
[to #42322933 ]clip支持finetune
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10572842
3 years ago
yuze.zyz
4b7e8e89aa
[to #42322933 ] Fix some bugs when downgrade the version of some dependencies
1. Fix bug in model exporting
2. Skip some long trainings in test level 2
3. Refine some comments
4. Fix a bug that mode is not correct when saving checkpoints
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10564716
3 years ago
Yufeng
261c04b8b5
add Mglm ( #5 )
* mglm init
* add mglm requirements
Co-authored-by: Yufeng <zhuyufeng@gmail.com>
Co-authored-by: wenmeng.zwm <wenmeng.zwm@alibaba-inc.com>
3 years ago
xianzhe.xxz
88e8d4291a
[to #42322933 ]"fix: set the eps and momentum of BN consistent with training"
To keep consistent between training and evaluation, change the eps and momentum of BN.
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10554451
3 years ago
menrui.mr
c7b0787049
修复初始化过程参数未生效问题
此前文生图模型没有加载configuration.json中的参数 影响默认配置
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10558026
3 years ago
hemu.zp
fa415d8720
[to #42322933 ] Fix bug for bloom and gpt_neo
1. 修复 bloom 和 gpt_neo 模型更新 transformers 4.23 后后处理报错的问题
2. 统一使用 ModelOutput 作为模型输出
3. gpt_neo checkpoint 已上线,修改 ut 为 level2
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10553103
3 years ago
zhangzhicheng.zzc
b713e3de1c
[to #42322933 ]fix token classification bugs
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10550136
3 years ago
yuze.zyz
212cf53318
[to #42322933 ] Fix some bugs
1. Add F1 score to sequence classification metric
2. Fix a bug that the evaluate method in trainer does not support a pure pytorch_model.bin
3. Fix a bug in evaluation of veco trainer
4. Add some tips if lr_scheduler in the trainer needs a higher version torch
5. Add some comments
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10532230
3 years ago
shuying.shu
ddcb57440d
[to #42322933 ]add fine-tune code for referring video object segmentation
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10539423
3 years ago
hemu.zp
69104c0f8a
[to #42322933 ] Refactor text generation model outputs and fix some bugs
1. 将 single_gpu_test 与 multi_gpu_test 中的 model.forward 部分分离为 EpochBasedTrainer 中的 evaluation_step,为部分 evaluation 阶段不调用 forward 的模型提供更好的灵活性
2. 重构代码将文本生成模型 Model 层的输入输出统一为 Tensor,Tensor 到 str 的 decode 过程移动到 pipeline 中完成
3. pipeline 后处理添加对中文和中文标点与英文混杂时空格的处理,使 decode 后中英文混杂输出正确
4. 添加 TextGenerationTrainer 修复了部分模型 evaluation 过程 forward 输出单个 token 计算 metrics 的问题
5. 修复了 rouge 无法接收空字符串的问题
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10473768
3 years ago
hemu.zp
d0f8547e7e
[to #42322933 ] Fix gpt3 loading checkpoint after finetuning.
1. 修复GPT-3模型无法加载finetune保存的checkpoint的问题
2. 为GPT-3诗词生成模型添加 ut
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10537209
3 years ago
ran.zhou
13f7e9ceca
[to #42322933 ]SEA multilingual NLP (NER & word segmentation)
添加东南亚小语种NLP支持,包括:
1. 针对泰语,越南语NER的预处理
2. 基于XLMR-CRF架构的分词模型和pipeline
3. 针对泰语分词的预处理
添加了相应pipeline的unittest
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10492404
3 years ago
zhangyanzhao.zyz
781fe49d63
[to #42322933 ]修正finetune text ranking bugs
之前的finetune代码当dataset最后长度不足制定batch size时会出错,现已修正
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10524066
3 years ago
yuanzheng.yuanzhen
bab54bbce8
[to #42322933 ]support uni fold
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10481410
3 years ago
siyang.ssy
ba3db0f552
[to #42322933 ] fix video embedding output
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10525516
3 years ago
tingwei.gtw
d40cc98994
[to #42322933 ] update IO for demo services
修改了I/O的代码,以支持modelscope的demo services
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10518318
3 years ago
yuze.zyz
c2da44b371
[to #42322933 ] remove dev model inference and fix some bugs
1. Change structbert dev revision to master revision
2. Fix bug: Sample code failed because the updating of model configuration
3. Fix bug: Continue training regression failed
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10519992
3 years ago
lllcho.lc
41b35619e8
[to #42322933 ] Fix bug for demo service
在demo service场景,同时调用同一个视频文件,会导致ffmpeg处理同名视频的冲突。通过uuid生成唯一的文件名解决这个冲突。
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10518178
3 years ago
hemu.zp
ffd834fc25
[to #42322933 ] Add bloom model
添加 bloom 模型
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10509187
3 years ago
yichang.zyc
6ddafb3218
[to #42322933 ]caption finetune done, add belu metric
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10318299
3 years ago
yuze.zyz
605cd7f44a
[to #42322933 ] NLP 1030 Refactor
Features:
1. Refactor the directory structure of nlp models. All model files are placed into either the model folder or the task_model folder
2. Refactor all the comments to google style
3. Add detail comments to important tasks and nlp models, to list the description of the model, and its preprocessor&trainer
4. Model Exporting now supports a direct all to TorchModelExporter(no need to derive from it)
5. Refactor model save_pretrained method to support direct running(independent from trainer)
6. Remove the judgement of Model in the pipeline base class, to support outer register models running in our pipelines
7. Nlp trainer now has a NLPTrainingArguments class , user can pass arguments into the dataclass, and use it as a normal cfg_modify_fn, to simplify the operation of modify cfg.
8. Merge the BACKBONES and the MODELS, so user can get a backbone with the Model.from_pretrained call
9. Model.from_pretrained now support a task argument, so user can use a backbone and load it with a specific task class.
10. Support Preprocessor.from_pretrained method
11. Add standard return classes to important nlp tasks, so some of the pipelines and the models are independent now, the return values of the models will always be tensors, and the pipelines will take care of the conversion to numpy and the following stuffs.
12. Split the file of the nlp preprocessors, to make the dir structure more clear.
Bugs Fixing:
1. Fix a bug that lr_scheduler can be called earlier than the optimizer's step
2. Fix a bug that the direct call of Pipelines (not from pipeline(xxx)) throws error
3. Fix a bug that the trainer will not call the correct TaskDataset class
4. Fix a bug that the internal loading of dataset will throws error in the trainer class
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10490585
3 years ago
siyang.ssy
6d51f44dc7
[to #42322933 ]fix input type for video embeding
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10506601
3 years ago
leyuan.hjy
8b28b725ee
[to #42322933 ] video detecor support output with timestamp
适配demoservice,增加视频时间戳输出,每一个结果对应一个时间戳
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10496879
3 years ago
caorongyu.cry
35644fa0a7
[to #42322933 ] change star3 to space_T_cn
1. 合并star和star3框架
2. 修改star和star3的model type
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10492793
3 years ago
zhangzhicheng.zzc
182ba1768f
[to #42322933 ]support multi tasks for part of speech
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10491994
3 years ago