FengZiYjun
|
b93ca9bb30
|
* FieldArray添加对list of np.array的支持
* 添加测试:FieldArray的初始化
|
7 years ago |
FengZiYjun
|
e4f997d52a
|
refactor type system in FieldArray:
* 重构dtype的检测代码,在FieldArray的初始化和append两处,达到更好的代码复用
* 类型检测的责任完全落在FieldArray,DataSet与之配合
测试:
* 整理dtype相关的测试代码
* 给所有tutorial添加测试
其他:
* 完善一个完整的Conll dataset loader
* 升级POS tag model训练脚本
|
7 years ago |
yh
|
73dd35dabf
|
Merge branch 'dev' of github.com:choosewhatulike/fastNLP-private into dev
|
7 years ago |
yh
|
3e33a23042
|
修改Padder的测试用例
|
7 years ago |
yh
|
8091a734ee
|
1. 将pad的功能从FieldArray中剥离出来,使用Padder完成各种padding操作。
2. FieldArray默认使用AutoPadder, AutoPadder的行为与之前不使用padder是一致的的
3. 为了解决二维padding的问题,引入了EngChar2dPadder用于对character进行padding
4. 增加一份padding的tutorial。
|
7 years ago |
FengZiYjun
|
d80d944e40
|
* 添加callbacks:EarlyStopCallback
* 将dataset.py中的assert改为raise error
* 给trainer添加try-except,捕捉EarlyStopError
* 优化trainer代码
* 给callbacks添加测试
|
7 years ago |
FengZiYjun
|
1fdaf236d2
|
Updates:
* 改名: chinese_word_segment ---> Chinese_word_segmentation
* 改名: pos_tag_model ---> POS_tagging
* 添加4个对Batch的测试
* 删除无用的chinese_word_segment/run.py
|
7 years ago |
yunfan
|
2e9e6c6c20
|
- fix trainer with validate_every > 0
- refine & fix Transformer Encoder
- refine & speed up biaffine parser
|
7 years ago |
yunfan
|
62a7556a04
|
Merge remote-tracking branch 'private/dev' into dev
# Conflicts:
# fastNLP/api/api.py
# fastNLP/modules/encoder/variational_rnn.py
|
7 years ago |
yunfan
|
ba28702e68
|
update Biaffine Parser, Variational RNN
add parser API
|
7 years ago |
FengZiYjun
|
c0b67a2bc9
|
fix tests
|
7 years ago |
FengZiYjun
|
179d12327a
|
add callback in Trainer
|
7 years ago |
FengZiYjun
|
1f4d784068
|
Merge branch 'dev' into local-dev
# Conflicts:
# fastNLP/core/dataset.py
|
7 years ago |
yunfan
|
c91696e1ee
|
update parser, optimize embed_loader
|
7 years ago |
yh
|
0c7e7519fc
|
conflict fix
|
7 years ago |
yh
|
897c43fc3b
|
1. CRF增加constrain, 用于限制跃迁,比如BMES中B不能跃迁到S
2. metric增加SpanFMetric,可以用于计算sequence labelling的performance
3. 分词复现任务根据新版接口做了部分调整。
|
7 years ago |
FengZiYjun
|
5d8f6960a7
|
* fix README figure
* refine code style
|
7 years ago |
hazelnutsgz
|
5f4ab131ac
|
Add a loader for conll2003 dataset
|
7 years ago |
FengZiYjun
|
337e3035b3
|
* update most processors to use dataset.apply
* fix failed tests
|
7 years ago |
FengZiYjun
|
23305af733
|
Fix failed tests.
|
7 years ago |
FengZiYjun
|
db0a789d61
|
* final clean up
* remove conflicts
* all tests passed
|
7 years ago |
FengZiYjun
|
720a264eb3
|
* rename DataSet.get_fields() into get_all_fields()
* add DataSet.get_field(), to fetch a FieldArray based on its name
* remove old tutorials & add new tutorials
|
7 years ago |
FengZiYjun
|
447746d9f5
|
* remove unused codes in losses.py & metrics.py
* refine code style
* fix tests
* add a new tutorial
|
7 years ago |
FengZiYjun
|
27e9453d19
|
* fix processor.py
* add code comments
* merge *_saver.py & *_loader.py in io/
* (ancient codes) rename Loss into LossFromTorch
|
7 years ago |
yh
|
306eee9690
|
Merge branch 'trainer' of github.com:FengZiYjun/fastNLP into trainer
|
7 years ago |
yh
|
cd83866527
|
bug fix in LossInForward
|
7 years ago |
FengZiYjun
|
6129a31c1d
|
* fix tests
* clean up unused codes
|
7 years ago |
yh
|
aea931812b
|
1. trainer中losser修改为loss
|
7 years ago |
yh
|
1158556236
|
1. 优化trainer checkcode过程的报错信息
|
7 years ago |
yh
|
e779409cf8
|
conflict fix
|
7 years ago |
yh
|
4dff3ec81f
|
修改losses中直接使用F.cross_entropy的情况,因为这些函数的signature是(input, target)
|
7 years ago |
FengZiYjun
|
f26f11608b
|
* 更新教程,放在在./tutorial
* remove unused codes in metrics.py
* add tests for DataSet
* add tests for FieldArray
* add tests for metrics.py
* fix predictor, add tests for predictor
* fix bucket sampler, add tests for bucket sampler
|
7 years ago |
yh
|
7c261faf19
|
Merge branch 'trainer' of github.com:FengZiYjun/fastNLP into trainer
|
7 years ago |
yh
|
87e5d44b01
|
。
|
7 years ago |
yunfan
|
52b1b18a76
|
fix bugs in vocab
|
7 years ago |
yh
|
79ae38738f
|
conflict solved
|
7 years ago |
yh
|
62c63f159a
|
test loss
|
7 years ago |
FengZiYjun
|
27833d06ae
|
FieldArray only check type when is_input or is_target is set.
|
7 years ago |
FengZiYjun
|
9acdb54fc8
|
优化loss在missing和duplicate时报错的信息:返回loss初始化约定接受的key
|
7 years ago |
yh
|
abe5ec7261
|
Merge branch 'trainer' of github.com:FengZiYjun/fastNLP into trainer
# Conflicts:
# test/core/test_trainer.py
|
7 years ago |
yh
|
a1a41c2d8b
|
1. unused报错运行错误修复
2. loss中修复一个错误
3. metric中fast_param调整
|
7 years ago |
FengZiYjun
|
4b099bb0dd
|
* add tqdm in requirements.txt
* fix FieldArray type check bugs
|
7 years ago |
FengZiYjun
|
661780b975
|
Improve FieldArray. Support nested list and a list of np.array
|
7 years ago |
yh
|
1fb1df4a31
|
1. metric修改fast_param
2. trainer中update_every改为print_every, 因为update_every可能引起optimizer update的误解
3. fieldarray content支持使用np.ndarray初始化
|
7 years ago |
yh
|
785c41ded5
|
Merge branch 'trainer' of github.com:FengZiYjun/fastNLP into trainer
# Conflicts:
# fastNLP/core/dataset.py
# fastNLP/core/trainer.py
# test/core/test_trainer.py
Trainer support print_train and tqdm train.
|
7 years ago |
yh
|
beb55f5288
|
* change trainer iterating into tqdm
|
7 years ago |
yunfan
|
ad3c5b6ef0
|
add magic iter in dataset
|
7 years ago |
FengZiYjun
|
513876d5db
|
Updates:
* fix losses的_fast_param_map的bug
* Trainer添加sampelr初始化参数,并调整参数顺序
* refine codes
|
7 years ago |
FengZiYjun
|
6f58ec34b4
|
Updates:
* DataSet修改__repr__,优化print(datset)的输出
* Instance修改__repr__,优化print的输出
* Optimizer优化传参提示
* Trainer去除kwargs参数
* losses.py加个参数
* 对应test code的修改
|
7 years ago |
FengZiYjun
|
f62060339e
|
All tests pass. Ready to merge.
* 更新Loss的接口形参跟metric保持一致
* 添加对几种loss的测试
* embed_loader采用维度独立的方法采样
* 对应测试代码的修改
|
7 years ago |