yh_cc
06891cf90a
Add comments and modify some code
6 years ago
yh_cc
6e265e5ae9
Improve the documentation for trainer, callback, etc.; rename some code so that it is hidden from the docs
6 years ago
xuyige
55783314bc
update documents on metrics
6 years ago
yh_cc
7997dce8a7
Update the DataSet documentation
6 years ago
yh_cc
66ad7d9bf9
Merge branch 'dev' of github.com:choosewhatulike/fastNLP-private into dev
6 years ago
yh_cc
28ece53df0
Save local work in progress
6 years ago
yunfan
001586fa3e
- add document
6 years ago
yh
2c202bb151
Test documentation
6 years ago
yunfan
abff8d9daa
- fix test_tutorial
6 years ago
yunfan
e025350ea8
Merge branch 'dev' of https://github.com/choosewhatulike/fastNLP-private into pr
6 years ago
yunfan
c344f7a2f9
- add pad sequence for lstm
- add csv, conll, json filereader
- update dataloader
- remove useless dataloader
- fix trainer loss print
- fix tests
6 years ago
yh_cc
c1ee0b27df
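1. DataSet.apply() reports the index of the failing instance when an error occurs
2. Vocabulary.from_dataset() and index_dataset() report the vocab order when an error occurs
3. embedloader skips a line when it encounters irregular data while reading embeddings.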
6 years ago
yh_cc
b69f8985c8
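1. Add new loading functions load_with_vocab() and load_without_vocab to embedding_loader. The main changes from the previous functions: (1) embed_dim no longer needs to be passed in; (2) the format (word2vec or glove) is detected automatically.
2. Add from_dataset() and index_dataset() to vocabulary, so that indexing a dataset no longer takes multiple lines.
3. Add a cache_result() decorator to utils for caching a function's return value.
4. Add an update_every attribute to callback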
6 years ago
yh_cc
29f81e79ad
Prepare to release version 0.4.0
6 years ago
yh
dffd9b96cd
Resolve merge conflicts
6 years ago
yh
4d1721ffe3
Fix some bugs; adjust callback
6 years ago
yunfan
f4e64906d4
- fix callback & tests
6 years ago
yunfan
e12041513f
Merge remote-tracking branch 'private/dev' into pr
# Conflicts:
# fastNLP/core/callback.py
# fastNLP/core/trainer.py
6 years ago
yunfan
58f373d371
- fix test
6 years ago
yunfan
70fb4a2284
- add star transformer model
- add ConllLoader, for all kinds of conll-format files
- add JsonLoader, for json-format files
- add SSTLoader, for SST-2 & SST-5
- change Callback interface
- fix batch multi-process when killed
- add README to list models and their performance
6 years ago
yh
ec90a1f0bb
Merge branch 'dev' of github.com:choosewhatulike/fastNLP-private into dev
6 years ago
yh
e5f68bbd5b
Fix the bug where CRF produces a negative value
6 years ago
FengZiYjun
6a498bbdf2
* Add an iteration method to vocabulary
6 years ago
FengZiYjun
f5ab7a5d45
* Move ENAS-related code into the automl directory
* Fix a bug in fast_param_mapping
* Trainer now creates the save directory automatically
* Printing a Vocabulary now shows its contents
6 years ago
FengZiYjun
ef0c6e936d
Changes to Callbacks:
* Add several predefined read-only attributes to callback
* Set these attributes through the manager
* Optimize code to reduce the burden on @transfer
6 years ago
yh
f2d7d01bb7
Fix a possible bug in CRF
6 years ago
yunfan
7c7f28f2ac
- add star-transformer
6 years ago
FengZiYjun
95a72f06b9
* AutoPadder will not pad when dtype is None
* add ignore_type in DataSet.apply
6 years ago
FengZiYjun
3e01b14249
add ignore_type in DataSet.add_field
6 years ago
chenkaiyu1997
251088ac4b
Add ENAS (Efficient Neural Architecture Search)
6 years ago
chenkaiyu1997
efeac2c427
Add ENAS (Efficient Neural Architecture Search)
6 years ago
FengZiYjun
0c5630bd16
Ready for V0.3.1
* Upgrade the parser API and model
* update docs: add new pages for tutorials
* upgrade CWS api download source
* add a new method for dataset field access
* add introduction for bert
* add more unit tests for api/processor
* remove unused test data. Add new test data.
6 years ago
FengZiYjun
986541139a
Organize all dataset loaders and add unit tests
6 years ago
FengZiYjun
bfaf09df8c
add BERT model
* load pre-trained BERT weights from local binary
* add tests
6 years ago
FengZiYjun
887fc9281f
update callbacks:
* rename callback methods. Use fastai's notation.
* add a new callback method - on_valid_begin
6 years ago
FengZiYjun
f3cb812554
Turn tensorboardX handling into a callback; remove tensorboardX-related code from trainer
6 years ago
FengZiYjun
62ea4f7fed
Add LR finder: use the first epoch to find the best lr, then start training from the second epoch
6 years ago
FengZiYjun
ab953b43ab
* Refactor the POS API to accept words as input
* Add two kinds of Callback
* Improve Trainer's error catching
6 years ago
yh_cc
d9ac334409
Reduce the overhead of repeatedly creating processes in batch
6 years ago
yh_cc
2e3ef52a7d
Enhance batch into a multi-process batch
6 years ago
FengZiYjun
864c2238f8
Add FieldArray support for list of np.array
6 years ago
FengZiYjun
b93ca9bb30
* Add FieldArray support for list of np.array
* Add tests: FieldArray initialization
6 years ago
FengZiYjun
e4f997d52a
refactor type system in FieldArray:
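* Refactor the dtype detection code so that FieldArray initialization and append share it, for better code reuse
* Type checking is now entirely the responsibility of FieldArray; DataSet cooperates with it
Tests:
* Organize dtype-related test code
* Add tests for all tutorials
Other:
* Complete a full Conll dataset loader
* Upgrade the POS tag model training script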
6 years ago
yh
73dd35dabf
Merge branch 'dev' of github.com:choosewhatulike/fastNLP-private into dev
7 years ago
yh
3e33a23042
Modify the Padder test cases
7 years ago
yh
8091a734ee
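1. Separate the padding functionality from FieldArray; Padder now handles all padding operations.
2. FieldArray uses AutoPadder by default; AutoPadder behaves the same as before, when no padder was used.
3. To solve the 2D padding problem, introduce EngChar2dPadder for padding characters.
4. Add a padding tutorial.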
7 years ago
FengZiYjun
d80d944e40
* Add callbacks: EarlyStopCallback
* Change the asserts in dataset.py to raise errors
* Add try-except to trainer to catch EarlyStopError
* Optimize trainer code
* Add tests for callbacks
7 years ago
FengZiYjun
1fdaf236d2
Updates:
* Rename: chinese_word_segment ---> Chinese_word_segmentation
* Rename: pos_tag_model ---> POS_tagging
* Add 4 tests for Batch
* Delete the unused chinese_word_segment/run.py
7 years ago
yunfan
2e9e6c6c20
- fix trainer with validate_every > 0
- refine & fix Transformer Encoder
- refine & speed up biaffine parser
7 years ago
yunfan
62a7556a04
Merge remote-tracking branch 'private/dev' into dev
# Conflicts:
# fastNLP/api/api.py
# fastNLP/modules/encoder/variational_rnn.py
7 years ago