fastNLP

Commit Graph

Author	SHA1	Message	Date
ROGERDJQ	f3ee16a5f6	[new] add ConfusionMatrix, ConfusionMatrixMetric (#272 ) * add ConfusionMatrix, ConfusionMatrixMetric * add confusionmatrix to utils * add ConfusionMatrixmetric * add ConfusionMatrixMetric * init for test * begin test * test finish * doc finish	6 years ago
yh_cc	4518a8fccb	[bug fix] test bug fix	6 years ago
yunfan	1139986b36	[bugfix] fix logger's logging level	6 years ago
yh_cc	9293a6c1ab	修复一个小bug 新增CheckPointCallback用于恢复Trainer的训练; CheckPointCallback会将optimizer，model以及Trainer的状态恢复到保存的epoch	6 years ago
yh_cc	faec3245d4	update CMRC2018Metric测试	6 years ago
Yige Xu	be432c3b39	fix a bug in ClassifyFPRMetric	6 years ago
Yige Xu	0547572d58	add ClassifyFPRMetric	6 years ago
yh	f887da12a1	解决CrossEntropyLoss测试因为数值问题无法通过测试的问题	6 years ago
yh	9c0190fbd8	新增对CrossEntropyLoss中class_in_dim的测试	6 years ago
yh	70d572cb3e	Merge branch 'dev0.5.0' of https://github.com/fastnlp/fastNLP into dev0.5.0	6 years ago
yh	7e41b9d355	1.新增CMRC2018的metric，loss，pipe，loader等; 2. 新增中文Bert的使用教程	6 years ago
Yige Xu	da2416a1b8	fix test bugs in: 1. use prettytable to print instance; 2. CNXNLI loader and pipe.	6 years ago
yunfan	c73f327a3e	[bugfix] fix test cases	6 years ago
ChenXin	4f0ec4a081	add the test for EarlyStopCallback, BUG found!	6 years ago
ChenXin	d8fa75b058	add the test for ControlC callback	6 years ago
ChenXin	e28ecb8b33	hide the EchoCallback	6 years ago
yunfan	05eb499eb8	[bugfix] dist_trainer's save & load	6 years ago
yunfan	29e4de36e3	[add] tutorial for callback [add] test case for logger, batch [bugfix] batch.py	6 years ago
Yige Xu	fccb6d9b1b	add __init__.py in test dir to solve file conflicts.	6 years ago
yh	5ebce3176f	增加部分测试	6 years ago
Yige Xu	5768cbbfef	add test code in AdamW	6 years ago
Yige Xu	9b2317e3e8	delete util functions and test cases in fastNLP/core/losses.py	6 years ago
Yige Xu	38d5b36be1	Update test_callbacks.py	6 years ago
Yige Xu	a3b4d5e76e	remove tensorboard logs after testing TensorboardCallback	6 years ago
yh_cc	8c8e22cc9b	DataSet中增加print_field_meta方法，使得其可以获取field的input和target信息	6 years ago
yh	1994029ab8	1.当前支持的encoding_type都支持从tag_vocab中自动判断;避免触发无意识导致的metric bug; 2. 修复部分inplace操作无法求导的问题; 3.Vocabulary将一些属性通过property暴露	6 years ago
Yige Xu	091f24e393	fix some bugs in test code.	6 years ago
xuyige	8142bad87a	Merge branch 'dev0.5.0' of https://github.com/fastnlp/fastNLP into dev0.5.0	6 years ago
xuyige	5d8a8c98c6	1. delete io/data_loader dir; 2. delete model/enas*; 3. delete legacy dir; 4. delete DateSetLoader and relevant codes; 5. fix a test code error in core/test_dataset.py; 6. delete io.BaseLoader and relevant code.	6 years ago
yh_cc	55e736bf4c	SpanFMetric增加对encoding_type和tag_vocab的检查	6 years ago
yunfan	cacf40366c	[fix] distributed trainer	6 years ago
yunfan	329a18976f	[update] distributed trainer, add evaluation part	6 years ago
yunfan	606d63a5a4	[update] distributed trainer	6 years ago
yunfan	861f5387a4	[add] very first version of distributed trainer	6 years ago
xuyige	c2d687528e	fix bugs and add test codes for: 1. models.snli; 2. core.metrics.extractive_qa; 3. io.data_loader.mnli	6 years ago
yh	22a8702d22	1. Trainer支持使用DistributedDataParallel训练; 但是还没有经过广泛测试，谨慎使用; 2. 修复import os bug; 3.FitlogCallback支持不传入任何DataSet; 4. NullOptimizer的construct_from_optimer返回self; 5. 修复Bert中pooled_cls的bug;“	6 years ago
yh_cc	1babf5384f	Vocabulary中no_create_entry的bug修复	6 years ago
yh_cc	e57b8e4fd3	seq_len_to_mask修复测试失败的问题	6 years ago
yh_cc	8f7ed07441	1. 在vocabulary的from_dataset中增加no_create_entry_dataset选项，用于传递dev和test 2. 调整各种Embedding的实现，使得确保来自dev和test的未发现词使用unk的表示 3. 在Embedding中增加dropout_word的选项，使得可以随机drop掉词语 4. 以及其它若干小的bug	6 years ago
yh	8a766f070b	seq_len_to_mask修改为直接使用max_len而不再和句中最大长度对比	6 years ago
yh_cc	2f5d8967a3	1. 适配将Batch修改为pytorch的DataLoader的修改 2. 修改embedding.py中的bug 3. ConllReader默认跳过所有的DOCSTART标签 4. 交换bert的heavy lifting到_bert, 将BertEncoder在bert.py中暴露 5. crf中allow_transition的include_end_start修改为false，以与CRF的默认值适配 6. allow_transition与SpanMetric支持BIOES类型的tag 7. datainfo中增加打印格式化输出	6 years ago
yh_cc	17b5fd0066	1. 删除Trainer中对train_data必须为DataSet的assert 2. 删除Trainer的prefetch参数; 在注释中增加num_workers参数 3. Trainer中默认sampler为RandomSampler	6 years ago
yunfan	efe3574014	Merge remote-tracking branch 'origin/dev0.5.0' into batch	6 years ago
yunfan	7564818f4b	[unstable] change Batch to torch's DataLoader	6 years ago
xuyige	1a4c3c2d20	fix some bugs in test	6 years ago
yh	e90bbbb3f1	！！！重要更新，DataSet理论上支持任意类型的数据了，但是因为改动非常大，所以可能会有bug	6 years ago
yh	0257dc6dde	修复Trainer无法捕获Exception的bug	6 years ago
yh	e40447cc84	禁止travis测试非 test之外的文件	6 years ago
ChenXin	881ce01762	Dev0.4.0 (#149 ) * 1. CRF增加支持bmeso类型的tag 2. vocabulary中增加注释 * BucketSampler增加一条错误检测 * 1.修改ClipGradientCallback的bug；删除LRSchedulerCallback中的print，之后应该传入pbar进行打印;2.增加MLP注释 * update MLP module * 增加metric注释；修改trainer save过程中的bug * Update README.md fix tutorial link * Add ENAS (Efficient Neural Architecture Search) * add ignore_type in DataSet.add_field * * AutoPadder will not pad when dtype is None * add ignore_type in DataSet.apply * 修复fieldarray中padder潜在bug * 修复crf中typo; 以及可能导致数值不稳定的地方 * 修复CRF中可能存在的bug * change two default init arguments of Trainer into None * Changes to Callbacks: * 给callback添加给定几个只读属性 * 通过manager设置这些属性 * 代码优化，减轻@transfer的负担 * * 将enas相关代码放到automl目录下 * 修复fast_param_mapping的一个bug * Trainer添加自动创建save目录 * Vocabulary的打印，显示内容 * * 给vocabulary添加遍历方法 * 修复CRF为负数的bug * add SQuAD metric * add sigmoid activate function in MLP * - add star transformer model - add ConllLoader, for all kinds of conll-format files - add JsonLoader, for json-format files - add SSTLoader, for SST-2 & SST-5 - change Callback interface - fix batch multi-process when killed - add README to list models and their performance * - fix test * - fix callback & tests * - update README * 修改部分bug；调整callback * 准备发布0.4.0版本“ * update readme * support parallel loss * 防止多卡的情况导致无法正确计算loss“ * update advance_tutorial jupyter notebook * 1. 在embedding_loader中增加新的读取函数load_with_vocab(), load_without_vocab, 比之前的函数改变主要在(1)不再需要传入embed_dim(2)自动判断当前是word2vec还是glove. 2. vocabulary增加from_dataset(), index_dataset()函数。避免需要多行写index dataset的问题。 3. 在utils中新增一个cache_result()修饰器，用于cache函数的返回值。 4. callback中新增update_every属性 * 1.DataSet.apply()报错时提供错误的index 2.Vocabulary.from_dataset(), index_dataset()提供报错时的vocab顺序 3.embedloader在embed读取时遇到不规则的数据跳过这一行. * update attention * doc tools * fix some doc errors * 修改为中文注释，增加viterbi解码方法 * 样例版本 * - add pad sequence for lstm - add csv, conll, json filereader - update dataloader - remove useless dataloader - fix trainer loss print - fix tests * - fix test_tutorial * 注释增加 * 测试文档 * 本地暂存 * 本地暂存 * 修改文档的顺序 * - add document * 本地暂存 * update pooling * update bert * update documents in MLP * update documents in snli * combine self attention module to attention.py * update documents on losses.py * 对DataSet的文档进行更新 * update documents on metrics * 1. 删除了LSTM中print的内容; 2. 将Trainer和Tester的use_cuda修改为了device; 3.补充Trainer的文档 * 增加对Trainer的注释 * 完善了trainer，callback等的文档; 修改了部分代码的命名以使得代码从文档中隐藏 * update char level encoder * update documents on embedding.py * - update doc * 补充注释，并修改部分代码 * - update doc - add get_embeddings * 修改了文档配置项 * 修改embedding为init_embed初始化 * 1.增加对Trainer和Tester的多卡支持; * - add test - fix jsonloader * 删除了注释教程 * 给 dataset 增加了get_field_names * 修复bug * - add Const - fix bugs * 修改部分注释 * - add model runner for easier test models - add model tests * 修改了 docs 的配置和架构 * 修改了核心部分的一大部分文档，TODO： 1. 完善 trainer 和 tester 部分的文档 2. 研究注释样例与测试 * core部分的注释基本检查完成 * 修改了 io 部分的注释 * 全部改为相对路径引用 * 全部改为相对路径引用 * small change * 1. 从安装文件中删除api/automl的安装 2. metric中存在seq_len的bug 3. sampler中存在命名错误，已修改 * 修复 bug ：兼容 cpu 版本的 PyTorch TODO：其它地方可能也存在类似的 bug * 修改文档中的引用部分 * 把 tqdm.autonotebook 换成tqdm.auto * - fix batch & vocab * 上传了文档文件 .rst 上传了文档文件和若干 TODO * 讨论并整合了若干模块 * core部分的测试和一些小修改 * 删除了一些冗余文档 * update init files * update const files * update const files * 增加cnn的测试 * fix a little bug * - update attention - fix tests * 完善测试 * 完成快速入门教程 * 修改了sequence_modeling 命名为 sequence_labeling 的文档 * 重新 apidoc 解决改名的遗留问题 * 修改文档格式 * 统一不同位置的seq_len_to_mask, 现统一到core.utils.seq_len_to_mask * 增加了一行提示 * 在文档中展示 dataset_loader * 提示 Dataset.read_csv 会被 CSVLoader 替换 * 完成 Callback 和 Trainer 之间的文档 * index更新了部分 * 删除冗余的print * 删除用于分词的metric，因为有可能引起错误 * 修改文档中的中文名称 * 完成了详细介绍文档 * tutorial 的 ipynb 文件 * 修改了一些介绍文档 * 修改了 models 和 modules 的主页介绍 * 加上了 titlesonly 这个设置 * 修改了模块文档展示的标题 * 修改了 core 和 io 的开篇介绍 * 修改了 modules 和 models 开篇介绍 * 使用 .. todo:: 隐藏了可能被抽到文档中的 TODO 注释 * 修改了一些注释 * delete an old metric in test * 修改 tutorials 的测试文件 * 把暂不发布的功能移到 legacy 文件夹 * 删除了不能运行的测试 * 修改 callback 的测试文件 * 删除了过时的教程和测试文件 * cache_results 参数的修改 * 修改 io 的测试文件; 删除了一些过时的测试 * 修复bug * 修复无法通过test_utils.py的测试 * 修复与pytorch1.1中的padsequence的兼容问题; 修改Trainer的pbar * 1. 修复metric中的bug; 2.增加metric测试 * add model summary * 增加别名 * 删除encoder中的嵌套层 * 修改了 core 部分 import 的顺序，__all__ 暴露的内容 * 修改了 models 部分 import 的顺序，__all__ 暴露的内容 * 修改了文件名 * 修改了 modules 模块的__all__ 和 import * fix var runn * 增加vocab的clear方法 * 一些符合 PEP8 的微调 * 更新了cache_results的例子 * 1. 对callback中indices潜在None作出提示;2.DataSet支持通过List进行index * 修改了一个typo * 修改了 README.md * update documents on bert * update documents on encoder/bert * 增加一个fitlog callback，实现与fitlog实验记录 * typo * - update dataset_loader * 增加了到 fitlog 文档的链接。 * 增加了 DataSet Loader 的文档 * - add star-transformer reproduction	7 years ago
FengZiYjun	986541139a	整理所有dataset loader，建立单元测试	7 years ago

1 2 3

143 Commits (eb74379fd99f5aa59e0011ff7c2eafece1922c63)