fastNLP

Commit Graph

Author	SHA1	Message	Date
FengZiYjun	ec9fd32d60	improve trainer: log mean and std of model params, and sum of gradients	7 years ago
FengZiYjun	cd68d78d50	Merge remote-tracking branch 'origin/dataset' into dataset # Conflicts: # fastNLP/api/pipeline.py # fastNLP/api/pos_tagger.py # fastNLP/api/processor.py # fastNLP/modules/decoder/CRF.py	7 years ago
FengZiYjun	26e3abdf58	- 修改pos tag训练脚本，可以跑 - 在api中创建converter.py - Pipeline添加初始化方法，方便一次性添加processors - 删除pos_tagger.py - 优化整体code style	7 years ago
yunfan	a6ab34fd38	fix crf	7 years ago
yh_cc	3cb98ddcf2	Sampler中增加了一个BucketSampler, CWS的训练基本可以实现	7 years ago
yh_cc	69a138eb18	修改了遇到的若干问题，增加了分词任务的一些方法	7 years ago
yh_cc	dc0124cf02	修改model到models	7 years ago
yh_cc	25a53ac5c9	修改processor适配昨天的sao操作	7 years ago
yh	d818e91380	增加dataset自动创建对应的array	7 years ago
yh	515e4f4987	移动processor到processor.py	7 years ago
yh	89ce85b6ed	Merge branch 'dataset' of https://github.com/yhcc/fastNLP into dataset	7 years ago
yh	38aa207ea2	新增cws converter, io	7 years ago
FengZiYjun	12e9a93b52	Merge remote-tracking branch 'origin/dataset' into dataset	7 years ago
FengZiYjun	79105381f5	- add interfaces for pos_tagging API - update predictor.py to remove unused methods - update model_loader.py & model_saver.py to support entire model saving & loading - update pos tagging training script	7 years ago
yh	1b9daa1985	新增CWS的部分功能	7 years ago
yunfan	053249420f	update parser, fix bugs varrnn & vocab	7 years ago
yunfan	3192c9ac66	update trainer	7 years ago
yunfan	c14d9f4d66	update biaffine	7 years ago
yunfan	830d223344	add transformer	7 years ago
yunfan	102259df39	update biaffine parser	7 years ago
yunfan	a4c9786ca4	update dataset & loader	7 years ago
yunfan	637c37d62b	add new model, new module, fix bugs	7 years ago
FengZiYjun	fb806163c3	remove unused codes; add more tests	7 years ago
FengZiYjun	cc15588a77	- add progress bar for data set loading - improve metrics codes - fix validator bugs in trainer; remove early saving - run CWS codes - improve README.md	7 years ago
FengZiYjun	0b86d7cf2b	Merge Preprocessor and DataSet	7 years ago
FengZiYjun	cb11a1f2dc	- analyze codes for language model, unable to run yet - add character vocab in preprocessor - add dataset loader for language model dataset - other minor adjustments - preserve only a little example data for language model	7 years ago
FengZiYjun	28a0683853	1. add tests in test_fastNLP.py & test_sampler.py; increase test coverage to 81% 2. changes of names: aggregation ----> aggregator interaction ----> interactor action.py ----> sampler.py BasePreprocess ---> Preprocessor BaseTester ----> Tester BaseTrainer ----> Trainer 3. add more code comments 4. fix bugs in predictor's data_forward 5. in sampler.py, remove Bachifier, fix some codes. but not test 6. remove unused codes in other_modules.py & utils.py 7. update fastnlp.py with new config file names and code comments 8. add data examples in data_for_tests/	7 years ago
yunfan	819c8f05be	fix vocab	7 years ago
yunfan	8f60a4fa01	update MLP	7 years ago
2017alan	b3e8db74a6	add self_attention for yelp classification example.	7 years ago
FengZiYjun	57911f771a	- clean up unused codes - improve code comments - BaseLoader & its subclasses does not need a data name any more - update file tree - add setup.py	7 years ago
FengZiYjun	32a036e8e6	[fix] drop "data" in Tester.make_batch; correct spelling of "show_metrics" [add] PeopleDailyCorpusLoader, to parse PeopleDaily Corpus [update] add CWS + POS_tag interface at FastNLP, see example in test_fastNLP.py [update] modify README.md and readme_example.py to the latest version.	7 years ago
FengZiYjun	501ffb26c5	optimize CWS example - see test_fastNLP.py - update interpret_word_seg_results in fastnlp.py - delete useless data to increase git clone speed	7 years ago
FengZiYjun	ab55f25e20	Updates to Trainer/Tester/fastnlp 1. Tester has a parameter "print_every_step" to control printing. print_every_step == 0 means NO print. 2. Tester's evaluate return (list of) floats, rather than torch.cuda.tensor 3. Trainer also has a parameter "print_every_step". The same usage. 4. In training, validation steps are not shown. 5. Updates to code comments. 6. fastnlp.py is ready for CWS. test_fastNLP.py works.	7 years ago
FengZiYjun	9d6b0daa99	Prepare for CWS service: - specify the name of the config file and the name of corresponding section where model init params store. - fastnlp.py needs load_pickle to get dictionary size and the number of labels - other minor adjustments	7 years ago
Coet	aea53c1833	Merge pull request #43 from FengZiYjun/master New Trainer Initialization Interface	7 years ago
FengZiYjun	2df8eb740a	Updates to core, loader: - add Loss, Optimizer - change Trainer & Tester initialization interface: two styles of definition provided - handle Optimizer construction and loss function definition in a hard manner - add argparse in task-specific scripts. (seq_labeling.py & text_classify.py) - seq_labeling.py & text_classify.py work	7 years ago
Coet	ceac3f2e1f	Merge pull request #38 from FengZiYjun/new_updates New updates	7 years ago
FengZiYjun	4c8c2dfdb8	updates to core, loader, test: - move preprocess.py from loader/ to core/ - changes to interface of preprocess: 1. add run method, to run the main processing 2. add cross validation split 3. add return value 4. merge subclasses - Trainer supports cross validation - add data as arguments in Trainer.train & Tester.test - add readme.example.py, to run the example program shown in README.md - other corresponding changes	7 years ago
Coet	fc7dd7eced	Merge pull request #33 from FengZiYjun/master Updates to cores, loader, saver	7 years ago
choosewhatulike	fb20e87321	add chinese word segmentation model	7 years ago
FengZiYjun	4bbeaebe96	Updates to cores, action, loader: - rename Inference to Predictor - rename Trainer.prepare_input to Trainer.load_train_data, load data_train.pkl only - add __contains__ method to config Section class - more code comments - more elegant make_batch & data_iterator: Samplers return batch samples instead of batch indices	7 years ago
FengZiYjun	8e6db05339	changes to Trainer, Tester & Inference: - rename "POSTrainer", "POSTester" to "SeqLabelTrainer", "SeqLabelTester" - Trainer & Tester have NO relation with Action - Inference owns independent "make_batch" & "data_forward" - Conversion to Tensor & go into cuda are done in "make_batch" - "make_batch" support maximum/minimum length	7 years ago
FengZiYjun	c1d7c5d7da	changes to action, trainer and tester: - rename "POSTrainer" to "SeqLabelTrainer" - add text classification test data - update make_batch in Trainer and Tester	7 years ago
FengZiYjun	743a6d7547	fix bugs in preprocessor	7 years ago
FengZiYjun	ef8ec3b9e4	add cws train script and corresponding config file	7 years ago
FengZiYjun	242e576a30	changes to trainer, tester, preprocessor, etc. - [tester][trainer] add cuda support - [preprocess] fix label2index for padding label seq - update README.md - [test] add test_tester.py - rename "action" to "core"	7 years ago
FengZiYjun	eb66cbe6c4	restructure module: 4 classes; add modules; move prototype and rename	7 years ago
FengZiYjun	7514be6f30	- add validation loss into trainer.train - restructure: move reproduction outside - add evaluate in tester	7 years ago
FengZiYjun	32652407df	restructure files & add "modules" directory & add CRF.py	7 years ago

1 2

58 Commits (ec9fd32d6070330c8b8a6499113ee8d5abf91b21)