- DataSet's __init__ takes a function as an argument, rather than a class object (see the sketch after this list)
- Preprocessor is about to be removed; don't use it anymore.
- Remove cross_validate from Trainer, because it is rarely used and weird
- Loader.load is expected to be a static method (a sketch follows this section)
- Delete some code in other_modules.py
- Add more tests
- Delete extra sample data
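A minimal sketch of the first point above; the class body and the `build_fn` name are hypothetical, not the actual fastNLP API:

```python
class DataSet(list):
    """Hypothetical sketch: __init__ receives a function, not a class object."""
    def __init__(self, examples=(), build_fn=None):
        # build_fn turns one raw example into an instance; previously a
        # class object was passed in and instantiated internally.
        build_fn = build_fn or (lambda e: e)
        super().__init__(build_fn(e) for e in examples)

# the caller supplies the conversion function directly
ds = DataSet([("I love NLP", "pos")],
             build_fn=lambda pair: {"text": pair[0], "label": pair[1]})
```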
2. name changes:
aggregation ----> aggregator
interaction ----> interactor
action.py ----> sampler.py
BasePreprocess ----> Preprocessor
BaseTester ----> Tester
BaseTrainer ----> Trainer
3. add more code comments
4. fix bugs in predictor's data_forward
5. in sampler.py, remove Batchifier and fix some code, but it is not yet tested
6. remove unused code in other_modules.py & utils.py
7. update fastnlp.py with new config file names and code comments
8. add data examples in data_for_tests/
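As a sketch of the Loader.load convention noted above (the file format and method body are made up for illustration):

```python
class Loader:
    """Hypothetical sketch: load is a @staticmethod, so no instance is needed."""
    @staticmethod
    def load(path):
        with open(path, encoding="utf-8") as f:
            return f.read().splitlines()

# callable directly on the class, e.g. Loader.load("data_for_tests/sample.txt")
```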
- apply DataSet in Predictor; remove sub-predictors; add a "task" argument to specify which task to predict, as Trainer/Tester do.
- remove Action class
- add helper function for DataSet, to create DataSet easily
- more code comments
- clean up unnecessary code
- add unit tests for Batch, Predictor, Preprocessor, Trainer, Tester
- update LabelField's to_tensor method to support int & str single labels (see the sketch after this list)
- update preprocessor's convert_to_dataset method to support single label inputs
- introduce "task" in Trainer/Tester's data_forward, Tester's evaluate and metrics methods
- in cnn_text_classification.py, rename forward's argument
- in sequence_modeling.py, rename forward's argument
- minor adjustments in test code
- text_classify.py works
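A hypothetical sketch of the LabelField.to_tensor behavior described above (the vocab lookup is an assumption):

```python
import torch

class LabelField:
    """Hypothetical sketch: to_tensor handles both int and str single labels."""
    def __init__(self, label, vocab=None):
        self.label = label
        self.vocab = vocab or {}          # assumed str-label -> int-index map

    def to_tensor(self):
        if isinstance(self.label, int):   # already an index
            idx = self.label
        elif isinstance(self.label, str): # look the string up in the vocab
            idx = self.vocab[self.label]
        else:
            raise TypeError("expected int or str label, got %r" % type(self.label))
        return torch.LongTensor([idx])

assert LabelField(3).to_tensor().item() == 3
assert LabelField("pos", {"pos": 1}).to_tensor().item() == 1
```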
- add DataSet, Instance, Field to represent data at different levels
- encapsulate the batching method in a Batch class (see the sketch after this list)
- modify samplers in action.py to fit Batch
- preprocessor.run returns a DataSet instead of a list
- Use Batch in Trainer/Tester
- add required_arg "task" in Trainer/Tester
- remove SeqLabelTrainer/SeqLabelTester dependencies successfully. They are now empty classes, kept only for deprecation.
- modify the SeqLabeling model: add another argument to forward, in order to compute the mask inside the model (see the sketch after this list)
- test/model/seq_labeling.py works
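A toy sketch of what encapsulating batching in a Batch class means (simplified: no padding or tensor conversion):

```python
class Batch:
    """Hypothetical sketch: iterate over a dataset in fixed-size batches."""
    def __init__(self, dataset, batch_size):
        self.dataset = dataset
        self.batch_size = batch_size

    def __iter__(self):
        for start in range(0, len(self.dataset), self.batch_size):
            yield self.dataset[start:start + self.batch_size]

for batch in Batch(list(range(10)), batch_size=4):
    print(batch)   # [0..3], [4..7], [8, 9]
```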
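And a sketch of computing the mask inside the model from a sequence-length argument (the helper name and shapes are assumptions):

```python
import torch

def make_mask(seq_len, max_len):
    """Hypothetical helper: 1 for real tokens, 0 for padding positions."""
    positions = torch.arange(max_len).unsqueeze(0)      # shape (1, max_len)
    return (positions < seq_len.unsqueeze(1)).float()   # broadcast to (batch, max_len)

print(make_mask(torch.LongTensor([3, 5]), max_len=5))
# tensor([[1., 1., 1., 0., 0.],
#         [1., 1., 1., 1., 1.]])
```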
[add] PeopleDailyCorpusLoader, to parse the People's Daily corpus (a parsing sketch follows below)
[update] add a CWS + POS_tag interface to FastNLP; see the example in test_fastNLP.py
[update] bring README.md and readme_example.py up to date with the latest version.
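A rough sketch of such a loader, assuming the common 'word/POS' per-token format of the People's Daily corpus (the actual parser may differ):

```python
class PeopleDailyCorpusLoader:
    """Hypothetical sketch: split each token into (word, POS tag)."""
    @staticmethod
    def load(path):
        examples = []
        with open(path, encoding="utf-8") as f:
            for line in f:
                # tokens look like '希望/n'; rsplit keeps '/' inside words intact
                pairs = [t.rsplit("/", 1) for t in line.split() if "/" in t]
                if pairs:
                    words, tags = zip(*pairs)
                    examples.append((list(words), list(tags)))
        return examples
```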
1. Tester has a parameter "print_every_step" to control printing; print_every_step == 0 means no printing (see the sketch after this list).
2. Tester's evaluate returns (a list of) floats, rather than torch.cuda tensors
3. Trainer also has a "print_every_step" parameter, with the same usage.
4. In training, validation steps are not shown.
5. Updates to code comments.
6. fastnlp.py is ready for CWS. test_fastNLP.py works.
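A sketch of the print_every_step behavior and the float-returning metrics, with invented names for illustration:

```python
import torch

def report(step, loss, print_every_step):
    # print_every_step == 0 means no printing at all
    if print_every_step > 0 and step % print_every_step == 0:
        print("step %d: loss=%.4f" % (step, loss))

for step in range(1, 31):
    report(step, loss=1.0 / step, print_every_step=10)

# evaluate now returns plain Python floats, not (cuda) tensors:
metrics = [float(torch.tensor(0.87))]   # a plain list of floats
```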
- specify the name of the config file and the name of the corresponding section where model init params are stored (see the sketch below)
- fastnlp.py needs load_pickle to get dictionary size and the number of labels
- other minor adjustments
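For illustration, using the stdlib configparser (the real config format and the section name here are assumptions):

```python
from configparser import ConfigParser

cfg = ConfigParser()
cfg.read_string("""
[text_class_model]
vocab_size = 10000
num_classes = 5
""")
# pick the section named for the model; its entries are the init params
params = dict(cfg["text_class_model"])   # {'vocab_size': '10000', 'num_classes': '5'}
```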
- add Loss, Optimizer
- change Trainer & Tester initialization interface: two styles of definition provided
- handle Optimizer construction and loss function definition in a hard-coded manner
- add argparse to the task-specific scripts (seq_labeling.py & text_classify.py); see the sketch after this list
- seq_labeling.py & text_classify.py work
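A sketch of the kind of argparse setup added to those scripts (the flag names are made up):

```python
import argparse

parser = argparse.ArgumentParser(description="sequence labeling demo")
parser.add_argument("--mode", choices=["train", "test", "infer"], required=True)
parser.add_argument("--config", default="data_for_tests/config",
                    help="path to the config file")
args = parser.parse_args()   # e.g. python seq_labeling.py --mode train
```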
- move preprocess.py from loader/ to core/
- changes to the interface of preprocess:
  1. add a run method, to run the main processing
  2. add a cross-validation split (see the sketch after this list)
  3. add a return value
  4. merge subclasses
- Trainer supports cross validation
- add data as arguments in Trainer.train & Tester.test
- add readme_example.py, to run the example program shown in README.md
- other corresponding changes
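A sketch of what the cross-validation split might look like (the fold logic is a generic illustration, not the library's code):

```python
def cv_split(data, n_folds):
    """Hypothetical k-fold split: yield (train, dev) pairs."""
    folds = [data[i::n_folds] for i in range(n_folds)]
    for i in range(n_folds):
        dev = folds[i]
        train = [x for j, fold in enumerate(folds) if j != i for x in fold]
        yield train, dev

for train, dev in cv_split(list(range(10)), n_folds=5):
    print(len(train), len(dev))   # 8 2, five times
```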
- see fastNLP/saver/logger.py to learn how to create and use a logger
- a log file named "train_test.log" will be created in the same directory as the main file where the program starts
- this file records all important events that happen in Trainer & Tester's methods
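For reference, a stdlib-logging sketch of the setup such a logger.py might contain (the handler details are assumptions):

```python
import logging

logger = logging.getLogger("fastNLP")
handler = logging.FileHandler("train_test.log")   # lands next to the entry script
handler.setFormatter(logging.Formatter("%(asctime)s %(levelname)s: %(message)s"))
logger.addHandler(handler)
logger.setLevel(logging.INFO)

logger.info("training started")   # the kind of event Trainer/Tester would record
```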
- rename Inference to Predictor
- rename Trainer.prepare_input to Trainer.load_train_data, which loads data_train.pkl only
- add a __contains__ method to the config Section class (see the sketch below)
- more code comments
- more elegant make_batch & data_iterator: samplers return batch samples instead of batch indices (see the sketch below)
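A sketch of the __contains__ addition (the Section internals are assumed):

```python
class Section:
    """Hypothetical sketch: a config Section that supports `in` checks."""
    def __init__(self, **entries):
        self.__dict__.update(entries)

    def __contains__(self, item):
        # allows `"key" in section` before accessing section.key
        return item in self.__dict__

sec = Section(vocab_size=100, num_classes=5)
assert "vocab_size" in sec and "lr" not in sec
```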
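And a sketch of a sampler that yields batches of samples rather than indices (the class name and shuffle policy are assumptions):

```python
import random

class RandomSampler:
    """Hypothetical sketch: yield lists of samples, not lists of indices."""
    def __call__(self, dataset, batch_size):
        order = list(range(len(dataset)))
        random.shuffle(order)
        for start in range(0, len(order), batch_size):
            yield [dataset[i] for i in order[start:start + batch_size]]

batches = list(RandomSampler()(list("abcdefgh"), batch_size=3))   # 3 batches: 3+3+2 samples
```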