fastNLP

Commit Graph

Author	SHA1	Message	Date
FengZiYjun	125c2718e4	Update * fix bug in DataSet.split * fix bugs in FieldArray, to allow content as a list * fix bug in losses check * ...	7 years ago
yh	c2d2137500	bug fix in MetricAccuracy	7 years ago
yh	8d7d2b428c	initial test for AccuracyMetric	7 years ago
FengZiYjun	fb5215ae73	fix bug in Trainer about metric_key; update Optimizer: multiple initialization styles 1. SGD() 2. SGD(0.01) 3. SGD(lr=0.01) 4. SGD(lr=0.01, momentum=0.9) 5. SGD(model.parameters(), lr=0.1, momentum=0.9)	7 years ago
xuyige	ba7b17661c	Merge branch 'trainer' of https://github.com/FengZiYjun/fastNLP into check	7 years ago
xuyige	6d36190be4	update LossBase class	7 years ago
FengZiYjun	8a7077fed2	Update Optimizer: optimizer.SGD(lr=xxx); if no parameters are passed in, they are added for it in the trainer	7 years ago
FengZiYjun	3120cdd09a	Update embed_loader: * add fast_load_embedding method, to index pre-trained embeddings with the words in Vocab * If words in Vocab do not exist in the pre-trained embeddings, sample them from a normal distribution computed from the existing embeddings	7 years ago
FengZiYjun	6427e85e8f	Update Vocabulary: * More words can be added after building. * Lazy update: rebuild automatically when the vocab is used. * Print a warning when newly added words push the vocab past its size limit	7 years ago
FengZiYjun	07e227aa4d	add interface of Loss	7 years ago
FengZiYjun	6839bb91cc	Add auto type detection/conversion in FieldArray * In init, detect content type to be Python int, float, or str. * In append(), check type consistency. * In init & append(), ints will be cast into floats if they occur together. * Map Python type into numpy dtype * Raise error if type detection fails.	7 years ago
FengZiYjun	da901ed5b0	* DataSet __getitem__ returns copy of Instance * refine interface of set_target & set_input * rename DataSet.Instance into DataSet.DataSetIter * remove unused methods in DataSet.DataSetIter * remove __setattr__ in DataSet; It is dangerous. * comment adjustment	7 years ago
yunfan	26a4324342	fix test	7 years ago
yunfan	04206f8099	Merge branch 'master' into dataset-res	7 years ago
FengZiYjun	3d66975091	* refine code comments * refine code style * set up unit tests for Batch, DataSet, FieldArray * remove a lot of out-of-date unit tests, to get testing passed	7 years ago
FengZiYjun	837bef47dc	* add unit tests for instance, vocabulary * remove and fix other unit tests * add more code comments	7 years ago
FengZiYjun	090f7aef5b	* fixing unit tests	7 years ago
FengZiYjun	e9d7074ba1	* delete readme_example.py because it is oooooooout of date. * rename preprocess.py into utils.py, because nothing about preprocess in it * anything in loader/ and saver/ is moved directly into io/ * corresponding unit tests are moved to /test/io * delete fastnlp.py, because we have new and better APIs * rename Biaffine_parser/run_test.py to Biaffine_parser/main.py; Otherwise, test will fail. * A looooooooooot of ancient codes to be refined...........	7 years ago
FengZiYjun	4be15a5b43	save POS tag script	7 years ago
yunfan	82f4351540	add index to word processor	7 years ago
FFTYYY	07fb61efdc	Update test_loss	7 years ago
FFTYYY	2cd2dae251	update loss	7 years ago
FengZiYjun	325157b53f	add tests	7 years ago
FengZiYjun	5133fe67b4	add character field	7 years ago
Coet	b80e5e8b29	Merge branch 'master' into dev	7 years ago
yunfan	ebbfcb7829	add dataset read functions	7 years ago
xuyige	b43d333738	clean some codes and fix some bugs	7 years ago
yunfan	baac29cfa0	fix tests	7 years ago
yunfan	a4c9786ca4	update dataset & loader	7 years ago
yunfan	8ea529404e	fix test	7 years ago
yunfan	2698094d8f	update embedding loader & vocab	7 years ago
FengZiYjun	fb806163c3	remove unused codes; add more tests	7 years ago
FengZiYjun	671975a223	add model.fit() method	7 years ago
FengZiYjun	5be4cb7bb5	Merge Preprocessor into DataSet. - DataSet's __init__ takes a function as argument, rather than a class object - Preprocessor is about to be removed. Don't use it anymore. - Remove cross_validate in trainer, because it is rarely used and weird - Loader.load is expected to be a static method - Delete something in other_modules.py - Add more tests - Delete extra sample data	7 years ago
xuyige	91f3d97ace	Update to new version of framework	7 years ago
FengZiYjun	0b86d7cf2b	Merge Preprocessor and DataSet	7 years ago
FengZiYjun	28a0683853	1. add tests in test_fastNLP.py & test_sampler.py; increase test coverage to 81% 2. changes of names: aggregation ----> aggregator interaction ----> interactor action.py ----> sampler.py BasePreprocess ---> Preprocessor BaseTester ----> Tester BaseTrainer ----> Trainer 3. add more code comments 4. fix bugs in predictor's data_forward 5. in sampler.py, remove Bachifier, fix some codes. but not test 6. remove unused codes in other_modules.py & utils.py 7. update fastnlp.py with new config file names and code comments 8. add data examples in data_for_tests/	7 years ago
yunfan	819c8f05be	fix vocab	7 years ago
yunfan	9c7f3cf261	add vocabulary into preprocessor	7 years ago
yunfan	3f4544759d	add unittest of data, fix bug	7 years ago
Coet	ef3c753e0d	Update test_seq_label.py	7 years ago
FengZiYjun	ad044ef4c7	fix test path to pass py.test	7 years ago
FengZiYjun	f2fc98b5e6	add Field support in Predictor: - apply DataSet in Predictor; remove sub-predictors; add "task" argument to specify which task to predict, as how Trainer/Tester did. - remove Action class - add helper function for DataSet, to create DataSet easily - more code comments - clean up unnecessary codes - add unit tests for Batch, Predictor, Preprocessor, Trainer, Tester	7 years ago
FengZiYjun	05af2e7544	Introduce Fields concept to eliminate the use of different sub-trainers/sub-testers. - update LabelField's to_tensor method to support int & str single label - update preprocessor's convert_to_dataset method to support single label inputs - introduce "task" in Trainer/Tester's data_forward, Tester's evaluate and metrics methods - in cnn_text_classification.py, change the name of the argument of forward - in sequence_modeling.py, change the name of the argument of forward - minor adjustments in test codes - text_classify.py works	7 years ago
FengZiYjun	758f0c0bd6	Introduce Field concept to optimize data representation. - add DataSet, Instance, Field to represent data in different levels - encapsulate batching method in Batch class - modify samplers in action.py to fit Batch - preprocessor.run returns DataSet, instead of list - Use Batch in Trainer/Tester - add required_arg "task" in Trainer/Tester - remove SeqLabelTrainer/SeqLabelTester dependencies successfully. They are now empty classes, to be deprecated. - modify SeqLabeling model, add another argument in forward, in order to compute mask inside model - test\model\seq_labeling.py works	7 years ago
yunfan	4dfe7aaacc	format test folder	7 years ago
xuyige	aac7982e93	fix a bug in config saver testing code	7 years ago
xuyige	6ddf5fcdcd	update test code for testing config saver	7 years ago
xuyige	7138ff210f	update config file for testing code, add more sections for testing.	7 years ago
Yige XU	be2f4aade3	Merge branch 'master' into test_code	7 years ago

167 Commits (3219b9da33065035a05762458335c511b1b8b130)