History

FengZiYjun 7514be6f30 - add validation loss into trainer.train - restructure: move reproduction outside - add evaluate in tester		7 years ago
..
LICENSE	- add validation loss into trainer.train	7 years ago

README.md	- add validation loss into trainer.train	7 years ago

__init__.py	- add validation loss into trainer.train	7 years ago

model.py	- add validation loss into trainer.train	7 years ago

test.py	- add validation loss into trainer.train	7 years ago

test.txt	- add validation loss into trainer.train	7 years ago

train.py	- add validation loss into trainer.train	7 years ago

train.txt	- add validation loss into trainer.train	7 years ago

utilities.py	- add validation loss into trainer.train	7 years ago

valid.txt	- add validation loss into trainer.train	7 years ago

PyTorch-Character-Aware-Neural-Language-Model

This is the PyTorch implementation of character-aware neural language model proposed in this paper by Yoon Kim.

Requiredments

The code is run and tested with Python 3.5.2 and PyTorch 0.3.1.

HyperParam	value
LSTM batch size	20
LSTM sequence length	35
LSTM hidden units	300
epochs	35
initial learning rate	1.0
character embedding dimension	15

Train the model with split train/valid/test data.

python train.py

The trained model will saved in cache/net.pkl.
Test the model.

python test.py

Best result on test set:
PPl=127.2163
cross entropy loss=4.8459

This implementation borrowed ideas from

一款轻量级的自然语言处理（NLP）工具包，目标是减少用户项目中的工程型代码，例如数据处理循环、训练循环、多卡运行等

自然语言处理 nlp

Python Jupyter Notebook Text CSV Markdown

writerphone@163.com xuyige1996@gmail.com 1901722105@qq.com xpqiu@fudan.edu.cn lyhuang19@163.com keezen@qq.com 17210240044@fudan.edu.cn 42239874+lyhuang18@users.noreply.github.com henryL7 15307130288@fudan.edu.cn fdjingyuan@outlook.com kunya.dn@gmail.com 378213564@qq.com yunfan.shao@outlook.com 294130139@qq.com zyfeng@yitu-inc.intra