Browse Source

[verify] sstdataloader add sst2

[add] readme
tags/v0.4.10
wyg 5 years ago
parent
commit
f369778ab3
2 changed files with 24 additions and 1 deletions
  1. +22
    -0
      reproduction/text_classification/README.md
  2. +2
    -1
      reproduction/text_classification/data/sstLoader.py

+ 22
- 0
reproduction/text_classification/README.md View File

@@ -0,0 +1,22 @@
# text_classification任务模型复现
这里使用fastNLP复现以下模型:
char_cnn :论文链接[Character-level Convolutional Networks for Text Classification](https://arxiv.org/pdf/1509.01626v3.pdf)
dpcnn:论文链接[Deep Pyramid Convolutional Neural Networks for TextCategorization](https://ai.tencent.com/ailab/media/publications/ACL3-Brady.pdf)
HAN:论文链接[Hierarchical Attention Networks for Document Classification](https://www.cs.cmu.edu/~diyiy/docs/naacl16.pdf)
#待补充
awd_lstm:
lstm_self_attention(BCN?):
awd-sltm:

# 数据集及复现结果汇总

使用fastNLP复现的结果vs论文汇报结果(/前为fastNLP实现,后面为论文报道,-表示论文没有在该数据集上列出结果)

model name | yelp_p | sst-2|IMDB|
:---: | :---: | :---: | :---:
char_cnn | 93.80/95.12 | - |- |
dpcnn | 95.50/97.36 | - |- |
HAN |- | - |-|
BCN| - |- |-|
awd-lstm| - |- |-|


+ 2
- 1
reproduction/text_classification/data/sstLoader.py View File

@@ -5,7 +5,8 @@ from fastNLP.core.vocabulary import VocabularyOption, Vocabulary
from fastNLP import DataSet
from fastNLP import Instance
from fastNLP.io.embed_loader import EmbeddingOption, EmbedLoader

import csv
from typing import Union, Dict

class SSTLoader(DataSetLoader):
URL = 'https://nlp.stanford.edu/sentiment/trainDevTestTrees_PTB.zip'


Loading…
Cancel
Save