Semi-supervised Sequence Learning

Dai, Andrew M.; Le, Quoc V.

arXiv:1511.01432 (cs)

[Submitted on 4 Nov 2015]

Abstract: We present two approaches that use unlabeled data to improve sequence learning with recurrent networks. The first approach is to predict what comes next in a sequence, which is a conventional language model in natural language processing. The second approach is to use a sequence autoencoder, which reads the input sequence into a vector and then predicts the input sequence again. These two algorithms can be used as a "pretraining" step for a later supervised sequence learning algorithm. In other words, the parameters obtained from the unsupervised step can be used as a starting point for other supervised training models. In our experiments, we find that long short-term memory recurrent networks pretrained with either of the two approaches are more stable and generalize better. With pretraining, we are able to train long short-term memory recurrent networks up to a few hundred timesteps, thereby achieving strong performance in many text classification tasks, such as IMDB, DBpedia and 20 Newsgroups.
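
The two pretraining objectives in the abstract differ mainly in how inputs and targets are paired, and the pretrained parameters are then reused to initialize the supervised model. The following is a minimal sketch of that data layout in plain Python, not the authors' implementation: the `<eos>` marker, the function names, and the parameter names (`embedding`, `lstm`, `classifier`) are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Sequence, Tuple

EOS = "<eos>"  # hypothetical end-of-sequence marker, not taken from the paper


def language_model_pairs(tokens: Sequence[str]) -> List[Tuple[str, str]]:
    """Approach 1: at every step the target is simply the next token."""
    return list(zip(tokens[:-1], tokens[1:]))


def autoencoder_example(
    tokens: Sequence[str],
) -> Tuple[List[str], List[str], List[str]]:
    """Approach 2: the encoder reads the sequence, the decoder reproduces it."""
    encoder_inputs = list(tokens) + [EOS]
    decoder_inputs = [EOS] + list(tokens)   # decoder input, shifted right by one step
    decoder_targets = list(tokens) + [EOS]  # the target is the input sequence again
    return encoder_inputs, decoder_inputs, decoder_targets


def init_from_pretrained(
    pretrained: Dict[str, list], fresh: Dict[str, list]
) -> Dict[str, list]:
    """Use pretrained values wherever a parameter name matches; keep the rest fresh."""
    return {name: pretrained.get(name, value) for name, value in fresh.items()}


tokens = ["the", "movie", "was", "great"]
lm_pairs = language_model_pairs(tokens)
enc_in, dec_in, dec_out = autoencoder_example(tokens)

# Toy parameter dictionaries standing in for real weight tensors (assumed names).
pretrained = {"embedding": [0.1, 0.2], "lstm": [0.3]}
fresh = {"embedding": [0.0, 0.0], "lstm": [0.0], "classifier": [0.0]}
supervised_init = init_from_pretrained(pretrained, fresh)
```

In this sketch only the parameters shared between the unsupervised and supervised models are carried over; the task-specific `classifier` entry keeps its fresh initialization, which mirrors the abstract's description of pretraining as a starting point for later supervised training.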

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:1511.01432 [cs.LG]
	(or arXiv:1511.01432v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1511.01432

Submission history

From: Andrew Dai
[v1] Wed, 4 Nov 2015 18:48:36 UTC (21 KB)
