We employed BYOL, a self-supervised implicit contrastive learning strategy, to incorporate lexical information such as synonyms and antonyms into existing word2vec embeddings. The goal was to enrich word vectors early in the pipeline, on the intuition that, much like humans, a model should learn such lexical relations before anything else. We evaluated the enhanced embeddings on a sentiment analysis classification task on the Movie Review dataset and achieved a 2.5% accuracy improvement, training on a single GPU for a couple of hours.
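For reference, the sketch below illustrates the BYOL objective we build on: an online network with a predictor learns to match a slowly moving (EMA) target network across two "views" of the same word. This is a minimal illustration only, assuming word/synonym vector pairs serve as the two views; all module names, dimensions, and hyperparameters here are hypothetical and not the repository's actual implementation.

```python
# Minimal, illustrative BYOL sketch for word vectors (NOT the repo's code).
# Assumes 300-d word2vec inputs; the two "views" are a word's vector and a
# synonym's vector. All names and dimensions are hypothetical.
import copy
import torch
import torch.nn as nn
import torch.nn.functional as F

def mlp(dim_in, dim_hidden, dim_out):
    return nn.Sequential(nn.Linear(dim_in, dim_hidden),
                         nn.BatchNorm1d(dim_hidden),
                         nn.ReLU(inplace=True),
                         nn.Linear(dim_hidden, dim_out))

class BYOL(nn.Module):
    def __init__(self, dim=300, proj=256, tau=0.99):
        super().__init__()
        self.online_encoder = mlp(dim, 512, proj)   # online projector
        self.predictor = mlp(proj, 512, proj)       # online predictor
        self.target_encoder = copy.deepcopy(self.online_encoder)
        for p in self.target_encoder.parameters():
            p.requires_grad = False                 # target gets no gradients
        self.tau = tau                              # EMA decay rate

    @torch.no_grad()
    def update_target(self):
        # Exponential moving average of online weights into the target network.
        for po, pt in zip(self.online_encoder.parameters(),
                          self.target_encoder.parameters()):
            pt.data = self.tau * pt.data + (1 - self.tau) * po.data

    def loss(self, v1, v2):
        # Symmetrized BYOL loss: predict the target projection of one view
        # from the online projection of the other.
        def one_side(a, b):
            p = F.normalize(self.predictor(self.online_encoder(a)), dim=-1)
            with torch.no_grad():
                z = F.normalize(self.target_encoder(b), dim=-1)
            return 2 - 2 * (p * z).sum(dim=-1).mean()
        return one_side(v1, v2) + one_side(v2, v1)

# Usage: v1 = word2vec vectors, v2 = vectors of their synonyms.
model = BYOL()
v1, v2 = torch.randn(32, 300), torch.randn(32, 300)
loss = model.loss(v1, v2)
loss.backward()
model.update_target()
```

Note that BYOL uses no negative pairs; the stop-gradient on the target branch together with the online predictor is what prevents representational collapse.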
To train BYOL from scratch and obtain the enhanced embeddings:

```python
from get_byol_embeddings import get_byol_embed

get_byol_embed()
```

This saves a model checkpoint after every epoch in the current folder.
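These checkpoints can be reloaded later with standard PyTorch tooling. A hypothetical example, assuming the run is PyTorch-based and a file such as `byol_epoch_10.pt` exists (the actual filename pattern depends on what `get_byol_embed` writes out):

```python
import torch

# Hypothetical: reload a saved per-epoch checkpoint; the real filename
# pattern is whatever get_byol_embed uses when saving each epoch.
checkpoint = torch.load("byol_epoch_10.pt", map_location="cpu")
```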
To test our enhanced embeddings on the Movie Review classification task:

```python
from get_byol_embeddings import test_embeddings

test_embeddings()
```

To test the original word2vec embeddings on the same task:

```python
from get_byol_embeddings import test_embeddings

test_embeddings(enhance=False)
```
To perform PCA and visually compare both embeddings for a particular word:

```python
from get_byol_embeddings import pca

pca(word='get')
```
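For readers curious what such a comparison involves, here is an illustrative sketch (not the repository's `pca` implementation), assuming two gensim-style `KeyedVectors` objects `w2v` and `byol` hold the original and enhanced vectors:

```python
# Illustrative only: project a word and its nearest word2vec neighbours
# into 2-D, once per embedding space, and plot the two layouts side by side.
# `w2v` and `byol` are assumed gensim KeyedVectors; names are hypothetical.
import numpy as np
import matplotlib.pyplot as plt
from sklearn.decomposition import PCA

def compare_embeddings(word, w2v, byol, topn=10):
    words = [word] + [w for w, _ in w2v.most_similar(word, topn=topn)]
    fig, axes = plt.subplots(1, 2, figsize=(10, 4))
    for ax, (name, kv) in zip(axes, [("word2vec", w2v), ("BYOL", byol)]):
        pts = PCA(n_components=2).fit_transform(
            np.stack([kv[w] for w in words]))
        ax.scatter(pts[:, 0], pts[:, 1])
        for (x, y), w in zip(pts, words):
            ax.annotate(w, (x, y))
        ax.set_title(name)
    plt.show()
```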
The plot below shows accuracy curves for the two models, our BYOL-enhanced embeddings vs. word2vec: