Skip to content

Unbabel/nlp-seminar

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 
 
 

Repository files navigation

Welcome to Unbabel's AI Reading Group Page

We have a weekly gathering to discuss recent papers in NLP and AI. These are very informal and relaxed reading meetings, so slides are not required although presenters usually make them.

Meeting Link

Schedule

2020-11-02

José Camargo de Souza

Nearest Neighbor Machine Translation

2020-10-26

Alon Lavie, Craig Stewart and Amin Farajian

AMTA2020 Findings

Slides

2020-09-28

Ricardo Rei and Catarina Farinha

COMET: A Neural Framework for MT Evaluation + some findings from participating in the WMT20 Metrics shared task.

Slides

2020-09-21

Daan Van Stigt and Jose Camargo de Souza

Learning to summarize from human feedback

2020-09-14

Ricardo Rei

An Overview on Adapters: What they are and how can we use them for NLG

Slides

2020-09-07

Rita Costa

End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2

Slides

2020-08-24

Daan Van Stigt

  1. SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized Embeddings multilingual-BERT
  2. Multilingual Alignment of Contextual Word Representations

Slides

2020-08-17

Pedro Mota

End-to-End Neural Word Alignment Outperforms GIZA++ Slides

2020-08-10

Miguel Vera

ACL 2020 Best paper award: Beyond Accuracy: Behavioral Testing of NLP Models with CheckList

Video Recording

2020-08-04

Lena Voita

Note: The paper presented by Lena is still under review, thus it is not publicly available. Meanwhile check out her page: Lena Voita published work

2020-07-27

Kyunghyun Cho

LxMLS 2020 talk: Question Answering and Generation for Evaluating Summarization

2020-07-20

Daan Van Stigt

Posterior Control of Blackbox Generation

Slides

2020-07-13

Ricardo and Catarina F.

Recent trends in Evaluation: ACL notes on Natural Language Generation Evaluation (focus on MT)

Slides

2020-07-06

Invited talk from Sean Welleck. where he talked about his current work on Natural Language Generation: Some interesting work by Sean:

2020-06-29

Patrick Fernandes

Paper: [2005.02354] It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information

Slides

2020-06-22

Catarina F. and Ricardo

Paper: [2006.06264] Tangled up in BLEU

Slides

2020-06-15

Austin

Paper: Mirror-Generative Neural Machine Translation

Slides

2020-06-08

Craig

Paper: On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation

Slides

2020-06-01

@jose.souza

Paper: [2004.12681] Lexically Constrained Neural Machine Translation with Levenshtein Transformer

Slides

2020-05-25

@Ricardo

Paper: [2004.13637] Recipes for building an open-domain chatbot

Slides: retrival vs generative chatbots

2020-05-18

@Daan and @Fabio

Paper: [1905.00076] Ensemble Distribution Distillation and [2002.11531] A general framework for ensemble distribution distillation

Slides: RG - Ensemble Distribution Distillation

2020-05-11

@Patrick

Paper: [2004.03061] Information-Theoretic Probing for Linguistic Structure

Slides: Information-Theoretic Probing for Linguistic Structure

2020-05-04

@Daan

Paper: [2003.12298] Information-Theoretic Probing with Minimum Description Length

Blog: Information-Theoretic Probing with MDL

Slides: Information-Theoretic Probing

2020-03-30

Pedro Lobato

Paper: ELECTRA: Pre-Training Text Encoders as Discriminators rather than Generators

Slides: Google Slides

2020-03-09

@Katya

Paper: Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models and Mix-review: Alleviate Forgetting in the Pretrain-Finetune Framework for Neural Language Generation Models

Slides: mixout and mix-review

2020-02-17

@Daan

Paper: Torch-Struct: Deep Structured Prediction Library

Slides: Torch-Struct

2020-01-27

@Nuno Miguel G and Pedro Lobato

Paper: Reformer: The Efficient Transformer (PDF)

2020-01-12

@Amin and @António

Paper: 

2020-01-05

@Ekaterina

Paper: Optimizing data usage via differentiable rewards

Slides: “Optimizing data usage via differentiable rewards” by Xinyi Wang et al.

2019-12-30

@Catarina F

Paper: Gmail Smart Compose: Real-Time Assisted Writing

2019-12-16

@Fabio and @Amin

Paper: Neural Machine Translation with Soft Prototype

2019-12-09

@Miguel and @Nuno Miguel G

Paper: Improving Conditioning in Context-Aware Sequence to Sequence Models

2019-12-02

@Ekaterina and @Fabio

Paper: On NMT Search Errors and Model Errors: Cat Got Your Tongue?

2019-11-25

@António

Paper: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

2019-11-18

Pedro Lobato

Paper: Mask-Predict: Parallel Decoding of Conditional Masked Language Models

2019-11-11

@Fabio

Paper: Improving Back-Translation with Uncertainty-based Confidence Estimation

2019-11-04

@Sérgio

Paper: CORRECTION OF AUTOMATIC SPEECH RECOGNITION WITH TRANSFORMER SEQUENCE-TO-SEQUENCE MODEL

2019-10-28

@Ricardo

Paper: 75 Languages, 1 Model: Parsing Universal Dependencies Universally

2019-10-21

Rafaela Saraiva

Paper: Training Neural Response Selection for Task-Oriented Dialogue Systems 

2019-10-14

Pedro Lobato

Paper: TinyBERT

2019-09-09

Pedro Lobato

Paper: Large Memory Layers with Product Keys

2019-08-26

Invited speaker: Patrick Fernandes

Paper: Structured Neural Summarization (ICLR 2019)

2019-08-19

@Ricardo

Paper: Do Neural Dialog Systems Use Conversation History Effectively? An Empirical Study (And if we have time: Pretraining Methods for Dialog Context Representation Learning)

2019-08-12

ACL compilation and digest.

2019-08-05

@Daan

Title: Neural language models with latent syntax

Description: In this talk I will present my thesis work at the University of Amsterdam under supervision of Wilker Aziz. In the work I investigate semi-supervised and unsupervised learning of the recurrent neural network grammar (RNNG) (Dyer et al. 2016). I will also briefly describe concurrent work by Kim et al. (2019) who (unbeknownst to me) worked on an almost identical approach.

Slides: https://github.com/daandouwe/thesis/blob/master/doc/presentation-unbabel.pdf

Links:

2019-07-22

Invited speaker: Daniel Loureiro

He’s going to talk about his accepted paper at ACL:

“Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation”

2019-07-08

Antonio Gois

Resuming XLnet and related papers

Slides: non-sequential overview

2019-07-01

@Tsvetomila and @Marcos

Paper: [1906.08237] XLNet: Generalized Autoregressive Pretraining for Language Understanding

Slides: XLNet

2019-06-24

@Amin and @António

2nd part

2019-06-17

@Amin and @António

Papers: TBD

Releases

No releases published

Packages

No packages published