Autoregressive mode and embedding calculation addition #62
Conversation
    ).hidden_states[-1]
else:
    # First element of model_output contains all token embeddings
    token_embeddings = self.model(input_ids, attention_mask)[0]
I don't think the first element is useful here, because the attention is left-to-right (causal).
I think that is the existing code; I only moved the selection of the zeroth item out of the mean pooling function and put it here.
else:
    # First element of model_output contains all token embeddings
    token_embeddings = self.model(input_ids, attention_mask)[0]
    embeddings = self.mean_pooling(token_embeddings, attention_mask)
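For context, the mean_pooling helper called here is not shown in the diff. A minimal sketch, assuming it follows the common masked-mean pattern (the name and signature are taken from the call above, the body is an assumption):

```python
import torch

def mean_pooling(token_embeddings: torch.Tensor, attention_mask: torch.Tensor) -> torch.Tensor:
    # token_embeddings: (batch, seq_len, hidden); attention_mask: (batch, seq_len)
    # Average the token embeddings, counting only non-padding positions.
    mask = attention_mask.unsqueeze(-1).expand(token_embeddings.size()).float()
    summed = (token_embeddings * mask).sum(dim=1)
    counts = mask.sum(dim=1).clamp(min=1e-9)
    return summed / counts
```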
Why do we need a pooling step if we select only a single embedding?
I guess the two methods should be:
- Selecting the EOS embedding as the representation, since it has seen all the previous tokens (a minimal sketch follows below).
- Getting the embeddings for every token and pooling them.
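A minimal sketch of the first option, assuming right-padded inputs (the function name and signature are illustrative, not from this PR): take the hidden state of the last non-padding token, which in a causal model has attended to the whole sequence.

```python
import torch

def last_token_pooling(token_embeddings: torch.Tensor, attention_mask: torch.Tensor) -> torch.Tensor:
    # token_embeddings: (batch, seq_len, hidden); attention_mask: (batch, seq_len)
    # Index of the last non-padding token per sequence (assumes right padding).
    last_idx = attention_mask.sum(dim=1) - 1
    batch_idx = torch.arange(token_embeddings.size(0), device=token_embeddings.device)
    return token_embeddings[batch_idx, last_idx]  # (batch, hidden)
```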
I believe the shape of the token embeddings is (1, <length of tokenized inputs>, <number of features>). The output from the hidden-states flag has shape [number of layers, 1, length of tokenized inputs, number_of_features], so I think we're doing number 2 here?
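The shapes can be checked directly; a small sketch (gpt2 is just an example checkpoint, not the model used in this project):

```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModel.from_pretrained("gpt2")

inputs = tokenizer("an example sentence", return_tensors="pt")
with torch.no_grad():
    out = model(**inputs, output_hidden_states=True)

# out.hidden_states is a tuple with one tensor per layer (plus the embedding layer),
# each of shape (1, seq_len, hidden_size); the last entry equals out.last_hidden_state.
print(len(out.hidden_states))            # number of layers + 1
print(out.hidden_states[-1].shape)       # (1, seq_len, hidden_size)
print(torch.equal(out.hidden_states[-1], out.last_hidden_state))  # True
```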
Got it. I guess we can do the same trick for encoder-only models as well.
2. add is_autoregressive flag to eval portion
3. fix if to elif in save hook
4. remove default save statement
5. unwrap model properly while saving
I guess we can remove the mean-embedding addition, with a note saying we got better results by taking the EOS token as the representation.
Looks good
BGE models require CLS pooling, not mean pooling: https://huggingface.co/BAAI/bge-large-en#frequently-asked-questions
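For reference, CLS pooling just takes the first token's hidden state; a minimal sketch (the helper name is illustrative):

```python
import torch

def cls_pooling(token_embeddings: torch.Tensor) -> torch.Tensor:
    # token_embeddings: (batch, seq_len, hidden); the [CLS] token sits at position 0.
    return token_embeddings[:, 0]
```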
Changes
- autoregressive flag to notify AutoModelSequenceEmbeddings and AutoModelForRagE2E that an autoregressive model is being used
- mean_pooling is more applicable to both scenarios (clm and mlm)
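Putting the pieces together, a hedged sketch of how the autoregressive flag might switch the embedding path (the function is illustrative and mirrors the diff above rather than the project's exact code; in both branches the result is then fed to mean_pooling):

```python
def token_embeddings(model, input_ids, attention_mask, autoregressive: bool = False):
    if autoregressive:
        # Causal LMs (clm): request hidden states and take the last layer.
        return model(
            input_ids=input_ids,
            attention_mask=attention_mask,
            output_hidden_states=True,
        ).hidden_states[-1]
    # Encoder models (mlm): the first element of the output holds all token embeddings.
    return model(input_ids=input_ids, attention_mask=attention_mask)[0]
```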