feat: Investigation of multilingual LLMs #109

Merged
merged 1 commit into from
Dec 3, 2024

Conversation

KevinJBoyer
Contributor

Ticket

https://navalabs.atlassian.net/browse/DST-518

Changes

  • Compare recall for our current embedding model (mpnet) and a model designed to work across multiple languages
  • Set up a way to generate synthetic content/question pairs from existing chunks
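
For readers unfamiliar with the metric, here is a minimal sketch of how recall@k could be computed over synthetic question/chunk pairs. It is not the code from this PR: the function names, the toy vectors, and the use of cosine similarity are all assumptions for illustration, and in practice the vectors would come from whichever embedding model is being evaluated.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def recall_at_k(question_vecs, chunk_vecs, gold_chunk_ids, k):
    """Fraction of questions whose source chunk appears in the top-k results.

    gold_chunk_ids[i] is the index of the chunk that question i was
    generated from (hypothetical layout, assumed for this sketch).
    """
    if not question_vecs:
        raise ValueError("need at least one question")
    hits = 0
    for q_vec, gold in zip(question_vecs, gold_chunk_ids):
        # Rank all chunk indices by similarity to the question, best first.
        ranked = sorted(
            range(len(chunk_vecs)),
            key=lambda i: cosine(q_vec, chunk_vecs[i]),
            reverse=True,
        )
        if gold in ranked[:k]:
            hits += 1
    return hits / len(question_vecs)


# Toy stand-ins for embeddings; real vectors would come from the model under test.
chunk_vecs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
question_vecs = [[0.9, 0.1], [0.1, 0.9], [1.0, 0.1]]
gold_ids = [0, 1, 2]

recall_1 = recall_at_k(question_vecs, chunk_vecs, gold_ids, k=1)
recall_2 = recall_at_k(question_vecs, chunk_vecs, gold_ids, k=2)
```

Running the same function over the question and chunk vectors produced by each model, with the same `k`, gives directly comparable numbers.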


☂️ Python Coverage

current status: ✅

Overall Coverage

| Lines | Covered | Coverage | Threshold | Status |
|-------|---------|----------|-----------|--------|
| 1850  | 1589    | 86%      | 80%       | 🟢     |

New Files

No new covered files...

Modified Files

No covered modified files...

updated for commit: 7934852 by action🐍

@KevinJBoyer KevinJBoyer requested a review from ccheng26 November 25, 2024 21:16
Contributor

@ccheng26 ccheng26 left a comment

Should the .json files be in a .gitignore file?

@KevinJBoyer
Contributor Author

I thought they'd be good to commit so others could take a look (and re-run the analysis) without having to regenerate them, which is expensive.

@KevinJBoyer KevinJBoyer merged commit f1bc74c into main Dec 3, 2024
4 checks passed
@KevinJBoyer KevinJBoyer deleted the kb/translation branch December 3, 2024 15:59