-
-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use chapter-filtering for terms #545
Conversation
4f37c23
to
e5ead88
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewed 6 of 6 files at r1, all commit messages.
Reviewable status: all files reviewed, 1 unresolved discussion (waiting on @Enkidu93 and @johnml1135)
src/ServiceToolkit/src/SIL.ServiceToolkit/Services/ParallelCorpusPreprocessingService.cs
line 87 at r1 (raw file):
ITextCorpus[]? sourceTermCorpora = _corpusService .CreateTermCorpora( corpus.SourceCorpora.SelectMany(sc => sc.Files.Select(f => (f, sc.TrainOnChapters))).ToArray()
What about textIds?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewable status: 2 of 6 files reviewed, 1 unresolved discussion (waiting on @ddaspit and @johnml1135)
src/ServiceToolkit/src/SIL.ServiceToolkit/Services/ParallelCorpusPreprocessingService.cs
line 87 at r1 (raw file):
Previously, ddaspit (Damien Daspit) wrote…
What about textIds?
Done.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewed 4 of 4 files at r2, all commit messages.
Reviewable status: complete! all files reviewed, all discussions resolved (waiting on @johnml1135)
Still fixing a bug |
Does this also address #538? Can we unique these - or do we want to do it in a separate PR? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewed 2 of 6 files at r1, 3 of 4 files at r2, 3 of 3 files at r3, all commit messages.
Reviewable status: all files reviewed, 1 unresolved discussion (waiting on @Enkidu93)
Previously, johnml1135 (John Lambert) wrote…
A separate PR is fine. |
…th the texts of the corpus
…th the texts of the corpus
788beb9
to
551e6cd
Compare
Is it all ready? Do we need to re-release Machine? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewed 6 of 6 files at r4, all commit messages.
Reviewable status: complete! all files reviewed, all discussions resolved (waiting on @Enkidu93)
Alright, the last issue has been addressed. We'll need
|
…erval into filter_kbt_by_chapter
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewed 3 of 3 files at r3, 6 of 6 files at r4, 3 of 3 files at r5, all commit messages.
Reviewable status: all files reviewed, 1 unresolved discussion (waiting on @Enkidu93)
src/ServiceToolkit/src/SIL.ServiceToolkit/Services/CorpusService.cs
line 43 at r5 (raw file):
) { foreach ((CorpusFile file, Dictionary<string, HashSet<int>>? chapters) in corpora)
Why was this added?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewable status: 8 of 9 files reviewed, 1 unresolved discussion (waiting on @ddaspit and @johnml1135)
src/ServiceToolkit/src/SIL.ServiceToolkit/Services/CorpusService.cs
line 43 at r5 (raw file):
Previously, ddaspit (Damien Daspit) wrote…
Why was this added?
Sorry, it was there before; I added it back in temporarily while I was rethinking what the right behavior should be when no filters are supplied and then forgot to remove it. Removed.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewed 1 of 1 files at r6, all commit messages.
Reviewable status: complete! all files reviewed, all discussions resolved (waiting on @Enkidu93)
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #545 +/- ##
==========================================
+ Coverage 56.97% 57.03% +0.05%
==========================================
Files 302 302
Lines 15620 15639 +19
Branches 2153 2155 +2
==========================================
+ Hits 8900 8919 +19
+ Misses 6077 6076 -1
- Partials 643 644 +1 ☔ View full report in Codecov by Sentry. |
Reworking of #508
Fixes #476
This change is