CAT vocabulary lookup not working #534

vmarchman · 2024-08-20T21:37:20Z

@HenryMehta Testing some CAT lookups. The Thetas that I'm getting seem really low, but they are there. But, all of the vocabulary look-ups are 0.

vmarchman · 2024-08-20T21:43:14Z

Old CATs run previously seem to be working just fine. Also ones that are under 30 months. It appears to be only for children 31-36 months where it is not working.

HenryMehta · 2024-08-21T12:01:49Z

@vmarchman The Percentile for both sexes is 0. We have no values when the percentile is less than 5% which is why the estimated vocabulary by both sexes shows 0.

There is an error for the Est Vocab by sex. I've been going through the code and although it is getting the different Percentiles, it is then basing the Est Vocab for both and by sex on the Percentile by both.

I have corrected this and I'm just running tests (takes 1-2 hours) to ensure it has broken something else. I'll then deploy to test and prod.

Please note, this means all the CAT Est Vocabs by sex have been showing based on the percentile for by Both sexes. This will have been right when the percentiles were the same but wrong when different.

I'll let you know when deployed

HenryMehta · 2024-08-21T13:26:25Z

@vmarchman done

vmarchman · 2024-08-21T13:55:45Z

@HenryMehta Let's see if we can just spit out the lowest value in the table in these cases. We are currently using tables with only 5 percentile increments. Let's switch to tables with 1 percentile increments. Then, you can just spit out the lowest value in the table (1st percentile).

You also then don't have to do the interpolation step as you would need to do for the 5 percentile tables.

vmarchman · 2025-01-27T23:44:38Z

Hi @HenryMehta I think we never implemented the 1 percentile tables for the estimating of the vocabulary size based on the theta percentile. If the Theta percentile is <1 (it should never be 0), then the estimated vocabulary for that child should be the 1 percentile value. Here are the 1 percentile tables for English WG and WS. Let me know if you have any questions.

WGprod_both_1.csv
WGprod_boys_1.csv
WGprod_girls_1.csv
WSprod_both_36months_1.csv
WSprod_boys_36_1.csv
WSprod_girls_36months_1.csv

HenryMehta · 2025-01-28T11:25:56Z

@vmarchman

I have done WG and it is available to test. I have assumed the percentages given are for words understood. I have not amended the other benchmark categories (Words Produced, Total Gestures, Phrases, Later Gestures, Early Gestures).

HenryMehta · 2025-01-28T12:01:50Z

@vmarchman
I have started working on the WS but the boys file only goes to age 30 months and the girls file is only every 5%.

Also, I think you need to understand, this will not stop the 0 issue. A WS (raw score rather than sex specific) of age 36 months having indicated less than 162 words will show as 0%. In order to show 1% in this case, we need the value in the cell to be 0.

vmarchman · 2025-01-28T17:32:59Z

Hi @HenryMehta These are only for computing the estimated vocabulary for the English CAT, not the English WG or WS.

I will double check the numbers and files and get back to you.

HenryMehta · 2025-01-28T18:40:12Z

@vmarchman Blast, that means I've edited the wrong files. Fortunately I kept the originals

vmarchman · 2025-01-28T20:02:53Z

Sorry about that @HenryMehta !!

You were right that some of the files weren't right. Here they are all again, all should be 1 percentile increments and to 36 months for WS. the WG is only prod from 8 to 18 months. These should be used to compute the estimated vocabulary scores based on the CAT Theta percentiles. Because they are at the 1-percentile level, you won't need to interpolate any scores (like we had to do with the 5 percentile-level tables).

WGprod_boys_1.csv
WGprod_girls_1.csv
WGprod_both_1.csv
WSprod_boys_36_1.csv
WSprod_both_36months_1.csv
WSprod_girls_36months_1.csv

I don't understand your comment re the 0 issue: Let me try to say what I want to happen for the CAT scoring outputs, and then maybe you can help me understand better.

There are two things happening: (1) spitting out a percentile value for the Theta based on the CAT look-up tables. If the actual theta is below the lowest value in the CAT theta tables, the percentile value that is output should be "< 1". There should never be a 0th percentile score output in any situation.

(2) providing an estimated vocabulary score for that "< 1" percentile based on the 1-percentile level vocabulary tables. If a particular child's theta percentile is "< 1", then the system should output the number from the 1 percentile vocabulary tables that represents the lowest value in the table, i.e., the 1st percentile value, for their estimated vocabulary score.

HenryMehta · 2025-01-29T09:56:22Z

@vmarchman ok, I've got the raw scores in the right place but I do not have estimated thetas in single percent intervals. I only have 5% intervals so that remains what you'll get

HenryMehta closed this as completed Aug 21, 2024

HenryMehta reopened this Aug 21, 2024

HenryMehta closed this as completed Sep 25, 2024

vmarchman reopened this Jan 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CAT vocabulary lookup not working #534

CAT vocabulary lookup not working #534

vmarchman commented Aug 20, 2024

vmarchman commented Aug 20, 2024

HenryMehta commented Aug 21, 2024

HenryMehta commented Aug 21, 2024

vmarchman commented Aug 21, 2024

vmarchman commented Jan 27, 2025 •

edited

Loading

HenryMehta commented Jan 28, 2025

HenryMehta commented Jan 28, 2025

vmarchman commented Jan 28, 2025

HenryMehta commented Jan 28, 2025

vmarchman commented Jan 28, 2025

HenryMehta commented Jan 29, 2025

CAT vocabulary lookup not working #534

CAT vocabulary lookup not working #534

Comments

vmarchman commented Aug 20, 2024

vmarchman commented Aug 20, 2024

HenryMehta commented Aug 21, 2024

HenryMehta commented Aug 21, 2024

vmarchman commented Aug 21, 2024

vmarchman commented Jan 27, 2025 • edited Loading

HenryMehta commented Jan 28, 2025

HenryMehta commented Jan 28, 2025

vmarchman commented Jan 28, 2025

HenryMehta commented Jan 28, 2025

vmarchman commented Jan 28, 2025

HenryMehta commented Jan 29, 2025

vmarchman commented Jan 27, 2025 •

edited

Loading