performance plot #35

siwei-li · 2024-02-15T01:07:14Z

No description provided.

jettjaniak

it's good, let's continue working on it!

make plotly plot instead - I'm not sure if using ipywidgets will be the best solution in this case, as plotly has some built-in support for dropdowns, see original issue description
change the input format to a tuple of (err_low, value, err_high) instead of list of all values - someone else will take care of computing this efficiently on the dataset scale
make it look somewhat like that, with colors as arguments with reasonable defaults; loss is log scale

4. see code comments

src/delphi/dataset/mock_per_token_performance.py

src/delphi/eval/vis_per_token_model.py

siwei-li · 2024-02-19T05:07:07Z

I have changed the plotting by using Plotly, the only thing I'm not sure about is this coloring thing @jettjaniak

make it look somewhat like that, with colors as arguments with reasonable defaults; loss is log scale

jaidhyani · 2024-02-19T22:55:07Z

lgtm, nice work

jaidhyani · 2024-02-20T00:00:06Z

Two things to follow up on:

I think we want the loss (y axis) to be log scale.
We should make color an argument of visualize_per_token_category (it can default to "purple")

siwei-li · 2024-02-20T02:01:03Z

Yeah I just thought that we already have data as in logprobs, so the loss calculated is already in log

I think we want the loss (y axis) to be log scale.

jaidhyani · 2024-02-20T06:41:07Z

Yeah, I think you're right. Jett, thoughts? You mentioned wanting a log scale in an earlier comment, were you just referring to charting logprob?

…

On Mon, Feb 19, 2024, 6:01 PM Siwei Li ***@***.***> wrote: Yeah I just thought that we already have data as in logprobs, so the loss calculated is already in log 1. I think we want the loss (y axis) to be log scale. — Reply to this email directly, view it on GitHub <#35 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAC5BY5AC5BBIH5EEQNXG4TYUP7WVAVCNFSM6AAAAABDJMJ7RCVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTSNJTGM3DSNZSG4> . You are receiving this because your review was requested.Message ID: ***@***.***>

src/delphi/eval/vis_per_token_model.py

addressed

jettjaniak · 2024-02-20T18:03:01Z

just had a meeting with Sil, a few small bits to address and we can merge

siwei-li · 2024-02-20T18:05:39Z

Todos:

take log/linear kwarg options
coloring kwargs fo the graph
making dots larger

menamerai and others added 2 commits February 14, 2024 16:48

added mock data

714e90b

Add visualization function for per token model comparison

10e557b

siwei-li requested review from jettjaniak and menamerai February 15, 2024 01:07

siwei-li linked an issue Feb 15, 2024 that may be closed by this pull request

interactive performance plot #5

Closed

jettjaniak previously requested changes Feb 17, 2024

View reviewed changes

jettjaniak changed the title ~~5 per token performance plot~~ performance plot Feb 17, 2024

Update plotting by using Plotly

5e0352b

siwei-li force-pushed the 5-per-token-performance-plot branch from 4536eb1 to 5e0352b Compare February 19, 2024 05:03

siwei-li requested a review from jaidhyani February 19, 2024 05:09

jaidhyani approved these changes Feb 19, 2024

View reviewed changes

jettjaniak reviewed Feb 20, 2024

View reviewed changes

src/delphi/eval/vis_per_token_model.py Outdated Show resolved Hide resolved

Siwei Li and others added 2 commits February 20, 2024 19:10

Update the arguments for visualization styling

aeb0ba6

Merge branch 'main' into 5-per-token-performance-plot

3cf1f71

siwei-li merged commit 2d73bfe into main Feb 21, 2024
1 check passed

jettjaniak deleted the 5-per-token-performance-plot branch May 22, 2024 10:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

performance plot #35

performance plot #35

siwei-li commented Feb 15, 2024

jettjaniak left a comment

siwei-li commented Feb 19, 2024 •

edited

Loading

jaidhyani commented Feb 19, 2024

jaidhyani commented Feb 20, 2024

siwei-li commented Feb 20, 2024

jaidhyani commented Feb 20, 2024 via email

jettjaniak commented Feb 20, 2024

siwei-li commented Feb 20, 2024

performance plot #35

performance plot #35

Conversation

siwei-li commented Feb 15, 2024

jettjaniak left a comment

Choose a reason for hiding this comment

siwei-li commented Feb 19, 2024 • edited Loading

jaidhyani commented Feb 19, 2024

jaidhyani commented Feb 20, 2024

siwei-li commented Feb 20, 2024

jaidhyani commented Feb 20, 2024 via email

jettjaniak commented Feb 20, 2024

siwei-li commented Feb 20, 2024

siwei-li commented Feb 19, 2024 •

edited

Loading