feat: 🎸 Use snapshots or OCR for E2E tests when comparing terminal #2639

ZedLi · 2024-12-21T00:29:27Z

Description

When reviewing this PR, please take a look at both commits and test both. The goal of this PR is to resolve this TODO as sometimes we want to interact with our integrated terminal. The main problem is that the terminal is built using a canvas element which makes it difficult to test with playwright as it's basically an image with no DOM elements to interact with. The particular problem we're trying to solve in this PR is knowing when the desktop client properly finishes making the SSH connection and is "done" when using the integrated terminal.

The first commit adds a golden screenshot to compare to what we expect a properly connected SSH session to look like. This of course can be quite brittle due to many factors (size of window, if the EC2 instance has new updates, timestamps, etc) but allowing a little bit of difference seems to work still. It's hard to say how brittle it is as the target machines can change so I'm curious if it works well for others. I had to use poll as it turns out the documentation is a bit misleading and toHaveScreenshot doesn't actually wait until they match and just compares screenshots after they have been stabilized, which is usually before the SSH connection finishes.

The second commit adds tesseract.js and takes a different approach by using an OCR package to interpret the screenshot of the canvas element that holds our terminal. I use a phrase that I expect to see every time we connect to an EC2 instance and have it keep checking to see if it appears. The main downside is the setup of the package adds a decent amount of time (~5s on my mac but I set it up so it only runs once per worker and should be reused) and a few more seconds to actually recognize the image in the test. In theory this should be less brittle but may also be unnecessary as it's decently slower.

How to Test

Check out each commit and try running yarn desktop or yarn desktop:dev

Checklist

~~[ ] I have added before and after screenshots for UI changes~~
~~[ ] I have added JSON response output for API changes~~
~~[ ] I have added steps to reproduce and test for bug fixes in the description~~
I have commented on my code, particularly in hard-to-understand areas
~~[ ] My changes generate no new warnings~~
I have added tests that prove my fix is effective or that my feature works

vercel · 2024-12-21T00:29:33Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
boundary-ui	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Feb 13, 2025 9:29pm
boundary-ui-desktop	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Feb 13, 2025 9:29pm

moduli

Some points of consideration

Terminal output may look different if using docker-based infrastructure vs. AWS-based infrastructure. Not sure if there are any test cases that need docker.
If using snapshots, it's possible that some people's personal shells may affect terminal output

ZedLi · 2024-12-26T21:30:50Z

Terminal output may look different if using docker-based infrastructure vs. AWS-based infrastructure. Not sure if there are any test cases that need docker.

I haven't used docker for anything yet in DC tests but my assumption was we would only be testing against targets that were spun up somewhere which I assumed would be EC2 instances when using SSH targets. What do you generally connect to for the docker based tests, localhost?

If using snapshots, it's possible that some people's personal shells may affect terminal output

Yup, the assumption was that the output from logging into an EC2 instance would scroll past it, which may not be true in all cases.

moduli · 2025-01-06T18:49:47Z

Terminal output may look different if using docker-based infrastructure vs. AWS-based infrastructure. Not sure if there are any test cases that need docker.

I haven't used docker for anything yet in DC tests but my assumption was we would only be testing against targets that were spun up somewhere which I assumed would be EC2 instances when using SSH targets. What do you generally connect to for the docker based tests, localhost?

If using snapshots, it's possible that some people's personal shells may affect terminal output

Yup, the assumption was that the output from logging into an EC2 instance would scroll past it, which may not be true in all cases.

On docker-based tests, we spin up a separate container to use as a target (https://hub.docker.com/r/linuxserver/openssh-server).

laurenolivia · 2025-01-07T16:33:45Z

@ZedLi interesting PR. Upon reading the PR description I'm wondering if a short video or screenshots would help review this one? I'm curious about the discovery that led to this work, is there a jira ticket for more context?

ZedLi · 2025-01-07T16:42:29Z

@ZedLi interesting PR. Upon reading the PR description I'm wondering if a short video or screenshots would help review this one? I'm curious about the discovery that led to this work, is there a jira ticket for more context?

I'm not sure video or screenshots would help as it's just the test running and waiting properly so it looks the same between them. The context is really just solving the issue in this TODO, I'll add a little bit more context in the description though!

laurenolivia · 2025-01-08T16:03:38Z

@ZedLi Thanks for adding context to the description.

RE: I had to use poll as it turns out the documentation is a bit misleading...

Can you link the docs you are referencing?

e2e-tests/desktop/fixtures/baseTest.js

ZedLi · 2025-01-08T17:00:30Z

Can you link the docs you are referencing?

I was just referencing the playwright docs for toHaveScreenshot where they mention:

This function will wait until two consecutive page screenshots yield the same result, and then compare the last screenshot with the expectation.

which is not quite accurate.

calcaide

Thanks for the work!!!

I do personally don't like to rely on screenshots for testing, but as you already mention and explain, there isn't much of an option for this specific case.

DhariniJeeva · 2025-01-10T18:55:11Z

e2e-tests/desktop/fixtures/tesseractTest.js

+      await use(worker);
+      await worker.terminate();
+    },
+    { scope: 'worker' },


how is this scope being used?

Here's the doc for worker scopes

hashicc

Testing request changes

Testing

hashicc

Looks good, one small non-blocking suggestion.

I thought of another option that isn't as "real" as this for testing but might work. Taking the last x amount of characters and attaching it to a data-testId and asserting against that. This might not be performant in the case a lot of scrolling text in which case it could be debounced before updating the DOM attribute. But that wouldn't actually test that it made it to the terminal itself.

I'm still leaning towards favoring the OCR option but thought I'd mention it

e2e-tests/desktop/fixtures/tesseractTest.js

moduli

Tried running it locally. It failed one time due to the following

  1) tests/sessions.spec.js:213:3 › Filtering sessions tests › Filters by status ───────────────────

    Error: expect(received).toMatch(expected)

    Expected pattern: /To run a command as administrator|Welcome to OpenSSH Server/
    
    Received string:  "in vscode, nvm not work; use °load-nvm®·
    ssh 127.0.0.1 -p 57748 -o NoHostAuthenticationForLocalhost=yes·
    [oh-my-zsh] Would you like to update? [Y/n]·
    [oh-my-zsh] You can update manually by running “omz update”
    ET FE FPF
    ) sh 127.0.0.1 -p 57748 -o NoHostAuthenticationForLocalhost=yes·
    sh: 127.0.0.1: No such file or directory
    PT FE FPF
    >
    "

It looks like, due to my setup, the terminal prompted to update something, so the first s in the ssh command was used to respond to that prompt. It's just something specific to me, and that prompt doesn't appear every time, but just wanted to share. When that prompt doesn't appear, it looks like things are working great.

ZedLi · 2025-02-12T15:53:36Z

I thought of another option that isn't as "real" as this for testing but might work. Taking the last x amount of characters and attaching it to a data-testId and asserting against that. This might not be performant in the case a lot of scrolling text in which case it could be debounced before updating the DOM attribute. But that wouldn't actually test that it made it to the terminal itself.

How would this work in practice? That is, how are you envisioning the "taking the last x amount of characters? Wouldn't this require OCR to be able to take the characters or am I misunderstanding?

calcaide

Run the tests manually successful! Thanks for the work 🙌

lisbet-alvarez · 2025-02-14T03:05:16Z

e2e-tests/desktop/fixtures/tesseractTest.js

+  tesseract: [
+    async ({}, use) => {
+      const worker = await createWorker('eng', 1, {
+        cachePath: './artifacts',


question for learning purposes: On test failures what type of data is being stored in the artifacts folder? Based on the the docs it says traineddata but not exactly sure what that means in this context.

lisbet-alvarez

The OCR options def. seems to be less brittle on my end (those tests passed with no issues). I was not able to get test cases to pass using the snapshot comparison option.

ZedLi self-assigned this Dec 21, 2024

ZedLi requested a review from a team as a code owner December 21, 2024 00:29

vercel bot deployed to Preview – boundary-ui-desktop December 21, 2024 00:33 View deployment

vercel bot deployed to Preview – boundary-ui December 21, 2024 00:33 View deployment

moduli reviewed Dec 23, 2024

View reviewed changes

laurenolivia reviewed Jan 8, 2025

View reviewed changes

e2e-tests/desktop/fixtures/baseTest.js Outdated Show resolved Hide resolved

calcaide previously approved these changes Jan 8, 2025

View reviewed changes

DhariniJeeva reviewed Jan 10, 2025

View reviewed changes

ZedLi dismissed calcaide’s stale review via d4f8b04 January 21, 2025 23:53

vercel bot deployed to Preview – boundary-ui-desktop January 21, 2025 23:54 View deployment

vercel bot deployed to Preview – boundary-ui January 21, 2025 23:54 View deployment

ZedLi force-pushed the use-snapshots-for-e2e-tests branch from d4f8b04 to 7b92e70 Compare January 21, 2025 23:55

vercel bot deployed to Preview – boundary-ui-desktop January 21, 2025 23:57 View deployment

vercel bot deployed to Preview – boundary-ui January 21, 2025 23:57 View deployment

hashicc previously requested changes Jan 29, 2025

View reviewed changes

ZedLi requested a review from hashicc January 29, 2025 18:19

ZedLi force-pushed the use-snapshots-for-e2e-tests branch from 7b92e70 to 06c33d8 Compare January 29, 2025 18:23

vercel bot deployed to Preview – boundary-ui-desktop January 29, 2025 18:25 View deployment

vercel bot deployed to Preview – boundary-ui January 29, 2025 18:25 View deployment

ZedLi force-pushed the use-snapshots-for-e2e-tests branch from 06c33d8 to 654b2db Compare February 11, 2025 23:06

vercel bot deployed to Preview – boundary-ui-desktop February 11, 2025 23:09 View deployment

vercel bot deployed to Preview – boundary-ui February 11, 2025 23:09 View deployment

ZedLi force-pushed the use-snapshots-for-e2e-tests branch from 654b2db to 1a52b04 Compare February 11, 2025 23:15

vercel bot deployed to Preview – boundary-ui-desktop February 11, 2025 23:17 View deployment

vercel bot deployed to Preview – boundary-ui February 11, 2025 23:17 View deployment

hashicc previously approved these changes Feb 12, 2025

View reviewed changes

e2e-tests/desktop/fixtures/tesseractTest.js Outdated Show resolved Hide resolved

moduli reviewed Feb 12, 2025

View reviewed changes

calcaide previously approved these changes Feb 12, 2025

View reviewed changes

ZedLi dismissed stale reviews from calcaide and hashicc via eaad9da February 13, 2025 17:30

ZedLi force-pushed the use-snapshots-for-e2e-tests branch from 1a52b04 to eaad9da Compare February 13, 2025 17:30

vercel bot deployed to Preview – boundary-ui-desktop February 13, 2025 17:32 View deployment

vercel bot deployed to Preview – boundary-ui February 13, 2025 17:32 View deployment

ZedLi force-pushed the use-snapshots-for-e2e-tests branch from eaad9da to f0f68ea Compare February 13, 2025 18:42

vercel bot deployed to Preview – boundary-ui-desktop February 13, 2025 18:45 View deployment

vercel bot deployed to Preview – boundary-ui February 13, 2025 18:45 View deployment

ZedLi added 2 commits February 13, 2025 16:26

feat: 🎸 Use snapshots for E2E tests when comparing terminal

e49f15c

feat: 🎸 add OCR to interpret terminal instead

79c741a

ZedLi force-pushed the use-snapshots-for-e2e-tests branch from f0f68ea to 79c741a Compare February 13, 2025 21:27

vercel bot deployed to Preview – boundary-ui-desktop February 13, 2025 21:29 View deployment

vercel bot deployed to Preview – boundary-ui February 13, 2025 21:29 View deployment

lisbet-alvarez reviewed Feb 14, 2025

View reviewed changes

lisbet-alvarez approved these changes Feb 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: 🎸 Use snapshots or OCR for E2E tests when comparing terminal #2639

feat: 🎸 Use snapshots or OCR for E2E tests when comparing terminal #2639

ZedLi commented Dec 21, 2024 •

edited

Loading

vercel bot commented Dec 21, 2024 •

edited

Loading

moduli left a comment

ZedLi commented Dec 26, 2024

moduli commented Jan 6, 2025

laurenolivia commented Jan 7, 2025

ZedLi commented Jan 7, 2025

laurenolivia commented Jan 8, 2025

ZedLi commented Jan 8, 2025

calcaide left a comment

DhariniJeeva Jan 10, 2025

ZedLi Jan 10, 2025

hashicc left a comment

hashicc left a comment

moduli left a comment

ZedLi commented Feb 12, 2025

calcaide left a comment

lisbet-alvarez Feb 14, 2025

lisbet-alvarez left a comment •

edited

Loading

feat: 🎸 Use snapshots or OCR for E2E tests when comparing terminal #2639

Are you sure you want to change the base?

feat: 🎸 Use snapshots or OCR for E2E tests when comparing terminal #2639

Conversation

ZedLi commented Dec 21, 2024 • edited Loading

Description

How to Test

Checklist

vercel bot commented Dec 21, 2024 • edited Loading

moduli left a comment

Choose a reason for hiding this comment

ZedLi commented Dec 26, 2024

moduli commented Jan 6, 2025

laurenolivia commented Jan 7, 2025

ZedLi commented Jan 7, 2025

laurenolivia commented Jan 8, 2025

ZedLi commented Jan 8, 2025

calcaide left a comment

Choose a reason for hiding this comment

DhariniJeeva Jan 10, 2025

Choose a reason for hiding this comment

ZedLi Jan 10, 2025

Choose a reason for hiding this comment

hashicc left a comment

Choose a reason for hiding this comment

hashicc left a comment

Choose a reason for hiding this comment

moduli left a comment

Choose a reason for hiding this comment

ZedLi commented Feb 12, 2025

calcaide left a comment

Choose a reason for hiding this comment

lisbet-alvarez Feb 14, 2025

Choose a reason for hiding this comment

lisbet-alvarez left a comment • edited Loading

Choose a reason for hiding this comment

ZedLi commented Dec 21, 2024 •

edited

Loading

vercel bot commented Dec 21, 2024 •

edited

Loading

lisbet-alvarez left a comment •

edited

Loading