Dashboard performance issues #1540

jgrau · 2024-11-13T21:58:40Z

I've been looking into slow dashboard performance in production and want to share my findings, in case you weren't aware of the bottlenecks we're seeing or can spot some low-hanging fruit.

Here's a typical trace of a request to /good_job/jobs:

The load time is about 20s, all of it spent on database queries. The most expensive query is:

SELECT "good_jobs".*, "pg_locks".locktype, "pg_locks"."pid" = pg_backend_pid() AS owns_advisory_lock
FROM "good_jobs"
LEFT JOIN pg_locks
  ON pg_locks.locktype = $1
  AND pg_locks.objsubid = $2
  AND pg_locks.classid = ($3 || substr(md5($4 || $5 || "good_jobs"."id"::text), $6, $7))::bit(32)::int
  AND pg_locks.objid = (($8 || substr(md5($9 || $10 || "good_jobs"."id"::text), $11, $12))::bit(64) << $13)::bit(32)::int
ORDER BY COALESCE(scheduled_at, created_at) DESC, id DESC
LIMIT $14

Running EXPLAIN on that query yields this result:

Limit  (cost=414045.20..414045.27 rows=25 width=1016)
  ->  Sort  (cost=414045.20..419319.12 rows=2109565 width=1016)
        Sort Key: (COALESCE(good_jobs.scheduled_at, good_jobs.created_at)) DESC, good_jobs.id DESC
        ->  Hash Left Join  (cost=15.02..354514.80 rows=2109565 width=1016)
              Hash Cond: (((((('x'::text || substr(md5(('good_jobs-'::text || (good_jobs.id)::text)), 1, 16)))::bit(32))::integer)::oid = l.classid) AND ((((((('x'::text || substr(md5(('good_jobs-'::text || (good_jobs.id)::text)), 1, 16)))::bit(64) << 32))::bit(32))::integer)::oid = l.objid))
              ->  Seq Scan on good_jobs  (cost=0.00..270116.65 rows=2109565 width=975)
              ->  Hash  (cost=15.00..15.00 rows=1 width=44)
                    ->  Function Scan on pg_lock_status l  (cost=0.00..15.00 rows=1 width=44)
                          Filter: ((locktype = 'advisory'::text) AND (objsubid = 1))
JIT:
  Functions: 17
  Options: Inlining false, Optimization false, Expressions true, Deforming true

Which visualises as

As I understand it, the sequential scan is the culprit, and I believe the scan happens because of COALESCE(good_jobs.scheduled_at, good_jobs.created_at). On my system only scheduled_at is indexed, not created_at. Shouldn't created_at be indexed as well?
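
A note on the indexing question: a separate index on created_at would probably not help by itself, because the sort key is the expression COALESCE(scheduled_at, created_at) rather than either column. An index on that exact expression is the kind a planner can use to skip the sort. The sketch below illustrates the effect with Python's built-in sqlite3 module as a stand-in for PostgreSQL. That is an assumption made to keep the example self-contained, and PostgreSQL's planner and EXPLAIN output differ. The table is a cut-down, hypothetical version of good_jobs and the index names are made up.

```python
import sqlite3

# SQLite stands in for PostgreSQL here; the schema is a hypothetical,
# cut-down good_jobs table and all index names are illustrative.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE good_jobs (
        id INTEGER PRIMARY KEY,
        scheduled_at TEXT,
        created_at TEXT NOT NULL
    );
    -- Plain single-column indexes, including the proposed one on created_at.
    CREATE INDEX idx_scheduled_at ON good_jobs (scheduled_at);
    CREATE INDEX idx_created_at ON good_jobs (created_at);
""")

QUERY = """
    SELECT * FROM good_jobs
    ORDER BY COALESCE(scheduled_at, created_at) DESC, id DESC
    LIMIT 25
"""

def plan(sql):
    """Return the human-readable detail column of EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    return [row[3] for row in rows]

# With only the plain indexes, the planner has to sort every row.
plan_before = plan(QUERY)

# An index on the exact ORDER BY expression lets it read rows in order.
conn.execute("""
    CREATE INDEX idx_coalesced ON good_jobs (
        COALESCE(scheduled_at, created_at) DESC, id DESC
    )
""")
plan_after = plan(QUERY)

print("before:", plan_before)
print("after: ", plan_after)
```

In PostgreSQL the analogous step would be an expression index on COALESCE(scheduled_at, created_at). Whether the planner would use it for the full dashboard query, with the pg_locks join still in place, would need checking with EXPLAIN on a real database.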

bensheldon · 2024-11-13T23:49:46Z

Oof! That's slow. I think this is easier now with GoodJob v4:

The pg_locks join could be replaced with a locked_at IS NOT NULL query.
The COALESCE can be replaced with good_jobs.scheduled_at alone, because all jobs should now have a scheduled_at regardless of whether they are expected to run immediately or in the future.
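
A rough sketch of what the simplified query might look like once both changes are made. It uses Python's built-in sqlite3 module as a stand-in for PostgreSQL so that it runs anywhere. The cut-down table, the three sample rows, and the is_locked alias are made up for illustration, and this is not GoodJob's actual query.

```python
import sqlite3

# SQLite stands in for PostgreSQL; the schema, rows, and the is_locked alias
# are hypothetical and only illustrate the shape of the simplified query.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE good_jobs (
        id INTEGER PRIMARY KEY,
        scheduled_at TEXT NOT NULL,  -- assumed always set, as in GoodJob v4
        locked_at TEXT               -- NULL while no process holds the job
    );
    CREATE INDEX idx_scheduled_at ON good_jobs (scheduled_at);
    INSERT INTO good_jobs (id, scheduled_at, locked_at) VALUES
        (1, '2024-11-13 10:00:00', NULL),
        (2, '2024-11-13 11:00:00', '2024-11-13 11:00:05'),
        (3, '2024-11-13 12:00:00', NULL);
""")

# No join against pg_locks and no COALESCE: the lock state is read from a
# column, and the sort key is a plain column.
QUERY = """
    SELECT id, scheduled_at, locked_at IS NOT NULL AS is_locked
    FROM good_jobs
    ORDER BY scheduled_at DESC, id DESC
    LIMIT 25
"""
rows = conn.execute(QUERY).fetchall()

for job_id, scheduled_at, is_locked in rows:
    print(job_id, scheduled_at, "locked" if is_locked else "unlocked")
```

Because the sort key is now a single column, an ordinary index on scheduled_at is the kind of index that can serve the ORDER BY directly, without any per-row md5 computation or a hash join against pg_locks.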

Would you want to try making a PR? Otherwise I can get to it ;-)

jgrau mentioned this issue Nov 14, 2024

Remove unneeded include of pg_locks in query when displaying jobs table #1541

Merged
