Improvements to `$&here` w/r/t forking #160

jpco · 2024-12-16T18:22:45Z

First of all, this rewrites $&here so that it more closely matches the other pipefork()ing primitives in prim-io.c, rather than the REDIR()-using primitives. PRIM(here) has aspects in common with both sets of primitives (with this PR, $&here is the only pipefork()-style primitive which uses a defer_*() function), but I think that, taken as a whole, $&here "fits" better with the pipefork() style.

The pipefork() style also makes it more straightforward to actually wait for the forked-off process. We now do that, so $&here no longer leaks child processes. This fixes #150.
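As a rough illustration of the fork-then-wait pattern described above, here is a minimal sketch in plain POSIX C. It is not the code from this PR: `fork_doc_writer()` and `reap_doc_writer()` are hypothetical names, and the sketch uses `pipe()`, `fork()`, `write()` and `waitpid()` directly where es would use its own helpers such as pipefork(), ewrite() and ewaitfor().

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper: fork a child that writes doc into a fresh pipe.
 * Returns the read end of the pipe (or -1 on error) and stores the
 * child's pid in *pid so the caller can wait for it later. */
int fork_doc_writer(const char *doc, size_t len, pid_t *pid) {
    int p[2];
    assert(doc != NULL && pid != NULL);
    if (pipe(p) == -1)
        return -1;
    *pid = fork();
    if (*pid == -1) {
        close(p[0]);
        close(p[1]);
        return -1;
    }
    if (*pid == 0) {
        /* Child: write the whole document, then exit. */
        size_t off = 0;
        close(p[0]);
        while (off < len) {
            ssize_t n = write(p[1], doc + off, len - off);
            if (n == -1) {
                if (errno == EINTR)
                    continue;
                _exit(1);
            }
            off += (size_t)n;
        }
        _exit(0);
    }
    /* Parent: keep only the read end. */
    close(p[1]);
    return p[0];
}

/* Hypothetical helper: reap the writer so it does not linger as a zombie.
 * Returns the child's exit status, or -1 if it did not exit normally. */
int reap_doc_writer(pid_t pid) {
    int status;
    while (waitpid(pid, &status, 0) == -1)
        if (errno != EINTR)
            return -1;
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

The point of returning the pid is that whoever consumes the read end can call `reap_doc_writer()` once it is finished with the document; skipping that step is what leaves child processes behind.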

Lastly, we now compare the doc length to PIPE_BUF and, if the doc is smaller, just write the whole doc to the new pipe without forking at all. POSIX requires PIPE_BUF to be at least 512 bytes, and Linux uses 4096, but I expect that even the smaller value covers a large majority of real-world heredoc uses. (Technically PIPE_BUF is the maximum size of an atomic write rather than the pipe capacity of a system, and some shells, bash at least, actively probe the actual capacity at build time, but PIPE_BUF makes for a pretty reasonable lower bound, and, again, I think this lower bound catches most cases.) Not all OSes define PIPE_BUF (the Hurd, for example, does not); in that case we just fall back to always forking.

The following shows the performance impact of removing the forks in the best case, where no other forks are involved:

$ time es -c 'echo <={for (i = `{seq 1 10000}) {%read <<< herestring.^$i\n}}'
herestring.10000

real	0m4.671s
user	0m0.091s
sys	0m1.894s
$ time ./es -c 'echo <={for (i = `{seq 1 10000}) {%read <<< herestring.^$i\n}}'
herestring.10000

real	0m0.097s
user	0m0.026s
sys	0m0.071s

Even when writing to a fork/exec'd binary, the difference is fairly significant (about a 26% reduction in real time):

[jpco@jpco es-fork]$ time es -c '{for (i = `{seq 1 10000}) {cat <<< herestring.^$i\n}} > /dev/null'

real	0m15.968s
user	0m8.739s
sys	0m6.845s
[jpco@jpco es-fork]$ time ./es -c '{for (i = `{seq 1 10000}) {cat <<< herestring.^$i\n}} > /dev/null'

real	0m11.789s
user	0m7.499s
sys	0m3.977s

jpco added 5 commits December 14, 2024 22:11

ewaitfor() the forked process in $&here

5e24588

Ref() things

6f9b444

Do not fork in order to write docs shorter than PIPE_BUF

1fb7225

Small fixes; don't assume PIPE_BUF is always present (though it should be) and use ewrite() instead of write()

581030f

Just use pid to track if a fork happened

ccea671
