More treelist utility functions #5106

AlexKnauth · 2024-11-21T20:08:31Z

Checklist

Bugfix
Feature
tests included
documentation

Description of change

Adds 5 new treelist functions:

treelist-filter: like list filter
treelist-index-of: like list index-of
treelist-flatten: like list flatten
treelist-append*: like list append* on one argument
sequence->treelist: like list sequence->list

jackfirth · 2024-11-21T20:24:58Z

Can you also add sequence->treelist?

AlexKnauth · 2024-11-21T21:40:08Z

sequence->treelist added

jackfirth · 2024-11-21T22:11:43Z

Two performance notes on sequence->treelist:

If the input is already a treelist, return it unchanged without iterating at all. That way functions can accept any sequence and convert it to a treelist immediately without any cost for users who already have a treelist.
If the input is a list, call the list->treelist function. That way there's no performance difference between sequence->treelist and list->treelist for users.

AlexKnauth · 2024-11-22T00:06:37Z

Added shortcuts for conversions to treelist from treelist, vector, and list.

mflatt · 2024-11-22T00:17:43Z

What's the rationale for adding these particular functions? Some of these seem arbitrary to me, liketreelist-keep-members, but maybe there's some precedent or rationale that I'm missing.

AlexKnauth · 2024-11-22T00:25:58Z

These are the list functions that I needed to re-implement for some projects I've been writing in Rhombus. Both keep-members and skip-members came out of wanting to use set intersect and subtract operations on lists in Rhombus, but not wanting to call the subtract operation subtract to avoid potential confusion on duplicate elements.

(edit: and of course, not being able to call the subtract operation remove* due to Rhombus identifier constraints)

mflatt · 2024-11-22T00:55:44Z

These are the list functions that I needed to re-implement for some projects I've been writing in Rhombus.

I'm having trouble seeing this as a rationale for adding to a core library. It seems like we should generally avoid adding things to this library, since everyone who loads racket/treeelist will pay for them.

I can see an argument for adding a few more functions that are known from experience to be very broadly useful. The treelist-flatten and sequence->treelist functions are probably in that category, and maybe treelist-index-of.

As you may guess, I'm not a fan of the way racket/list evolved to have such a large set of functions. A lot of them should have been organized by task, and not in racket/list just because they work on lists.

jackfirth · 2024-11-22T01:02:57Z

Perhaps sequence-index-of and sequence-index-where would be more valuable additions?

AlexKnauth · 2024-11-22T01:06:20Z

I don't think including the functionality of remove* in racket/list was a mistake. It's a very common thing to do with lists, and I think Rhombus should have an operation with the same behavior. I considered other names like remove_all and subtract, but... I didn't want to let people think it was the same as repeatedly calling remove for 1 element at a time.

(edit: remove* is included in racket/base, not racket/list, but my point about it being a very common operation still stands)

AlexKnauth · 2024-11-22T02:26:34Z

Perhaps if a filter method is implemented with ~keep and ~skip keywords, use cases for a.keep_members(b) and a.skip_members(b) could be replaced with a.filter(~keep: b.has_element(_)) and a.filter(~skip: b.has_element(_)).

mflatt · 2024-11-22T11:27:00Z

Switching to treelist-filter is a great improvement! I support adding treelist-filter, treelist-flatten, sequence->treelist and (less strongly) treelist-index-of. I don't think we should add treelist-index-where, treelist-splitf, or treelist-flatten-once.

jackfirth · 2024-11-22T11:40:08Z

I'm not a fan of making treelist-filter work differently from filter, vector-filter, hash-filter, sequence-filter, etc. It should take a single predicate argument rather than separate #:keep and #:skip keyword arguments.

mflatt · 2024-11-22T14:13:48Z

I'm not a fan of making treelist-filter work differently from filter, ...

Good point. I think the Rhombus filter convention should use ~keep and ~skip, but maybe that doesn't belong at the racket/treelist level.

AlexKnauth · 2024-11-22T14:33:45Z

I think treelist-index-where deserves to be here more than treelist-index-of. Use cases for l.index_of(v) can easily be expressed with l.index_where((_ == v)), but it's not quite as easy for use cases of index-where to be translated to index-of.

I have needed to use splitf operations far more often than other related functions like split, take, drop, takef, and dropf. Though if we keep index-where, at least implementing splitf in terms of index-where isn't hard.

As for treelist-flatten-once, list append* on a single argument is an operation I need far more often than flatten, and it's more "type-safe" in that it works to convert list-of-lists of any type A to a list of A consistently, even when values of type A might be lists. flatten-once is a bit of an awkward name. Would treelist-append* fit better? I didn't want to imply that (treelist-append* tlotl) is just equivalent to (apply treelist-append tlotl) though, since apply doesn't work on treelists. Or treelist-concat? treelist-join?

I like the #:keep and #:skip arguments to treelist-filter for the same reason expressed in racket/rhombus#131 (comment): If someone forgets to use (or has not yet learned to use) #:keep, the error message will make it clear. It also allows the arguments in either order, avoiding potential order inconsistency like hash-filter vs filter. It replaces 2 operations with just 1, and (treelist-filter #:skip pred tl) is a lot nicer than (treelist-filter (lambda (x) (not (pred x))) tl), especially when pred is just a function name.

mflatt · 2024-11-22T21:38:20Z

To get a sense of what should be in a Racket library, I tried grepping all packages from a January snapshot, where each grep was of the form ''[ ()]map[ ()]' (but with map replaced). Checking things like map and append provides scale.

map            21930
append          8262
list-ref        4684
filter          3675
member          3075
memq            2464
for-each        2047
take            2197
drop            1120
flatten          816
apply append     735  ; literally, so on the same line
partition        497
sequence->list   476
append*          374
shuffle          316
add-between      280
drop-right       291
memv             272
index-of         227
filter-map       211
filter-not       202
group-by         187
argmax           154
argmin           140
splitf-at        101
check-duplicates  97
take-right        96
index-where       50
remf              15
indexes-of         6
takef-right        6
splitf-at-right    5
indexes-where      3

These are just rough counts, of course, given the grep approach. A number like 6 means basically unused, except that there are things that track or copy "list.rkt".

I read this as confirming that index-of is borderline, and splitf-at and index-where are significantly more rare. It's true that index-of can be written with index-where, but if the point is to provide useful things directly, index-of seems like the one that might be worthwhile.

The append* function and apply append combination do show up a lot. This is is an operation where treelists are different from lists, though, so I'm not sure it's right to conclude that it will be as useful to treelists. If we just go by these numbers, then I'd concede that treelist-append* belongs.

The partition function shows up more than I expected (just because I never use it). I'd say it's borderline, but more worthwhile than other things we're considering.

The shuffle function is useful, but I'd say it belongs in a different module.

The relative rareness of filter-not is a little bit of an argument against keywords for treelist-filter. I still think the stronger argument is to stay like filter to keep Racket libraries more consistent.

My current conclusions on what to include:

treelist-filter - yes (not with keywords)
treelist-append* - yes
treelist-flatten - yes
treelist-partition - probably
sequence->treelist - probably
treelist-index-of - maybe
treelist-index-where - no
treelist-splitf - no

Further Rhombus discussion probably belongs in the other repo, but I would be inclined to omit treelist-append* in favor of writing List.append(& lists) (and, of course, have keyword arguments for List.filter for consistency with other forms).

AlexKnauth · 2024-11-22T21:50:40Z

Would you be happier if functions that split the list into 2 values, such as partition, split, and splitf, were placed in a separate module like racket/treelist/split? Just thinking of that split because not everyone likes dealing with multiple values.

The index operations like index-of and index-where have greater potential for treelists than they did for pairlists because random access by index is more efficient for treelists: should that bump them into the main module? Or should that be separated into a racket/treelist/index module?

mflatt · 2024-11-23T14:17:08Z

As much as I like the treelist datatype, I don't see so many Racket programs moving to them that we need racket/treelist/X libraries, so far. (It looks like 3 non-Rhombus packages use them, based on a freshened package snapshot.) That may change in the long run, but Rhombus seems like a better place to experiment with list-library organizations. Adding to Racket means forever, but Rhombus is free to change, at least for for a little while.

With that in mind, maybe it's best to add only treelist-filter, treelist-append*, and treelist-flatten, because we agree that they're useful, and because list-function uses support that impression (with the caveat that append might be used differently with treelist). Or maybe it's best to not add anything to racket/treelist, since that isn't necessary to add functions to Rhombus — or even particularly useful in the short run, since we wouldn't want to bump the required Racket version for Rhombus.

mflatt · 2024-11-23T20:49:35Z

pkgs/racket-doc/scribblings/reference/treelists.scrbl

+(treelist-filter odd? (treelist 1 2 3 2 4 5 2))
+(treelist-filter (λ (x) (not (even? x))) (treelist 1 2 3 2 4 5 2))
+(treelist-filter (λ (x) (not (odd? x))) (treelist 1 2 3 2 4 5 2))
+]}


Needs a history note (and the same for the other additions)

I've added history notes for 8.15.0.6, the current version. Is that okay, or should this include a version bump to 8.15.0.7?

I think "8.15.0.6" is good.

mflatt · 2024-11-23T20:50:23Z

pkgs/racket-test-core/tests/racket/treelist.rktl

@@ -56,6 +56,9 @@
                                                                      (regexp-quote (symbol->string 'op))
                                                                      ":"))))

+(define-syntax-rule (test-values expected actual)


It looks like this is not used, anymore.

AlexKnauth added 3 commits November 21, 2024 13:30

More treelist utility functions

c3bd6ee

Test treelist utility functions

1e6edf7

Document treelist utility functions

197feae

Add sequence->treelist

00a8ede

AlexKnauth added 3 commits November 21, 2024 17:43

require racket/stream

5f55001

Shortcut same->same sequence conversions

6a76da6

sequence->treelist: special case vector, list

9780dbe

treelist skip-members, keep-members -> filter

d6bc609

AlexKnauth added 3 commits November 23, 2024 09:47

Remove index-where, splitf

9e2622b

Rename flatten-once -> append*

ebc2143

Remove #:keep and #:skip keywords

c0bd1a1

AlexKnauth force-pushed the treelist-util branch from cf40fda to c0bd1a1 Compare November 23, 2024 15:10

mflatt reviewed Nov 23, 2024

View reviewed changes

AlexKnauth added 2 commits November 23, 2024 19:14

Remove test-values

5cee9f2

Add history notes

62e2b39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More treelist utility functions #5106

More treelist utility functions #5106

AlexKnauth commented Nov 21, 2024 •

edited

Loading

jackfirth commented Nov 21, 2024

AlexKnauth commented Nov 21, 2024

jackfirth commented Nov 21, 2024

AlexKnauth commented Nov 22, 2024

mflatt commented Nov 22, 2024

AlexKnauth commented Nov 22, 2024 •

edited

Loading

mflatt commented Nov 22, 2024

jackfirth commented Nov 22, 2024

AlexKnauth commented Nov 22, 2024 •

edited

Loading

AlexKnauth commented Nov 22, 2024

mflatt commented Nov 22, 2024

jackfirth commented Nov 22, 2024

mflatt commented Nov 22, 2024

AlexKnauth commented Nov 22, 2024 •

edited

Loading

mflatt commented Nov 22, 2024

AlexKnauth commented Nov 22, 2024

mflatt commented Nov 23, 2024

mflatt Nov 23, 2024

AlexKnauth Nov 24, 2024

mflatt Nov 24, 2024

mflatt Nov 23, 2024

More treelist utility functions #5106

Are you sure you want to change the base?

More treelist utility functions #5106

Conversation

AlexKnauth commented Nov 21, 2024 • edited Loading

Checklist

Description of change

jackfirth commented Nov 21, 2024

AlexKnauth commented Nov 21, 2024

jackfirth commented Nov 21, 2024

AlexKnauth commented Nov 22, 2024

mflatt commented Nov 22, 2024

AlexKnauth commented Nov 22, 2024 • edited Loading

mflatt commented Nov 22, 2024

jackfirth commented Nov 22, 2024

AlexKnauth commented Nov 22, 2024 • edited Loading

AlexKnauth commented Nov 22, 2024

mflatt commented Nov 22, 2024

jackfirth commented Nov 22, 2024

mflatt commented Nov 22, 2024

AlexKnauth commented Nov 22, 2024 • edited Loading

mflatt commented Nov 22, 2024

AlexKnauth commented Nov 22, 2024

mflatt commented Nov 23, 2024

mflatt Nov 23, 2024

Choose a reason for hiding this comment

AlexKnauth Nov 24, 2024

Choose a reason for hiding this comment

mflatt Nov 24, 2024

Choose a reason for hiding this comment

mflatt Nov 23, 2024

Choose a reason for hiding this comment

AlexKnauth commented Nov 21, 2024 •

edited

Loading

AlexKnauth commented Nov 22, 2024 •

edited

Loading

AlexKnauth commented Nov 22, 2024 •

edited

Loading

AlexKnauth commented Nov 22, 2024 •

edited

Loading