Add formatters for built-in values #19

lthibault · 2021-03-20T00:23:24Z

In the spirit of #16, this PR adds formatting options for built-in values.

By convention:

%s returns an s-expression
%#s returns a human-readable representation

I considered inverting the two, but wanted to stay consistent with error formatting, where # means "pretty".

⏱️ Estimated review time: 15 min.
❌ WIP. Do not merge.

lthibault · 2021-03-20T20:53:23Z

@spy16 I'm kinda struggling to sensibly define the formatting directives for Slurp values, and would welcome your input.

As I see it, there are basically three special formatting cases for values (as distinct from errors):

Serialization (s-expressions)
I want an exact representation of the value that can be parsed by a reader. I care about exact correctness.
Pretty-Printing
When calling (print x), the representation of x's value is intended purely for human consumption. Strings should be printed without quotes (i.e. "foo" and 'foo should both print as foo). Large containers should be output in full, albeit split across multiple lines and indented for readability.
REPL output
REPL output is a mix between serialization and pretty-printing. I want a convenient, human-readable representation of a value that preserves some information about type and contents. I should notably be able to distinguish a string from a symbol, i.e. "foo" and 'foo should print as "foo" and foo, respectively. It is however desirable to summarize large containers. For example, a very long vector could be shortened to [0 1 2 3 4 5 6 7 8 9 ...].

It seems like we can distill this behavior down to three orthogonal dimensions:

Rendering vs Printing: rendering preserves type information; printing elides it.
Representing vs Summarizing: representing displays the whole value; summarizing displays part of the value.
Minification vs Formatting: minifying produces a compact output; formatting produces a multiline/indented output.

(These labels are a bit awkward, but bear with me...)

So returning to the use-cases at the beginning of this comment:

Case 1 requires rendering a minified representation.
Case 2 requires printing a formatted representation.
Case 3 requires rendering a minified summary.

An edge case would be applying case 2 to a container such as a vector. In this case, it seems like we should not elide type information for members of the container. Thus, rendering ["foo" 'bar] should produce ["foo" bar] and not [foo bar].

Assuming you agree with this assessment, the next question is "how do we encode this in formatting directives?". One approach is:

"%s" => print a minified representation => produces an s-expression
"%p" => render a minified representation => produces human-readable output with type-elision

From there, we could treat the $ modifier to mean "summarize", thus:

"%$s" => print a minified summary => produces a summarized s-expression
"%$p" => render a minified summary => produces human-readable summary output with type elision

Formatting can be achieved via #, thus:

"%#s" => print a formatted summary => produces a formatted s-expression
"%#p" => render a formatted summary => produces human-readable, formatted output with type elision
"%#$s" => print a formatted summary => produces a formatted, summarized s-expression
"%#$p" => render a formatted summary => produces human-readable, formatted summary output with type elision

Thoughts?

spy16 · 2021-05-19T14:01:51Z

Sorry for the long delay in getting back.

I am bit confused here to be honest 😅 .. I see only 3 cases like you initially listed down (serialisation, pretty-printing, REPL output).. Even in those cases, serialisation and REPL output are similar except for the fact that REPL output should have some pretty formatting (like shortening long container)..

serialisation -- use %s -- "foo", 'foo, [1 2 3 4 "foo" :bar 'baz] etc. (parsable)
pretty-print -- use %v -- foo, foo, [1 2 3 4 "foo" :bar 'baz] etc. (non-containers are unquoted, values inside containers should be quoted i think)
repl output -- use %#s or %$s -- "foo", 'foo, [1 2 3 ...] etc. -- here $ or # acts as the "pretty" modifier for serialisation.

Note: we could use v for serialisation and s for human readable string as well. in which case, the directives will be %v, %s and %#v respectively. other combination is to use %v which will be shortened serialisation (i.e., REPL output) and %+v which will be non-shortened version (i.e., serialisation) and %s will be human readable.

My only concern with so many combinations of printing is the added complexity & confusion. Hence, the suggestion to not distinguish symbol representation when serialised vs in REPL output (i.e., use 'foo for both)

lthibault · 2021-05-19T15:18:49Z

Hi @spy16 ! No worries, it's been a busy couple of months for me as well.

I am bit confused here to be honest 😅 .

To be honest ... me too 😝 . It seems I was overthinking this quite a bit, and looking for orthogonality where there is none.

In the hopes of restoring some clarity, let me just restate my wishlist. Using fmt, I want to be able to:

serialize a form into a parseable s-expression.
pretty-print a form so that I can e.g. visually inspect a big container.
print a summary rendering of a form that fits in one line. Losing information is okay here.

My only concern with so many combinations of printing is the added complexity & confusion. Hence, the suggestion to not distinguish symbol representation when serialised vs in REPL output (i.e., use 'foo for both)

I like this. Also, I'm partial to %v for serialization and %s for a pretty-printed string. The v evokes "whole value". I prefer to use # as the modifier, since it's standard.

So in summary:

serialisation -- use %v -- "foo", 'foo, [1 2 3 4 "foo" :bar 'baz] etc. (parsable)
pretty-print -- use %s -- foo, foo, [1 2 3 4 "foo" :bar 'baz] etc. (non-containers are unquoted, values inside containers should be quoted i think)
repl output -- use %#s -- "foo", 'foo, [1 2 3 ...] etc. -- here $ or # acts as the "pretty" modifier for serialisation.

Sound good?

spy16 · 2021-05-21T08:43:50Z

Yes. That sounds good 👍

lthibault added 5 commits March 19, 2021 19:23

Remove SExpressable

a57940b

Add formatting for builtin values.

667a49f

Change formatting options to be consistent with error-formatting.

e9edcf2

Multiple bugfixes.

a0fd0b3

Add tests for list & vector (failing).

1da8c64

lthibault added the enhancement New feature or request label Mar 20, 2021

lthibault added this to the v1.0 milestone Mar 20, 2021

lthibault requested a review from spy16 March 20, 2021 00:23

lthibault self-assigned this Mar 20, 2021

lthibault marked this pull request as draft March 20, 2021 00:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add formatters for built-in values #19

Add formatters for built-in values #19

lthibault commented Mar 20, 2021

lthibault commented Mar 20, 2021

spy16 commented May 19, 2021 •

edited

lthibault commented May 19, 2021

spy16 commented May 21, 2021

Add formatters for built-in values #19

Are you sure you want to change the base?

Add formatters for built-in values #19

Conversation

lthibault commented Mar 20, 2021

lthibault commented Mar 20, 2021

spy16 commented May 19, 2021 • edited

lthibault commented May 19, 2021

spy16 commented May 21, 2021

spy16 commented May 19, 2021 •

edited