[Python-ideas] Format mini-language for lakh and crore

Nick Coghlan ncoghlan at gmail.com
Mon Jan 29 02:13:29 EST 2018


On 29 January 2018 at 11:48, Nathaniel Smith <njs at pobox.com> wrote:
> On Sun, Jan 28, 2018 at 5:31 PM, David Mertz <mertz at gnosis.cx> wrote:
>> I actually didn't know about `locale.format("%d", 10e9, grouping=True)`.
>> But it's still much less general than having the option in the
>> f-string/.format() mini-language.  This is really about the formatted
>> string, not necessarily about the locale.  So, e.g. I'd like to be able to
>> write:
>>
>>>>> print(f"In European format x is {x:,.2f}, in Indian format it is
>>>>> {x:`.2f}")
>>
>> I don't want the format necessarily to be some pseudo-global setting, even
>> if it can get stored in thread-locals.  That said, having a locale-aware
>> symbol for delimiting numbers in the format mini-language would also not be
>> a bad thing.
>
> I don't understand the format mini-language well enough to know what
> would fit in, but maybe some way to (a) request localified formatting,

Given the example, I think a more useful approach would be to allow an
optional digit grouping specifier after the comma separator, and allow
the separator to be repeated to indicate non-uniform groupings in the
lower order digits.

If we did that, then David's example could become:

    >>> print(f"In European format x is {x:,.2f}, in Indian format it
is {x:,2,3.2f}")

The core elements of interpreting that would then be:

- digit group size specifiers are permited for both "," (decimal
display only) and "_" (all display bases)
- if no digit group size specifier is given, it defaults to 3 for
decimal and 4 for binary, octal, and hexadecimal
- if multiple digit group specifiers are given, then the last one
given is applied starting from the least significant integer digit

so "{x:,2,3.2f}" means:

- an arbitrary number of leading 2-digit groups
- 1 group of 3 digits
- 2 decimal places

It would then be reasonably straightforward to use this as a lower
level primitive to implement locale dependent formatting, as follows:

    - format in English using the locale's grouping rules [1] (either
LC_NUMERIC.grouping or LC_MONETARY.mon_grouping, as appropriate)
    - use str.translate() [2] to replace "," and "." with the locale's
thousands_sep & decimal_point or mon_thousands_sep & mon_decimal_point

[1] https://docs.python.org/3/library/locale.html#locale.localeconv
[2] https://docs.python.org/3/library/stdtypes.html#str.translate

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


More information about the Python-ideas mailing list