FunASR/fun_text_processing/text_normalization/es
zhifu gao 861147c730
Dev gzf exp (#1654)
2024-04-24 16:03:38 +08:00
data Dev gzf exp (#1654) 2024-04-24 16:03:38 +08:00
taggers Dev gzf exp (#1654) 2024-04-24 16:03:38 +08:00
verbalizers Dev gzf exp (#1654) 2024-04-24 16:03:38 +08:00
__init__.py Dev gzf exp (#1654) 2024-04-24 16:03:38 +08:00
graph_utils.py Dev gzf exp (#1654) 2024-04-24 16:03:38 +08:00
README.md FunTextProcessing 2022-12-22 13:28:07 +08:00
utils.py Dev gzf exp (#1654) 2024-04-24 16:03:38 +08:00

Localization Note

Depending on locale, Spanish number strings vary in formatting. In the EU and South American countries, it is common to use a period (".") or a space to delineate groupings of three digits, e.g.

`1.000.000` -> "un millón"

`1 000 000` -> "un millón"

and commas (",") to separate cardinal and decimal strings, e.g.

`1,00` -> "uno coma cero cero"

Central and North America instead use commas (",") to delineate groupings of three digits, e.g.

`1,000,000` -> "un millón"

and periods (".") to separate cardinal and decimal strings, e.g.

`1.00` -> "uno coma cero cero"

Because accepting both forms at once would create inherent ambiguity for verbalization, this module defaults to the former formatting (periods for cardinal grouping and commas for decimals).
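The ambiguity can be seen with a small standalone sketch. It does not use this module's grammars. The `SEPARATORS` table, the `parse_number` helper, and the `'eu'` label for the default convention are assumptions made for illustration only. The point is that the same string, `1.000`, denotes one thousand under one convention and one (with three decimal places) under the other.

```python
from decimal import Decimal

# Hypothetical separator table for illustration; not the module's actual data.
# 'am' is the value named in this README; 'eu' is an assumed label for the default.
SEPARATORS = {
    "eu": {"group": (".", " "), "decimal": ","},
    "am": {"group": (",",), "decimal": "."},
}


def parse_number(text: str, localization: str = "eu") -> Decimal:
    """Parse a digit string under the given convention ('eu' or 'am')."""
    conv = SEPARATORS[localization]
    # Drop the grouping separators first, then normalize the decimal mark.
    for sep in conv["group"]:
        text = text.replace(sep, "")
    return Decimal(text.replace(conv["decimal"], "."))


print(parse_number("1.000", "eu"))  # 1000
print(parse_number("1.000", "am"))  # 1.000
```

Since no single reading satisfies both conventions, a normalizer has to commit to one of them up front, which is why a default is chosen.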

To switch to the alternate formatting, set the LOCALIZATION variable in fun_text_processing.text_normalization.es.__init__ to 'am'. This makes the necessary adjustments in all affected classes.
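As a rough sketch, and assuming the variable is a plain module-level string assignment (the rest of that file is not shown here), the edit would look like this:

```python
# In fun_text_processing/text_normalization/es/__init__.py
# Assumed form of the assignment; only the variable name and the value 'am'
# come from this README.
LOCALIZATION = "am"  # commas group digits, periods mark decimals
```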