caps: support characters with diacritics (e.g. ÄÖÜ) in caps words #59

antonmosich · 2022-02-23T11:33:53Z

Currently the caps filter only recognizes caps words written entirely with A-Z. That works fine for English texts, but causes problems as soon as you switch to German or French, for example. While looking into this, I noticed that it is quite complicated to do with regular expressions using re. The regex package might help, since it lets you use \p{Lu} to match all uppercase Unicode characters, but that would add another dependency.
Another possibility might be to somehow use Python's .isupper() method on string objects, which does work with characters with diacritics.
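
A minimal sketch of the .isupper() idea, using only the standard library. It is not the filter's actual code: the names WORD_RE and find_caps_words, and the minimum length of two characters, are assumptions made for illustration. The pattern picks out runs of word characters that are neither digits nor underscores (roughly, letters), and str.isupper() then decides whether a run counts as a caps word. It also assumes the input uses precomposed characters (NFC), since re does not treat combining marks as word characters.

```python
import re

# Hypothetical pattern: runs of two or more word characters that are
# neither digits nor underscores. In Python 3, \W and \d are
# Unicode-aware for str patterns, so this covers letters like Ä, Ö, Ü, É.
WORD_RE = re.compile(r"[^\W\d_]{2,}")


def find_caps_words(text):
    """Return all letter runs in `text` that are entirely uppercase.

    Hypothetical helper: str.isupper() is True only if every cased
    character is uppercase and there is at least one cased character.
    """
    return [m.group() for m in WORD_RE.finditer(text) if m.group().isupper()]


sample = "Die ÜBERSICHT und das ÖL sind NEU, aber nicht das Übrige."
print(find_caps_words(sample))  # ['ÜBERSICHT', 'ÖL', 'NEU']
```

This avoids the extra dependency, at the cost of matching candidate words first and filtering them in Python rather than expressing everything in a single pattern as \p{Lu} would allow.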
