You could write your code to query the Unicode database, which you can download from the Unicode Consortium (or query via the character utility, but that's presumably rate-limited). The database includes data on which glyphs are "confusables" for other glyphs. For instance, your 𝓱 is U+1D4F1, which has many confusables, one of which is of course the standard Latin lowercase h (U+0068). So you could go through each character in the input string, look it up, and if it has a Latin a-z confusable (perhaps 0-9 as well), replace it with that.
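The lookup-and-replace loop described above could be sketched roughly as follows. This is a minimal sketch, not a definitive implementation: it assumes the confusables data uses a "source ; target ; type # comment" layout with hexadecimal code points, and the function names (parse_confusables, normalize) and the embedded SAMPLE_DATA are hypothetical stand-ins for the real downloaded file.

```python
import string

# Illustrative lines in the assumed "source ; target ; type # comment" layout.
# This is a hand-written sample, not an excerpt of the real data file.
SAMPLE_DATA = """\
# comment lines and blank lines are skipped
1D4F1 ; 0068 ; MA # MATHEMATICAL BOLD SCRIPT SMALL H -> LATIN SMALL LETTER H
0430 ; 0061 ; MA # CYRILLIC SMALL LETTER A -> LATIN SMALL LETTER A
2460 ; 0028 0031 0029 ; MA # hypothetical multi-character target, ignored below
"""

# Only replacements that land on a-z or 0-9 are kept.
ALLOWED = set(string.ascii_lowercase + string.digits)


def parse_confusables(text):
    """Build a {confusable char: ASCII char} table from confusables-style text."""
    table = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if not line:
            continue
        fields = [f.strip() for f in line.split(";")]
        if len(fields) < 2:
            continue
        source = "".join(chr(int(cp, 16)) for cp in fields[0].split())
        target = "".join(chr(int(cp, 16)) for cp in fields[1].split())
        # Keep only one-to-one mappings onto the allowed ASCII set.
        if len(source) == 1 and len(target) == 1 and target in ALLOWED:
            table[source] = target
    return table


def normalize(s, table):
    """Replace each character that has an allowed confusable; keep the rest."""
    return "".join(table.get(ch, ch) for ch in s)


table = parse_confusables(SAMPLE_DATA)
print(normalize("\U0001D4F1ello", table))
```

With the real file, the same parser would be fed the downloaded text instead of SAMPLE_DATA. Restricting the table to single-character targets keeps the replacement a simple per-character lookup.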
Regular expression to allow all alphabet characters plus unicode characters
By : Gourav Singal
Date : March 29 2020, 07:55 AM
If you have a finite and short set of elements to replace, you could just use a character class, e.g.
Use fromEnum to convert a Char to an Int representing its code point. The opposite conversion is done by toEnum (e.g. toEnum 65 :: Char gives 'A'). Alternatively, use ord and chr from the Data.Char module.