ex: "32 ½ is not very hot " to x = "info: 32, numerator = 1, denominator = 2"
Note: it could be 3/9
, but it cannot be simplified into 1/3
aka literally get what is in the string.
I need to detect the fractional string in a longer string and expand the information to a more usable form.
½
has been given to me decoded and is a string with length one.
There seem to be 19 such forms (here) and they all start with the name VULGAR FRACTION.
import unicodedata
def fraction_finder(s):
for c in s:
try:
name = unicodedata.name(c)
except ValueError:
continue
if name.startswith('VULGAR FRACTION'):
normalized = unicodedata.normalize('NFKC', c)
numerator, _slash, denominator = normalized.partition('⁄')
yield c, int(numerator), int(denominator)
Demo:
>>> s = "32 ½ is not very hot "
>>> print(*fraction_finder(s))
('½', 1, 2)