Is there a way to replace all types of hyphens by the simple ascii "-"? I am looking for something like this that works for spaces:
txt = re.sub(r'[\s]+',' ',txt)
I believe that some non-ascii "-" hyphens are avoiding the correct process of removing some specific stopwords (name of projects that are connected by hyphens):
I want to replace this AR–L1003' for instance by AR-L1003, but I want to do this for the entire text.