In Python 3.3 and above specifically, is it sufficient to check for orphan surrogates with a simple match such as:
re.search(r'[\uD800-\uDFFF]', s)
This is based on the assumption that every legal surrogate pair has already been represented as a single astral code point and therefore does not match, so that only the illegal (orphan) surrogates are left to be found. Or are there caveats and edge cases one needs to be aware of?
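
To make the question concrete, here is a minimal sketch of the check on a few inputs. The helper name has_surrogates and the sample strings are made up for illustration. It assumes CPython 3.3+, where a str is a sequence of code points. Note that the third case, a pair written as two separate \u escapes in a literal, appears to stay as two surrogate code points rather than being combined, so the regex matches it too:

```python
import re

# Matches any code point in the UTF-16 surrogate range.
SURROGATE_RE = re.compile(r'[\uD800-\uDFFF]')

def has_surrogates(s):
    """Return True if s contains any surrogate code point (hypothetical helper)."""
    return SURROGATE_RE.search(s) is not None

# An astral character is a single code point, so nothing matches.
print(has_surrogates('\U0001F600'))

# A lone high surrogate matches.
print(has_surrogates('\ud83d'))

# Two \u escapes in a literal are not merged into one astral code point;
# the string has length 2 and both halves match.
print(has_surrogates('\ud83d\ude00'))

# Undecodable bytes decoded with errors='surrogateescape' become
# lone surrogates in U+DC80..U+DCFF, which also match.
print(has_surrogates(b'\xff'.decode('utf-8', 'surrogateescape')))
```

So at least for these inputs the match fires on any surrogate code point present in the str, whether or not it would form a valid pair with its neighbour.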