Something that may be useful is an explicit ability to assert a UTF-8 clean environment, and then have the interpreter operate on that basis:
os.environ
and sys.argv
)At the moment, Python's desire to be tolerant of environmental configuration errors makes it difficult to enforce consistency when you actually want it (treating any deviations as an error in the environment rather than in Python).