-
Story
-
Resolution: Done
-
Major
-
None
-
None
In any textual representation we need to find:
- Host and domain names (RFC 1034/1035) - including node names, also in filenames
There are existing tools that do that, so evaluate whether we can leverage that - otherwise there are a bunch of regular expressions in sos-cleaner:
https://github.com/soscleaner/soscleaner/blob/master/soscleaner/soscleaner.py
https://sos.readthedocs.io/en/master/parsers.html
AC:
- well tested logic to detect all the above shapes and forms of host and domain names in a string