Replies: 1 comment
-
I think it's not a Miller task, but a regular expression task. Only as example, if you run echo "andleben-leymann.de#http://landleben-leymann.de
b-elektronikfirma.com/index.php?
dailymetcon.it/M
www.vidamon.at/
www.wasserspender-deutschland.de/index.html" | grep -oE '\b(https?://|www\.)\S+' you get
So you mostly need to find the best regex or various regex to apply. In Miller in example, starting from this input
you could run
to get
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm trying to process this document with Miller: https://www.watchlist-internet.at/index.php?id=120
And have the following command so far:
Which works, however there are weird records in the list (or edge cases) that need to be handled, like these:
In the first example, the URL has a unique domain I'd like to include in the output. As for the others, simply splitting on "/" would be sufficient. But how specifically can I handle the edge cases in the same Miller command?
Beta Was this translation helpful? Give feedback.
All reactions