@frankdilo, @Olaf-: Unfortunately, those URLs are not valid according to the RFCs.
RFC 3986: host = IP-literal / IPv4address / reg-name
where reg-name = *( unreserved / pct-encoded / sub-delims )
and unreserved = ALPHA / DIGIT / "-" / "." / "_" / "~"
and, from RFC 2234, ALPHA = %x41-5A / %x61-7A ; A-Z / a-z
As you can see, a domain name cannot contain non-ASCII characters (letters with accents, hooks, and so on).
I am open to discussion, but I would suggest a workaround: convert all characters to ASCII, use URLExtract to find the URLs and their positions, and then extract the URLs from the original text at those positions.
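A minimal sketch of that workaround, under some assumptions: every non-ASCII character is replaced by a single ASCII placeholder so the masked text has exactly the same length as the original, which means positions found in the masked text can be used to slice the original. The helper names (`mask_non_ascii`, `find_urls_in_original`, `regex_spans`) are hypothetical, and `regex_spans` is only a simple stand-in for the URL finder so the example runs without extra dependencies; it is not how URLExtract detects URLs.

```python
import re


def mask_non_ascii(text, placeholder="x"):
    """Return an ASCII-only copy of `text` with the same length.

    Non-ASCII whitespace becomes a plain space so neighbouring words are
    not glued together; every other non-ASCII character becomes `placeholder`.
    """
    out = []
    for ch in text:
        if ch.isascii():
            out.append(ch)
        elif ch.isspace():
            out.append(" ")
        else:
            out.append(placeholder)
    return "".join(out)


def find_urls_in_original(text, find_spans):
    """Find URLs in the masked text, then slice them out of the original.

    `find_spans` is any callable that takes the masked text and returns
    (start, end) index pairs; it is an assumption of this sketch.
    """
    masked = mask_non_ascii(text)
    return [text[start:end] for start, end in find_spans(masked)]


# Stand-in finder (hypothetical): matches dotted host-like tokens only.
_HOST = re.compile(r"[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+")


def regex_spans(masked):
    return [match.span() for match in _HOST.finditer(masked)]


urls = find_urls_in_original("Visit сайт.com today", regex_spans)
```

To use URLExtract instead of the stand-in, `find_spans` could wrap a call that returns positions, for example `find_urls(masked, get_indices=True)` if the installed version supports that option. Note that the placeholder may change how the masked host looks to the extractor, so results should be checked against real input.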
URLExtract
does not match this URL as it should: сайт.com