How can Java code extract the hyperlink from an a tag?
This post shares a way to pull the URL out of an a (anchor) tag, as shown below.
Approach for extracting the URL from an a tag:

import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class LinkExtractor {

    // Extract every URL found in a string
    public static List<String> extractUrls(String text) {
        List<String> containedUrls = new ArrayList<String>();
        // The hyphen sits at the end of the character class so it is a literal;
        // written as "\\+-=" it would form a range that also accepts '<'
        String urlRegex = "((https?|ftp|gopher|telnet|file):((//)|(\\\\))+[\\w\\d:#@%/;$()~_?+=,\\\\.&-]*)";
        Pattern pattern = Pattern.compile(urlRegex, Pattern.CASE_INSENSITIVE);
        Matcher urlMatcher = pattern.matcher(text);
        while (urlMatcher.find()) {
            containedUrls.add(urlMatcher.group());
        }
        return containedUrls;
    }

    // Extract the text content of the a tag (the text between '>' and '<')
    public static List<String> extractTitle(String text) {
        List<String> titles = new ArrayList<String>();
        String titleRegex = ">(.*?)<";
        Pattern pattern = Pattern.compile(titleRegex);
        Matcher titleMatcher = pattern.matcher(text);
        while (titleMatcher.find()) {
            // Group 1 is the captured text without the surrounding '>' and '<'
            titles.add(titleMatcher.group(1));
        }
        return titles;
    }
}
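
The methods above scan the whole string, so extractUrls also picks up URLs that are not inside an href, and it misses relative links such as /docs/index.html. As a complementary sketch, the pattern below targets the href attribute of the a tag directly and captures the link text in the same pass. The class name AnchorLinkExtractor and its method names are hypothetical and chosen only for illustration. It assumes reasonably well-formed markup with quoted attribute values; a regex is not a full HTML parser.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper class; the names here are illustrative, not from the original post.
final class AnchorLinkExtractor {

    // Matches <a ... href="value" ...>text</a>.
    // Group 1 = the opening quote (' or "), group 2 = the href value, group 3 = the inner text.
    private static final Pattern ANCHOR = Pattern.compile(
            "<a\\b[^>]*?\\bhref\\s*=\\s*([\"'])(.*?)\\1[^>]*>(.*?)</a>",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Returns the href value of every quoted-href a tag, in document order.
    static List<String> extractHrefs(String html) {
        List<String> hrefs = new ArrayList<String>();
        Matcher m = ANCHOR.matcher(html);
        while (m.find()) {
            hrefs.add(m.group(2));
        }
        return hrefs;
    }

    // Returns the trimmed inner text of every quoted-href a tag, in document order.
    static List<String> extractLinkTexts(String html) {
        List<String> texts = new ArrayList<String>();
        Matcher m = ANCHOR.matcher(html);
        while (m.find()) {
            texts.add(m.group(3).trim());
        }
        return texts;
    }
}
```

For example, calling AnchorLinkExtractor.extractHrefs on a fragment that contains two links returns the two href values as written in the markup, including relative ones. For untrusted or messy HTML, an HTML parsing library is usually the safer choice.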