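In Java, you can use Pattern and Matcher to extract a string of characters from HTML code. The idea is to use a regular expression that matches the text sitting between HTML tags, as in the following code: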
package com.qiu.lin.he;

import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class Ceshi {
    public static void main(String[] args) {
        String string = "<span class=\"123class\">75757574</span></div><span class=\"123class\">12312341243</span></div>";
        // Match a run of digits enclosed directly between '>' and '<'
        Pattern pattern = Pattern.compile(">(\\d+)<");
        Matcher matcher = pattern.matcher(string);
        if (matcher.find()) {
            // Rewind so the loop below also reports the first match
            matcher.reset();
            while (matcher.find()) { // a matching substring was found
                // group(1) is the captured digits, without the surrounding '>' and '<'
                System.out.println("hit: " + matcher.group(1));
            }
        } else {
            System.out.println("[ERROR] NOT FOUND!");
        }
    }
}
The output is as follows:
hit: 75757574
hit: 12312341243
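
The same matching loop can also be wrapped in a reusable method that returns the hits as a list instead of printing them. The following is only a minimal sketch under some assumptions: the class name DigitExtractor and the method extractDigits are hypothetical names introduced here for illustration, not part of the original code, and the pattern still only handles digit runs placed directly between '>' and '<'. For arbitrary real-world HTML, a dedicated HTML parser is usually more robust than a regular expression.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class DigitExtractor {
    // Compiled once and reused; Pattern instances are immutable and thread-safe.
    private static final Pattern DIGITS_BETWEEN_TAGS = Pattern.compile(">(\\d+)<");

    // Hypothetical helper (an assumption, not from the original post):
    // returns every digit run found directly between '>' and '<', in order.
    static List<String> extractDigits(String html) {
        List<String> hits = new ArrayList<>();
        Matcher matcher = DIGITS_BETWEEN_TAGS.matcher(html);
        while (matcher.find()) {
            // group(1) holds only the digits, not the surrounding '>' and '<'
            hits.add(matcher.group(1));
        }
        return hits;
    }

    public static void main(String[] args) {
        String html = "<span class=\"123class\">75757574</span></div>"
                + "<span class=\"123class\">12312341243</span></div>";
        List<String> hits = extractDigits(html);
        if (hits.isEmpty()) {
            System.out.println("[ERROR] NOT FOUND!");
        } else {
            for (String hit : hits) {
                System.out.println("hit: " + hit);
            }
        }
    }
}
```

Returning a list makes the "nothing found" case a simple isEmpty() check, so the find(), reset(), find() sequence is no longer needed.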