美文网首页
字符当前类型判断(中文)

字符当前类型判断(中文)

作者: 汤瓜瓜 | 来源:发表于2016-11-14 10:21 被阅读20次
判断当前文本是否包含中文

issue:此方法只能判断部分CJK字符(CJK统一汉字)

public boolean isChineseWord(String str) {   
    Pattern p = Pattern.compile("[\u4e00-\u9fa5]");    
    Matcher m = p.matcher(str);    
    return m.matches();
}

issue:完美判断是否包含中文方法(根据Unicode编码完美的判断中文汉字和符号)

private static boolean isChinese(char c) {
     Character.UnicodeBlock ub = Character.UnicodeBlock.of(c);
      if (ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS || 
          ub == Character.UnicodeBlock.CJK_COMPATIBILITY_IDEOGRAPHS|| 
          ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS_EXTENSION_A || 
          ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS_EXTENSION_B|| 
          ub == Character.UnicodeBlock.CJK_SYMBOLS_AND_PUNCTUATION ||
          ub == Character.UnicodeBlock.HALFWIDTH_AND_FULLWIDTH_FORMS||
          ub == Character.UnicodeBlock.GENERAL_PUNCTUATION) {
                return true;
      }
      return false;
}

相关文章

网友评论

      本文标题:字符当前类型判断(中文)

      本文链接:https://www.haomeiwen.com/subject/jtdxpttx.html