Character classification

Question

A simple question again: given a std::string, determine which of its characters are digits, symbols, whitespace, etc. with respect to the user's language and regional settings (locale).

I managed to split the string into characters using the Boost.Locale boundary analysis tools:

#include <boost/locale.hpp>
#include <string>

// UTF-8 text (a u8 literal can initialize a std::string only before C++20)
std::string text = u8"生きるか死ぬか";

boost::locale::boundary::segment_index<std::string::const_iterator> characters(
    boost::locale::boundary::character,
    text.begin(), text.end(),
    boost::locale::generator()("ja_JP.UTF-8"));

for (const auto& ch : characters) {
    // each 'ch' is one segment: a single character of the Japanese text
}

However, I do not see any way to go further and determine whether ch is a digit, a symbol, or anything else. There are Boost string classification algorithms, but they do not seem to work with whatever *segment_index::iterator is.

std::isalpha(std::locale) cannot be used either, because I am not sure whether a Boost segment can be converted to char or wchar_t.

Is there any neat way to classify symbols?
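
To make the conversion problem concrete, here is a minimal sketch of one possible direction, not a confirmed solution: decode the UTF-8 bytes of a segment into code points and pass each one to the locale-aware std::isdigit / std::isalpha family as a wchar_t. The names decode_utf8, classify and CharClass are hypothetical helpers invented for this illustration. The sketch assumes well-formed UTF-8, a wchar_t wide enough to hold the code point (32-bit, as on Linux), and one code point per segment, which does not hold for combining sequences.

```cpp
#include <cstddef>
#include <locale>
#include <string>
#include <vector>

// Hypothetical helper (not part of Boost or the standard library):
// decodes well-formed UTF-8 into code points. No validation is performed.
std::vector<char32_t> decode_utf8(const std::string& s) {
    std::vector<char32_t> out;
    for (std::size_t i = 0; i < s.size();) {
        unsigned char b = static_cast<unsigned char>(s[i]);
        // Sequence length is taken from the lead byte.
        std::size_t len = b < 0x80 ? 1 : (b >> 5) == 0x6 ? 2 : (b >> 4) == 0xE ? 3 : 4;
        // Keep only the payload bits of the lead byte.
        char32_t cp = len == 1 ? b : b & (0xFF >> (len + 1));
        // Append the low six bits of each continuation byte.
        for (std::size_t k = 1; k < len && i + k < s.size(); ++k)
            cp = (cp << 6) | (static_cast<unsigned char>(s[i + k]) & 0x3F);
        out.push_back(cp);
        i += len;
    }
    return out;
}

// Hypothetical result type for this illustration.
enum class CharClass { Digit, Alpha, Space, Punct, Other };

// Classifies one code point using the ctype<wchar_t> facet of 'loc'.
// Assumption: wchar_t can represent 'cp' (not true for non-BMP
// characters where wchar_t is 16-bit, e.g. on Windows).
CharClass classify(char32_t cp, const std::locale& loc) {
    wchar_t wc = static_cast<wchar_t>(cp);
    if (std::isdigit(wc, loc)) return CharClass::Digit;
    if (std::isalpha(wc, loc)) return CharClass::Alpha;
    if (std::isspace(wc, loc)) return CharClass::Space;
    if (std::ispunct(wc, loc)) return CharClass::Punct;
    return CharClass::Other;
}
```

In the loop above, the bytes of each segment could be obtained as a std::string (the segment is a pair of iterators, and as far as I know it also offers a str() member), decoded with decode_utf8, and then passed to classify together with a locale such as std::locale("ja_JP.UTF-8"), provided that locale is installed on the system. How non-ASCII characters are classified depends on the platform's locale data, so the results may differ between systems.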

Recommended answer