std::regex_traits<CharT>::lookup_classname

template< class ForwardIt > char_class_type lookup_classname( ForwardIt first, ForwardIt last, bool icase = false ) const;

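If the character sequence [first, last) is the name of a valid character class in the currently imbued locale (that is, the string between [: and :] in a regular expression), returns the implementation-defined value representing that character class. Otherwise, returns zero.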

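If the parameter icase is true, the character class ignores character case. For example, the regular expression [:lower:] compiled with std::regex_constants::icase generates a call to lookup_classname() with [first, last) denoting the string "lower" and icase == true. That call returns the same bitmask as the call generated by the regular expression [:alpha:] with icase == false.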

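std::regex_traits<char> and std::regex_traits<wchar_t> always recognize the narrow and wide character class names listed below, respectively. The classifications returned (with icase == false) correspond to the matching classifications obtained from the std::ctype facet of the imbued locale, as follows: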

Narrow name	Wide name	std::ctype classification
"alnum"	L"alnum"	std::ctype_base::alnum
"alpha"	L"alpha"	std::ctype_base::alpha
"blank"	L"blank"	std::ctype_base::blank
"cntrl"	L"cntrl"	std::ctype_base::cntrl
"digit"	L"digit"	std::ctype_base::digit
"graph"	L"graph"	std::ctype_base::graph
"lower"	L"lower"	std::ctype_base::lower
"print"	L"print"	std::ctype_base::print
"punct"	L"punct"	std::ctype_base::punct
"space"	L"space"	std::ctype_base::space
"upper"	L"upper"	std::ctype_base::upper
"xdigit"	L"xdigit"	std::ctype_base::xdigit
"d"	L"d"	std::ctype_base::digit
"s"	L"s"	std::ctype_base::space
"w"	L"w"	std::ctype_base::alnum, with '_' optionally added

The classification returned for the string "w" may be exactly the same as the one for "alnum"; in that case isctype() adds '_' explicitly.

System-supplied locales may provide additional classifications such as "jdigit" or "jkanji" (in which case they are also accessible through std::wctype).

#include <cwctype>
#include <iostream>
#include <locale>
#include <regex>
 
// This custom regex traits uses wctype/iswctype to implement lookup_classname/isctype.
struct wctype_traits : std::regex_traits<wchar_t>
{
    using char_class_type = std::wctype_t;
 
    template<class It>
    char_class_type lookup_classname(It first, It last, bool = false) const
    {
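        // std::wctype takes a narrow string; class names are plain ASCII,
        // so narrowing each wchar_t to char is sufficient here.
        return std::wctype(std::string(first, last).c_str());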
    }
 
    bool isctype(wchar_t c, char_class_type f) const
    {
        return std::iswctype(c, f);
    }
};
 
int main()
{
    std::locale::global(std::locale("ja_JP.utf8"));
    std::wcout.sync_with_stdio(false);
    std::wcout.imbue(std::locale());
 
    std::wsmatch m;
    std::wstring in = L"風の谷のナウシカ";
    // matches all characters (they are classified as alnum)
    std::regex_search(in, m, std::wregex(L"([[:alnum:]]+)"));
    std::wcout << "alnums: " << m[1] << '\n'; // prints "風の谷のナウシカ"
    // matches only the katakana
    std::regex_search(in, m,
                      std::basic_regex<wchar_t, wctype_traits>(L"([[:jkata:]]+)"));
    std::wcout << "katakana: " << m[1] << '\n'; // prints "ナウシカ"
}

Output:

alnums: 風の谷のナウシカ
katakana: ナウシカ

See also

isctype	indicates membership in a character class (public member function)
wctype	looks up a character classification category in the current C locale (function)

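Parameters

first, last	-	a pair of iterators delimiting the sequence of characters that names a character class
icase	-	if true, the character classification ignores the distinction between upper and lower case
Type requirements
- `ForwardIt` must meet the requirements of LegacyForwardIterator.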
