22 テキスト処理

22.1 String オブジェクト

22.1.1 String コンストラクター

String コンストラクター:

%String% である。
グローバルオブジェクトの "String" プロパティの初期値である。
コンストラクターとして呼び出されたとき、新しい String オブジェクトを生成し初期化する。
コンストラクターではなく関数として呼び出されたとき、型変換を行う。
クラス定義の extends 句の値として利用できる。指定された String の挙動を継承したいサブクラスのコンストラクターは、サブクラスインスタンスを [[StringData]] 内部スロット付きで生成・初期化するために String コンストラクターへの super 呼び出しを含めなければならない。

22.1.1.1 String ( `value` )

この関数は呼び出されたとき以下の手順を実行する:

value が存在しないなら
1. s を空文字列とする。
それ以外の場合、
1. NewTarget が undefined でかつ value が Symbol なら、SymbolDescriptiveString(value) を返す。
2. s を ? ToString(value) とする。
NewTarget が undefined なら s を返す。
StringCreate(s, ? GetPrototypeFromConstructor(NewTarget, "%String.prototype%")) を返す。

22.1.2 String コンストラクターのプロパティ

String コンストラクター:

値 %Function.prototype% の [[Prototype]] 内部スロットを持つ。
以下のプロパティを持つ:

22.1.2.1 String.fromCharCode ( ...`codeUnits` )

この関数は残余引数 codeUnits を構成する任意個の引数で呼び出すことができる。

呼び出されたとき以下を行う:

result を空文字列とする。
codeUnits の各要素 next について
1. nextCU を ℝ(? ToUint16(next)) の数値を持つコードユニットとする。
2. result を result と nextCU の文字列連結とする。
result を返す。

この関数の "length" プロパティは 1_𝔽 である。

22.1.2.2 String.fromCodePoint ( ...`codePoints` )

この関数は残余引数 codePoints を構成する任意個の引数で呼び出すことができる。

呼び出されたとき以下を行う:

result を空文字列とする。
codePoints の各要素 next について
1. nextCP を ? ToNumber(next) とする。
2. nextCP が整数 Number でなければ RangeError 例外を投げる。
3. ℝ(nextCP) < 0 または ℝ(nextCP) > 0x10FFFF なら RangeError 例外を投げる。
4. result を result と UTF16EncodeCodePoint(ℝ(nextCP)) の文字列連結とする。
アサート: codePoints が空なら result は空文字列。
result を返す。

この関数の "length" プロパティは 1_𝔽 である。

22.1.2.3 String.prototype

String.prototype の初期値は String プロトタイプオブジェクトである。

このプロパティの属性は { [[Writable]]: false, [[Enumerable]]: false, [[Configurable]]: false } である。

22.1.2.4 String.raw ( `template`, ...`substitutions` )

この関数は可変個の引数で呼び出される。最初の引数が template、残りがリスト substitutions を構成する。

呼び出されたとき以下を行う:

substitutionCount を substitutions の要素数とする。
cooked を ? ToObject(template) とする。
literals を ? ToObject(? Get(cooked, "raw" )) とする。
literalCount を ? LengthOfArrayLike(literals) とする。
literalCount ≤ 0 なら空文字列を返す。
R を空文字列とする。
nextIndex を 0 とする。
繰り返し、
1. nextLiteralVal を ? Get(literals, ! ToString(𝔽(nextIndex))) とする。
2. nextLiteral を ? ToString(nextLiteralVal) とする。
3. R を R と nextLiteral の文字列連結とする。
4. nextIndex + 1 = literalCount なら R を返す。
5. nextIndex < substitutionCount なら
  1. nextSubVal を substitutions[nextIndex] とする。
  2. nextSub を ? ToString(nextSubVal) とする。
  3. R を R と nextSub の文字列連結とする。
6. nextIndex を nextIndex + 1 とする。

Note

この関数はタグ付きテンプレート (13.3.11) のタグ関数として使用することを意図している。その場合最初の引数は正しく整形されたテンプレートオブジェクトであり、残りが置換値となる。

22.1.3 String プロトタイプオブジェクトのプロパティ

String プロトタイプオブジェクトは以下を満たす:

%String.prototype% である。
String エキゾチックオブジェクトであり、そのようなオブジェクトに規定された内部メソッドを持つ。
値が空文字列の [[StringData]] 内部スロットを持つ。
初期値 +0_𝔽 の "length" プロパティを持ち、その属性は { [[Writable]]: false, [[Enumerable]]: false, [[Configurable]]: false } である。
[[Prototype]] 内部スロットの値は %Object.prototype% である。

特に明記されない限り、以下で定義される String プロトタイプオブジェクトのメソッドはジェネリックではなく、渡される this 値は String 値か、String 値に初期化された [[StringData]] 内部スロットを持つオブジェクトでなければならない。

22.1.3.1 String.prototype.at ( `index` )

O を this の値とする。
? RequireObjectCoercible(O) を実行する。
S を ? ToString(O) とする。
len を S の長さとする。
relativeIndex を ? ToIntegerOrInfinity(index) とする。
relativeIndex ≥ 0 なら
1. k を relativeIndex とする。
それ以外なら
1. k を len + relativeIndex とする。
k < 0 または k ≥ len なら undefined を返す。
S の k から k + 1 までの部分文字列を返す。

22.1.3.2 String.prototype.charAt ( `pos` )

Note 1

このメソッドは、このオブジェクトを String に変換した値のインデックス pos にあるコードユニットを含む 1 文字の String を返す。その位置に要素がなければ結果は空文字列となる。結果は String オブジェクトではなく String 値である。

pos が整数 Number なら x.charAt(pos) の結果は x.substring(pos, pos + 1) の結果と等価である。

呼び出し時に以下を行う:

O を this の値とする。
? RequireObjectCoercible(O)。
S を ? ToString(O) とする。
position を ? ToIntegerOrInfinity(pos) とする。
size を S の長さとする。
position < 0 または position ≥ size なら空文字列を返す。
S の position から position + 1 までの部分文字列を返す。

Note 2

このメソッドは意図的にジェネリックであり、this が String オブジェクトであることを要求しない。そのため他のオブジェクトへ転用できる。

22.1.3.3 String.prototype.charCodeAt ( `pos` )

Note 1

このメソッドは、このオブジェクトを String に変換した結果内のインデックス pos のコードユニットの数値 (0 以上 2¹⁶ 未満の整数 Number) を返す。その位置に要素がなければ NaN を返す。

呼び出し時に以下を行う:

O を this の値とする。
? RequireObjectCoercible(O)。
S を ? ToString(O) とする。
position を ? ToIntegerOrInfinity(pos) とする。
size を S の長さとする。
position < 0 または position ≥ size なら NaN を返す。
S のインデックス position のコードユニットの数値を表す Number を返す。

Note 2

このメソッドは意図的にジェネリックであり、this が String オブジェクトであることを要求しない。

22.1.3.4 String.prototype.codePointAt ( `pos` )

Note 1

このメソッドは 0x10FFFF_𝔽 以下の非負整数 Number を返し、これはこのオブジェクトを String に変換した結果のインデックス pos で始まる UTF-16 エンコードされたコードポイント (6.1.4) の数値である。その位置に要素がなければ undefined を返す。pos で有効なサロゲートペアが開始しなければ、その位置のコードユニットを返す。

呼び出し時の手順:

O を this の値とする。
? RequireObjectCoercible(O)。
S を ? ToString(O) とする。
position を ? ToIntegerOrInfinity(pos) とする。
size を S の長さとする。
position < 0 または position ≥ size なら undefined を返す。
cp を CodePointAt(S, position) とする。
𝔽(cp.[[CodePoint]]) を返す。

Note 2

このメソッドはジェネリックであり他オブジェクトに転用可能。

22.1.3.5 String.prototype.concat ( ...`args` )

Note 1

このメソッドは this の値 (String に変換) のコードユニットに、各引数を String に変換した結果のコードユニットを順に連結した String 値を返す。結果は String オブジェクトではなく String 値。

呼び出し時:

O を this の値とする。
? RequireObjectCoercible(O)。
S を ? ToString(O) とする。
R を S とする。
args の各要素 next について
1. nextString を ? ToString(next) とする。
2. R を R と nextString の文字列連結とする。
R を返す。

このメソッドの "length" は 1_𝔽。

Note 2

ジェネリックであり他オブジェクトに転用可能。

22.1.3.6 String.prototype.constructor

String.prototype.constructor の初期値は %String% である。

22.1.3.7 String.prototype.endsWith ( `searchString` [ , `endPosition` ] )

呼び出し時の手順:

O を this の値とする。
? RequireObjectCoercible(O)。
S を ? ToString(O) とする。
isRegExp を ? IsRegExp(searchString) とする。
isRegExp が true なら TypeError を投げる。
searchStr を ? ToString(searchString) とする。
len を S の長さとする。
endPosition が undefined なら pos を len とし、そうでなければ pos を ? ToIntegerOrInfinity(endPosition) とする。
end を pos を 0 と len の間にクランプした結果とする。
searchLength を searchStr の長さとする。
searchLength = 0 なら true を返す。
start を end - searchLength とする。
start < 0 なら false を返す。
substring を S の start から end の部分文字列とする。
substring が searchStr なら true を返す。
false を返す。

Note 1

endPosition - length(this) から始まる対応するコードユニット列が一致すれば true。

Note 2

最初の引数が RegExp の場合に例外を投げるのは将来の拡張余地のため。

Note 3

ジェネリックであり転用可能。

22.1.3.8 String.prototype.includes ( `searchString` [ , `position` ] )

呼び出し時:

O を this の値。
? RequireObjectCoercible(O)。
S を ? ToString(O)。
isRegExp を ? IsRegExp(searchString)。
isRegExp が true なら TypeError。
searchStr を ? ToString(searchString)。
pos を ? ToIntegerOrInfinity(position)。
アサート: position が undefined なら pos は 0。
len を S の長さ。
start を pos を 0 と len の間にクランプした結果。
index を StringIndexOf(S, searchStr, start)。
index が not-found なら false。
true を返す。

Note 1

position 以上の位置で searchString が部分文字列として現れれば true。

Note 2

RegExp の場合例外を投げる理由は将来拡張のため。

Note 3

ジェネリック。

22.1.3.9 String.prototype.indexOf ( `searchString` [ , `position` ] )

Note 1

position 以上で最初に現れるインデックスを返し、存在しなければ -1_𝔽。position が undefined なら +0_𝔽。

手順:

O を this。
? RequireObjectCoercible(O)。
S を ? ToString(O)。
searchStr を ? ToString(searchString)。
pos を ? ToIntegerOrInfinity(position)。
アサート: position が undefined なら pos は 0。
len を S の長さ。
start を pos を 0 と len の間にクランプした結果。
result を StringIndexOf(S, searchStr, start)。
result が not-found なら -1_𝔽 を返す。
𝔽(result) を返す。

Note 2

ジェネリック。

22.1.3.10 String.prototype.isWellFormed ( )

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
S を ? ToString(O)。
IsStringWellFormedUnicode(S) を返す。

22.1.3.11 String.prototype.lastIndexOf ( `searchString` [ , `position` ] )

Note 1

position 以下で最後に現れるインデックスを返し、存在しなければ -1_𝔽。position が undefined なら文字列長を仮定。

手順:

O に this の値を設定する。
? RequireObjectCoercible(O) を実行する。
S に ? ToString(O) を設定する。
searchStr に ? ToString(searchString) を設定する。
numPos に ? ToNumber(position) を設定する。
アサート: position が undefined の場合、numPos は NaN である。
numPos が NaN なら pos に +∞ を、そうでなければ pos に ! ToIntegerOrInfinity(numPos) を設定する。
len に S の長さを設定する。
searchLen に searchStr の長さを設定する。
len < searchLen の場合、-1_𝔽 を返す。
start に pos を 0 以上 len - searchLen 以下に制限した結果を設定する。
result に StringLastIndexOf(S, searchStr, start) を設定する。
result が not-found なら、-1_𝔽 を返す。
𝔽(result) を返す。

Note 2

ジェネリック。

22.1.3.12 String.prototype.localeCompare ( `that` [ , `reserved1` [ , `reserved2` ] ] )

ECMA-402 国際化 API を含む実装は ECMA-402 の規定に従う。含まない実装では次を用いる:

このメソッドは this 値 (String に変換した S) と that (String に変換した thatValue) のロケール依存比較の結果を NaN 以外の Number で返す。結果はホスト環境の現在のロケールの慣習に従うソート順を表し、S が thatValue の前なら負、後なら正、その他は 0（順序なし）となる。

比較の前に以下を行う:

O を this。
? RequireObjectCoercible(O)。
S を ? ToString(O)。
thatValue を ? ToString(that)。

第2・第3引数の意味は ECMA-402 仕様に定義され、未実装の場合他用途に用いてはならない。

実際の戻り値は追加情報符号化のため実装定義だが、このメソッドは全 String 上の全順序を与える一貫した比較子でなければならず、Unicode 標準の正規等価性を尊重し、正規等価な区別可能文字列の比較で +0_𝔽 を返さねばならない。

Note 1

2 引数関数を要求する Array.prototype.sort の引数に直接適切ではない。

Note 2

このメソッドはホスト環境の言語・ロケール機能を利用し得るが、常に Unicode の正規等価性を尊重する必要がある。以下はすべて +0_𝔽 を返さねばならない例である:

// Å ANGSTROM SIGN vs.
// Å LATIN CAPITAL LETTER A + COMBINING RING ABOVE
"\u212B".localeCompare("A\u030A")

// Ω OHM SIGN vs.
// Ω GREEK CAPITAL LETTER OMEGA
"\u2126".localeCompare("\u03A9")

// ṩ LATIN SMALL LETTER S WITH DOT BELOW AND DOT ABOVE vs.
// ṩ LATIN SMALL LETTER S + COMBINING DOT ABOVE + COMBINING DOT BELOW
"\u1E69".localeCompare("s\u0307\u0323")

// ḍ̇ LATIN SMALL LETTER D WITH DOT ABOVE + COMBINING DOT BELOW vs.
// ḍ̇ LATIN SMALL LETTER D WITH DOT BELOW + COMBINING DOT ABOVE
"\u1E0B\u0323".localeCompare("\u1E0D\u0307")

// 가 HANGUL CHOSEONG KIYEOK + HANGUL JUNGSEONG A vs.
// 가 HANGUL SYLLABLE GA
"\u1100\u1161".localeCompare("\uAC00")

正規等価性の定義と議論は Unicode Standard 2章・3章、UAX #15、UTN #5、および UTS #10 を参照。

Unicode 互換等価や互換分解は尊重しないことが推奨される。

Note 3

ジェネリックであり転用可能。

22.1.3.13 String.prototype.match ( `regexp` )

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
regexp が undefined でも null でもないなら
1. matcher を ? GetMethod(regexp, %Symbol.match%)。
2. matcher が undefined でなければ
  1. ? Call(matcher, regexp, « O ») を返す。
S を ? ToString(O)。
rx を ? RegExpCreate(regexp, undefined)。
? Invoke(rx, %Symbol.match%, « S ») を返す。

Note

ジェネリック。

22.1.3.14 String.prototype.matchAll ( `regexp` )

このメソッドは this を表す String に対して regexp で正規表現マッチを行い、マッチ結果を生成するイテレータを返す。各結果は最初の要素にマッチ全体、その後にキャプチャグループを含む配列。マッチしなければ結果を生成しない。

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
regexp が undefined でも null でもないなら
1. isRegExp を ? IsRegExp(regexp)。
2. isRegExp が true なら
  1. flags を ? Get(regexp, "flags")。
  2. ? RequireObjectCoercible(flags)。
  3. ? ToString(flags) が "g" を含まなければ TypeError。
3. matcher を ? GetMethod(regexp, %Symbol.matchAll%)。
4. matcher が undefined でなければ
  1. ? Call(matcher, regexp, « O ») を返す。
S を ? ToString(O)。
rx を ? RegExpCreate(regexp, "g")。
? Invoke(rx, %Symbol.matchAll%, « S ») を返す。

Note 1

このメソッドはジェネリックで、this が String オブジェクトである必要はない。

Note 2

String.prototype.split と同様に通常入力を破壊しないよう設計されている。

22.1.3.15 String.prototype.normalize ( [ `form` ] )

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
S を ? ToString(O)。
form が undefined なら f を "NFC" とする。
それ以外なら f を ? ToString(form)。
f が "NFC", "NFD", "NFKC", "NFKD" のいずれでもなければ RangeError。
ns を最新の Unicode Standard の正規化 (Normalization Forms) に従い S を f 指定の正規形に変換した String 値とする。
ns を返す。

Note

ジェネリック。

22.1.3.16 String.prototype.padEnd ( `maxLength` [ , `fillString` ] )

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
? StringPaddingBuiltinsImpl(O, maxLength, fillString, end) を返す。

22.1.3.17 String.prototype.padStart ( `maxLength` [ , `fillString` ] )

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
? StringPaddingBuiltinsImpl(O, maxLength, fillString, start) を返す。

22.1.3.17.1 StringPaddingBuiltinsImpl ( `O`, `maxLength`, `fillString`, `placement` )

The abstract operation StringPaddingBuiltinsImpl takes arguments O (an ECMAScript language value), maxLength (an ECMAScript language value), fillString (an ECMAScript language value), and placement (start or end) and returns either a normal completion containing a String or a throw completion. It performs the following steps when called:

S を ? ToString(O)。
intMaxLength を ℝ(? ToLength(maxLength))。
stringLength を S の長さ。
intMaxLength ≤ stringLength なら S を返す。
fillString が undefined なら fillString をコードユニット 0x0020 (SPACE) のみからなる String とする。
それ以外なら fillString を ? ToString(fillString)。
StringPad(S, intMaxLength, fillString, placement) を返す。

22.1.3.17.2 StringPad ( `S`, `maxLength`, `fillString`, `placement` )

The abstract operation StringPad takes arguments S (a String), maxLength (a non-negative integer), fillString (a String), and placement (start or end) and returns a String. It performs the following steps when called:

stringLength を S の長さ。
maxLength ≤ stringLength なら S。
fillString が空文字列なら S。
fillLen を maxLength - stringLength。
truncatedStringFiller を fillString を繰り返し連結し長さ fillLen に切り詰めた String。
placement が start なら truncatedStringFiller と S の連結を返す。
それ以外は S と truncatedStringFiller の連結を返す。

Note 1

maxLength は S の長さ未満にならないようクランプされる。

Note 2

fillString の既定は " " (0x0020 SPACE)。

22.1.3.17.3 ToZeroPaddedDecimalString ( `n`, `minLength` )

The abstract operation ToZeroPaddedDecimalString takes arguments n (a non-negative integer) and minLength (a non-negative integer) and returns a String. It performs the following steps when called:

S を n を 10 進数表記した文字列表現。
StringPad(S, minLength, "0", start) を返す。

22.1.3.18 String.prototype.repeat ( `count` )

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
S を ? ToString(O)。
n を ? ToIntegerOrInfinity(count)。
n < 0 または n = +∞ なら RangeError。
n = 0 なら空文字列。
S を n 回連結した String を返す。

Note 1

this の値を繰り返したコードユニット列を生成。

Note 2

ジェネリック。

22.1.3.19 String.prototype.replace ( `searchValue`, `replaceValue` )

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
searchValue が undefined でも null でもないなら
1. replacer を ? GetMethod(searchValue, %Symbol.replace%)。
2. replacer が undefined でなければ
  1. ? Call(replacer, searchValue, « O, replaceValue ») を返す。
string を ? ToString(O)。
searchString を ? ToString(searchValue)。
functionalReplace を IsCallable(replaceValue)。
functionalReplace が false なら
1. replaceValue を ? ToString(replaceValue) に設定。
searchLength を searchString の長さ。
position を StringIndexOf(string, searchString, 0)。
position が not-found なら string を返す。
preceding を string の 0 から position まで。
following を string の position + searchLength 以降。
functionalReplace が true なら
1. replacement を ? ToString(? Call(replaceValue, undefined, « searchString, 𝔽(position), string »))。
それ以外
1. アサート: replaceValue は String。
2. captures を空リスト。
3. replacement を ! GetSubstitution(searchString, string, position, captures, undefined, replaceValue)。
preceding, replacement, following を連結して返す。

Note

ジェネリック。

22.1.3.19.1 GetSubstitution ( `matched`, `str`, `position`, `captures`, `namedCaptures`, `replacementTemplate` )

The abstract operation GetSubstitution takes arguments matched (a String), str (a String), position (a non-negative integer), captures (a List of either Strings or undefined), namedCaptures (an Object or undefined), and replacementTemplate (a String) and returns either a normal completion containing a String or a throw completion. この抽象操作において decimal digit は 0x0030 (DIGIT ZERO) から 0x0039 (DIGIT NINE) までのコードユニット。 It performs the following steps when called:

stringLength を str の長さ。
アサート: position ≤ stringLength。
result を空文字列。
templateRemainder を replacementTemplate。
templateRemainder が空文字列でない間繰り返し、
1. 注記: 以下の手順で接頭辞 ref を分離し、その置換 refReplacement を決定し result に追加する。
2. templateRemainder が "$$" で始まるなら
  1. ref を "$$"。
  2. refReplacement を "$"。
3. それ以外で "$`" で始まるなら
  1. ref を "$`"。
  2. refReplacement を str の 0 から position まで。
4. それ以外で "$&" で始まるなら
  1. ref を "$&"。
  2. refReplacement を matched。
5. それ以外で "$'" (0x0024 + 0x0027) で始まるなら
  1. ref を "$'"。
  2. matchLength を matched の長さ。
  3. tailPos を position + matchLength。
  4. refReplacement を str の min(tailPos, stringLength) から末尾まで。
  5. 注: tailPos が stringLength を超えるのは %RegExp.prototype% でない "exec" を持つオブジェクトにより呼ばれた場合のみ。
6. それ以外で "$" に 1 個以上の 10 進数字が続くなら
  1. 2 つ以上の数字が続くなら digitCount を 2、そうでなければ 1。
  2. digits を 1 から 1 + digitCount の部分文字列。
  3. index を ℝ(StringToNumber(digits))。
  4. アサート: 0 ≤ index ≤ 99。
  5. captureLen を captures の要素数。
  6. index > captureLen かつ digitCount = 2 なら
    1. 注: 2 桁が範囲外なら 1 桁とリテラル数字に扱い直す。
    2. digitCount を 1。
    3. digits をその先頭 1 桁に。
    4. index を ℝ(StringToNumber(digits))。
  7. ref を 0 から 1 + digitCount の部分文字列。
  8. 1 ≤ index ≤ captureLen なら
    1. capture を captures[index - 1]。
    2. capture が undefined なら
      1. refReplacement を空文字列。
    3. それ以外
      1. refReplacement を capture。
  9. それ以外
    1. refReplacement を ref。
7. それ以外で "$<" で始まるなら
  1. gtPos を StringIndexOf(templateRemainder, ">", 0)。
  2. gtPos が not-found または namedCaptures が undefined なら
    1. ref を "$<"。
    2. refReplacement を ref。
  3. それ以外
    1. ref を 0 から gtPos + 1 の部分。
    2. groupName を 2 から gtPos の部分。
    3. アサート: namedCaptures はオブジェクト。
    4. capture を ? Get(namedCaptures, groupName)。
    5. capture が undefined なら
      1. refReplacement を空文字列。
    6. それ以外
      1. refReplacement を ? ToString(capture)。
8. それ以外
  1. ref を 0 から 1 の部分。
  2. refReplacement を ref。
9. refLength を ref の長さ。
10. templateRemainder を refLength 以降の部分へ。
11. result を result と refReplacement の連結。
result を返す。

22.1.3.20 String.prototype.replaceAll ( `searchValue`, `replaceValue` )

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
searchValue が undefined でも null でもないなら
1. isRegExp を ? IsRegExp(searchValue)。
2. isRegExp が true なら
  1. flags を ? Get(searchValue, "flags")。
  2. ? RequireObjectCoercible(flags)。
  3. ? ToString(flags) に "g" が含まれなければ TypeError。
3. replacer を ? GetMethod(searchValue, %Symbol.replace%)。
4. replacer が undefined でなければ
  1. ? Call(replacer, searchValue, « O, replaceValue ») を返す。
string を ? ToString(O)。
searchString を ? ToString(searchValue)。
functionalReplace を IsCallable(replaceValue)。
functionalReplace が false なら
1. replaceValue を ? ToString(replaceValue)。
searchLength を searchString の長さ。
advanceBy を max(1, searchLength)。
matchPositions を空リスト。
position を StringIndexOf(string, searchString, 0)。
position が not-found でない間繰り返し、
1. matchPositions に position を追加。
2. position を StringIndexOf(string, searchString, position + advanceBy) に設定。
endOfLastMatch を 0。
result を空文字列。
matchPositions の各 p について
1. preserved を string の endOfLastMatch から p まで。
2. functionalReplace が true なら
  1. replacement を ? ToString(? Call(replaceValue, undefined, « searchString, 𝔽(p), string »))。
3. それ以外
  1. アサート: replaceValue は String。
  2. captures を空リスト。
  3. replacement を ! GetSubstitution(searchString, string, p, captures, undefined, replaceValue)。
4. result を result, preserved, replacement の連結に。
5. endOfLastMatch を p + searchLength に。
endOfLastMatch < string の長さなら
1. result を result と string の endOfLastMatch 以降の部分の連結に。
result を返す。

22.1.3.21 String.prototype.search ( `regexp` )

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
regexp が undefined でも null でもないなら
1. searcher を ? GetMethod(regexp, %Symbol.search%)。
2. searcher が undefined でなければ
  1. ? Call(searcher, regexp, « O ») を返す。
string を ? ToString(O)。
rx を ? RegExpCreate(regexp, undefined)。
? Invoke(rx, %Symbol.search%, « string ») を返す。

Note

ジェネリック。

22.1.3.22 String.prototype.slice ( `start`, `end` )

このメソッドは、このオブジェクトを String に変換した結果の start から (含まない)end まで（end が undefined なら末尾まで）の substring を返す。start が負なら sourceLength + start とみなし、end が負なら sourceLength + end とみなす。結果は String 値。

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
S を ? ToString(O)。
len を S の長さ。
intStart を ? ToIntegerOrInfinity(start)。
intStart = -∞ なら from を 0。
それ以外で intStart < 0 なら from を max(len + intStart, 0)。
それ以外は from を min(intStart, len)。
end が undefined なら intEnd を len、そうでなければ ? ToIntegerOrInfinity(end)。
intEnd = -∞ なら to を 0。
それ以外で intEnd < 0 なら to を max(len + intEnd, 0)。
それ以外は to を min(intEnd, len)。
from ≥ to なら空文字列。
S の from から to の部分文字列を返す。

Note

ジェネリック。

22.1.3.23 String.prototype.split ( `separator`, `limit` )

このメソッドは、このオブジェクトを String に変換した結果を左から separator の出現で分割した部分文字列を配列に格納し返す。separator は任意長の String または %Symbol.split% メソッドを持つオブジェクト (例: RegExp)。

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
separator が undefined でも null でもないなら
1. splitter を ? GetMethod(separator, %Symbol.split%)。
2. splitter が undefined でなければ
  1. ? Call(splitter, separator, « O, limit ») を返す。
S を ? ToString(O)。
limit が undefined なら lim を 2³² - 1、そうでなければ ℝ(? ToUint32(limit))。
R を ? ToString(separator)。
lim = 0 なら CreateArrayFromList(« »)。
separator が undefined なら CreateArrayFromList(« S »)。
separatorLength を R の長さ。
separatorLength = 0 なら
1. strLen を S の長さ。
2. outLen を lim を 0 と strLen の間にクランプした結果。
3. head を S の 0 から outLen。
4. codeUnits を head のコードユニット列リスト。
5. CreateArrayFromList(codeUnits) を返す。
S が空文字列なら CreateArrayFromList(« S »)。
substrings を空リスト。
i を 0。
j を StringIndexOf(S, R, 0)。
j が not-found でない間
1. T を S の i から j。
2. substrings に T を追加。
3. substrings の要素数が lim なら CreateArrayFromList(substrings)。
4. i を j + separatorLength。
5. j を StringIndexOf(S, R, i)。
T を S の i から末尾。
substrings に T を追加。
CreateArrayFromList(substrings) を返す。

Note 1

separator が空文字列なら先頭末尾や直前マッチ末尾の空部分文字列はマッチしない。結果配列長は文字列長で各要素は1コードユニット。

this が空文字列の場合、separator が空文字列にマッチするなら結果は空、そうでなければ 1 要素（空文字列）。

separator が undefined なら結果は 1 要素で this の文字列表現。limit が指定されればサイズ制限。

Note 2

ジェネリック。

22.1.3.24 String.prototype.startsWith ( `searchString` [ , `position` ] )

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
S を ? ToString(O)。
isRegExp を ? IsRegExp(searchString)。
isRegExp が true なら TypeError。
searchStr を ? ToString(searchString)。
len を S の長さ。
position が undefined なら pos を 0、そうでなければ ? ToIntegerOrInfinity(position)。
start を pos を 0 と len の間にクランプした結果。
searchLength を searchStr の長さ。
searchLength = 0 なら true。
end を start + searchLength。
end > len なら false。
substring を S の start から end。
substring = searchStr なら true。
false を返す。

Note 1

指定位置からの一致で true。

Note 2

RegExp の場合例外を投げる理由は将来拡張のため。

Note 3

ジェネリック。

22.1.3.25 String.prototype.substring ( `start`, `end` )

このメソッドは、このオブジェクトを String に変換した結果のインデックス start から (含まない)end まで（end が undefined なら末尾まで）の substring を返す。結果は String 値。

どちらかの引数が NaN または負なら 0 に置換。長さを超えるなら長さに置換。

start > end の場合は入れ替える。

手順:

O を this。
? RequireObjectCoercible(O)。
S を ? ToString(O)。
len を S の長さ。
intStart を ? ToIntegerOrInfinity(start)。
end が undefined なら intEnd を len、そうでなければ ? ToIntegerOrInfinity(end)。
finalStart を intStart を 0 と len の間にクランプした結果。
finalEnd を intEnd を 0 と len の間にクランプした結果。
from を min(finalStart, finalEnd)。
to を max(finalStart, finalEnd)。
S の from から to の部分文字列を返す。

Note

ジェネリック。

22.1.3.26 String.prototype.toLocaleLowerCase ( [ `reserved1` [ , `reserved2` ] ] )

ECMA-402 実装時はそちらに従う。未実装時は以下。

このメソッドは 6.1.4 に記載の UTF-16 コードポイント列として String を解釈する。

toLowerCase と同様だがロケール依存（例: トルコ語など特別な場合）。

オプション引数の意味は ECMA-402 に定義され、未実装時は他用途禁止。

Note

ジェネリック。

22.1.3.27 String.prototype.toLocaleUpperCase ( [ `reserved1` [ , `reserved2` ] ] )

ECMA-402 実装時はその仕様に従う。未実装時は以下。

String を UTF-16 コードポイント列として解釈。

toUpperCase と同様だがロケール依存結果を意図。

オプション引数の意味は ECMA-402 にのみ定義。

Note

ジェネリック。

22.1.3.28 String.prototype.toLowerCase ( )

String を UTF-16 コードポイント列として解釈する (6.1.4)。

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
S を ? ToString(O)。
sText を StringToCodePoints(S)。
lowerText を Unicode 既定のケース変換アルゴリズムに従い toLowercase(sText)。
L を CodePointsToString(lowerText)。
L を返す。

結果は Unicode Character Database のロケール非依存マッピング（UnicodeData.txt と SpecialCasing.txt のロケール非依存部分）に従う。

Note 1

一部コードポイントは複数コードポイントへ写像され長さが変わり得る。toUpperCase と toLowerCase は文脈依存で対称ではない。

Note 2

ジェネリック。

22.1.3.29 String.prototype.toString ( )

呼び出し時:

? ThisStringValue(this value) を返す。

Note

String オブジェクトにおいては valueOf と同じ結果。

22.1.3.30 String.prototype.toUpperCase ( )

String を UTF-16 コードポイント列として解釈する。

String.prototype.toLowerCase と同様に動作するが Unicode 既定ケース変換の大文字化アルゴリズムを用いる。

Note

ジェネリック。

22.1.3.31 String.prototype.toWellFormed ( )

このオブジェクトの文字列表現から、サロゲートペアを構成しない孤立した先行・後続サロゲートを U+FFFD (REPLACEMENT CHARACTER) に置換した String を返す。

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
S を ? ToString(O)。
strLen を S の長さ。
k を 0。
result を空文字列。
k < strLen の間繰り返し
1. cp を CodePointAt(S, k)。
2. cp.[[IsUnpairedSurrogate]] が true なら
  1. result を result と 0xFFFD の連結。
3. それ以外
  1. result を result と UTF16EncodeCodePoint(cp.[[CodePoint]]) の連結。
4. k を k + cp.[[CodeUnitCount]]。
result を返す。

22.1.3.32 String.prototype.trim ( )

String を UTF-16 コードポイント列として解釈する。

呼び出し時:

S を this の値。
? TrimString(S, start+end) を返す。

Note

ジェネリック。

22.1.3.32.1 TrimString ( `string`, `where` )

The abstract operation TrimString takes arguments string (an ECMAScript language value) and where (start, end, or start+end) and returns either a normal completion containing a String or a throw completion. string を UTF-16 コードポイント列として解釈する。 It performs the following steps when called:

? RequireObjectCoercible(string)。
S を ? ToString(string)。
where が start なら
1. T を S から前方空白を除去した String。
それ以外で where が end なら
1. T を S から後方空白を除去した String。
それ以外
1. アサート: where は start+end。
2. T を S の前後空白を除去した String。
T を返す。

空白の定義は WhiteSpace と LineTerminator の和集合。Unicode 一般カテゴリ Space_Separator (“Zs”) の判定は UTF-16 として解釈する。

22.1.3.33 String.prototype.trimEnd ( )

String を UTF-16 コードポイント列として解釈する。

呼び出し時:

S を this の値。
? TrimString(S, end) を返す。

Note

ジェネリック。

22.1.3.34 String.prototype.trimStart ( )

String を UTF-16 コードポイント列として解釈する。

呼び出し時:

S を this の値。
? TrimString(S, start) を返す。

Note

ジェネリック。

22.1.3.35 String.prototype.valueOf ( )

呼び出し時:

? ThisStringValue(this value) を返す。

22.1.3.35.1 ThisStringValue ( `value` )

The abstract operation ThisStringValue takes argument value (an ECMAScript language value) and returns either a normal completion containing a String or a throw completion. It performs the following steps when called:

value が String なら value を返す。
value がオブジェクトで [[StringData]] 内部スロットを持つなら
1. s を value.[[StringData]]。
2. アサート: s は String。
3. s を返す。
TypeError 例外を投げる。

22.1.3.36 String.prototype [ %Symbol.iterator% ] ( )

このメソッドは String 値のコードポイントを順次 (各コードポイントを String として) 返すイテレータオブジェクトを返す。

呼び出し時:

O を this。
? RequireObjectCoercible(O)。
s を ? ToString(O)。
closure を s を捕捉し、呼び出し時に以下を行う抽象クロージャとする:
1. len を s の長さ。
2. position を 0。
3. position < len の間繰り返し
  1. cp を CodePointAt(s, position)。
  2. nextIndex を position + cp.[[CodeUnitCount]]。
  3. resultString を s の position から nextIndex。
  4. position を nextIndex。
  5. ? GeneratorYield(CreateIteratorResultObject(resultString, false))。
4. NormalCompletion(unused) を返す。
CreateIteratorFromClosure(closure, "%StringIteratorPrototype%", %StringIteratorPrototype%) を返す。

このメソッドの "name" プロパティの値は "[Symbol.iterator]" である。

22.1.4 String インスタンスのプロパティ

String インスタンスは String エキゾチックオブジェクトであり、その内部メソッドを持つ。String インスタンスは String プロトタイプオブジェクトからプロパティを継承し、[[StringData]] 内部スロットを持つ。この内部スロットはその String オブジェクトが表す String 値である。

String インスタンスは "length" プロパティと整数インデックス名を持つ列挙可能プロパティ集合を持つ。

22.1.4.1 length

この String オブジェクトが表す String 値の要素数。

一度初期化されると不変。このプロパティの属性は { [[Writable]]: false, [[Enumerable]]: false, [[Configurable]]: false }。

22.1.5 String 反復子オブジェクト

String Iterator は特定の String インスタンス上の特定の反復を表すオブジェクト。名前付きコンストラクターは存在せず、String インスタンスの特定メソッド呼び出しで生成される。

22.1.5.1 %StringIteratorPrototype% オブジェクト

%StringIteratorPrototype% オブジェクト:

全ての String Iterator オブジェクトが継承するプロパティを持つ。
通常のオブジェクトである。
[[Prototype]] 内部スロットの値は %Iterator.prototype% である。
次のプロパティを持つ:

22.1.5.1.1 %StringIteratorPrototype%.next ( )

? GeneratorResume(this value, empty, "%StringIteratorPrototype%") を返す。

22.1.5.1.2 %StringIteratorPrototype% [ %Symbol.toStringTag% ]

%Symbol.toStringTag% プロパティの初期値は文字列 "String Iterator" である。

属性は { [[Writable]]: false, [[Enumerable]]: false, [[Configurable]]: true }。

22.2 RegExp (正規表現) オブジェクト

RegExp オブジェクトは正規表現パターンと関連するフラグを保持する。

Note

正規表現の形式と機能は Perl 5 の正規表現機能を手本としている。

22.2.1 パターン

RegExp コンストラクターは入力のパターン文字列に対して以下の文法を適用する。文字列が Pattern の展開として解釈できない場合、エラーが発生する。

構文

Pattern

[UnicodeMode, UnicodeSetsMode, NamedCaptureGroups]

Disjunction

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

Disjunction

[UnicodeMode, UnicodeSetsMode, NamedCaptureGroups]

Alternative

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

Alternative

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

Disjunction

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

Alternative

[UnicodeMode, UnicodeSetsMode, NamedCaptureGroups]

[empty]

Alternative

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

Term

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

Term

[UnicodeMode, UnicodeSetsMode, NamedCaptureGroups]

Assertion

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

Atom

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

Atom

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

Quantifier

Assertion

[UnicodeMode, UnicodeSetsMode, NamedCaptureGroups]

(?=

Disjunction

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

)

(?!

Disjunction

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

)

(?<=

Disjunction

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

)

(?<!

Disjunction

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

)

Quantifier

QuantifierPrefix

{

DecimalDigits

[~Sep]

}

{

DecimalDigits

[~Sep]

{

DecimalDigits

[~Sep]

DecimalDigits

[~Sep]

}

Atom

[UnicodeMode, UnicodeSetsMode, NamedCaptureGroups]

PatternCharacter

AtomEscape

[?UnicodeMode, ?NamedCaptureGroups]

CharacterClass

[?UnicodeMode, ?UnicodeSetsMode]

(

GroupSpecifier

[?UnicodeMode]

opt

Disjunction

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

)

RegularExpressionModifiers

Disjunction

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

)

RegularExpressionModifiers

Disjunction

[?UnicodeMode, ?UnicodeSetsMode, ?NamedCaptureGroups]

)

RegularExpressionModifiers

[empty]

RegularExpressionModifiers

RegularExpressionModifier

one of

SyntaxCharacter

one of

(

)

[

]

{

}

PatternCharacter

SourceCharacter

but not SyntaxCharacter

AtomEscape

[UnicodeMode, NamedCaptureGroups]

DecimalEscape

CharacterClassEscape

[?UnicodeMode]

CharacterEscape

[?UnicodeMode]

[+NamedCaptureGroups]

[?UnicodeMode]

[UnicodeMode]

[lookahead ∉ DecimalDigit]

HexEscapeSequence

RegExpUnicodeEscapeSequence

[?UnicodeMode]

[?UnicodeMode]

one of

[UnicodeMode]

[?UnicodeMode]

[UnicodeMode]

[?UnicodeMode]

[UnicodeMode]

RegExpIdentifierStart

[?UnicodeMode]

RegExpIdentifierName

[?UnicodeMode]

RegExpIdentifierPart

[?UnicodeMode]

RegExpIdentifierStart

[UnicodeMode]

IdentifierStartChar

RegExpUnicodeEscapeSequence

[+UnicodeMode]

[~UnicodeMode]

UnicodeLeadSurrogate

UnicodeTrailSurrogate

RegExpIdentifierPart

[UnicodeMode]

IdentifierPartChar

RegExpUnicodeEscapeSequence

[+UnicodeMode]

[~UnicodeMode]

UnicodeLeadSurrogate

UnicodeTrailSurrogate

RegExpUnicodeEscapeSequence

[UnicodeMode]

[+UnicodeMode]

[+UnicodeMode]

[+UnicodeMode]

[+UnicodeMode]

[~UnicodeMode]

[+UnicodeMode]

}

any Unicode code point in the inclusive interval from U+D800 to U+DBFF

UnicodeTrailSurrogate

any Unicode code point in the inclusive interval from U+DC00 to U+DFFF

関連付ける \u HexLeadSurrogate の選択が曖昧である各 \u HexTrailSurrogate は、他に対応する \u HexTrailSurrogate を持たない最も近い u HexLeadSurrogate に関連付けられなければならない。

HexLeadSurrogate

Hex4Digits

but only if the MV of Hex4Digits is in the inclusive interval from 0xD800 to 0xDBFF

HexTrailSurrogate

Hex4Digits

but only if the MV of Hex4Digits is in the inclusive interval from 0xDC00 to 0xDFFF

HexNonSurrogate

Hex4Digits

but only if the MV of Hex4Digits is not in the inclusive interval from 0xD800 to 0xDFFF

IdentityEscape

[UnicodeMode]

[+UnicodeMode]

SyntaxCharacter

[+UnicodeMode]

[~UnicodeMode]

SourceCharacter

but not UnicodeIDContinue

DecimalEscape

NonZeroDigit

DecimalDigits

[~Sep]

opt

[lookahead ∉ DecimalDigit]

CharacterClassEscape

[UnicodeMode]

[+UnicodeMode]

UnicodePropertyValueExpression

}

[+UnicodeMode]

UnicodePropertyValueExpression

}

UnicodePropertyValueExpression

UnicodePropertyName

UnicodePropertyValue

LoneUnicodePropertyNameOrValue

UnicodePropertyName

UnicodePropertyNameCharacters

UnicodePropertyNameCharacter

UnicodePropertyNameCharacters

opt

UnicodePropertyValue

UnicodePropertyValueCharacters

LoneUnicodePropertyNameOrValue

UnicodePropertyValueCharacters

UnicodePropertyValueCharacter

UnicodePropertyValueCharacters

opt

UnicodePropertyValueCharacter

UnicodePropertyNameCharacter

DecimalDigit

UnicodePropertyNameCharacter

AsciiLetter

CharacterClass

[UnicodeMode, UnicodeSetsMode]

[

[lookahead ≠ ^]

ClassContents

[?UnicodeMode, ?UnicodeSetsMode]

]

ClassContents

[?UnicodeMode, ?UnicodeSetsMode]

]

ClassContents

[UnicodeMode, UnicodeSetsMode]

[empty]

[~UnicodeSetsMode]

NonemptyClassRanges

[?UnicodeMode]

[+UnicodeSetsMode]

[UnicodeMode]

[?UnicodeMode]

[?UnicodeMode]

NonemptyClassRangesNoDash

[?UnicodeMode]

ClassAtom

[?UnicodeMode]

ClassAtom

[?UnicodeMode]

ClassContents

[?UnicodeMode, ~UnicodeSetsMode]

NonemptyClassRangesNoDash

[UnicodeMode]

ClassAtom

[?UnicodeMode]

ClassAtomNoDash

[?UnicodeMode]

NonemptyClassRangesNoDash

[?UnicodeMode]

ClassAtomNoDash

[?UnicodeMode]

ClassAtom

[?UnicodeMode]

ClassContents

[?UnicodeMode, ~UnicodeSetsMode]

[UnicodeMode]

[?UnicodeMode]

[UnicodeMode]

but not one of \ or ] or -

ClassEscape

[?UnicodeMode]

ClassEscape

[UnicodeMode]

[+UnicodeMode]

[?UnicodeMode]

[?UnicodeMode]

opt

opt

[lookahead ≠ &]

[lookahead ≠ &]

ClassStringDisjunction

ClassSetCharacter

NestedClass

[

[lookahead ≠ ^]

ClassContents

[+UnicodeMode, +UnicodeSetsMode]

]

ClassContents

[+UnicodeMode, +UnicodeSetsMode]

]

CharacterClassEscape

[+UnicodeMode]

Note 1

最初の二つの行は CharacterClass と同等である。

ClassStringDisjunction

\q{

ClassStringDisjunctionContents

}

ClassStringDisjunctionContents

ClassString

ClassStringDisjunctionContents

[empty]

opt

[lookahead ∉ ClassSetReservedDoublePunctuator]

SourceCharacter

but not ClassSetSyntaxCharacter

CharacterEscape

[+UnicodeMode]

ClassSetReservedPunctuator

ClassSetReservedDoublePunctuator

one of

;;

ClassSetSyntaxCharacter

one of

(

)

[

]

{

}

ClassSetReservedPunctuator

one of

;

Note 2

この節の複数の生成規則は B.1.2 で別定義が与えられる。

22.2.1.1 静的セマンティクス: 早期エラー

Note

この節は B.1.2.1 で修正される。

Pattern

Disjunction

CountLeftCapturingParensWithin(Pattern) ≥ 2³² - 1 の場合、構文エラー。
Pattern が互いに異なる二つの GroupSpecifier x と y を含み、それらの CapturingGroupName が等しく、かつ MightBothParticipate(x, y) が true の場合、構文エラー。

QuantifierPrefix

{

DecimalDigits

}

最初の DecimalDigits の MV が二番目の DecimalDigits の MV より大きい場合、構文エラー。

Atom

RegularExpressionModifiers

Disjunction

)

RegularExpressionModifiers が同一コードポイントを重複して含む場合、構文エラー。

Atom

RegularExpressionModifiers

Disjunction

)

最初および二番目の RegularExpressionModifiers がともに空なら構文エラー。
最初の RegularExpressionModifiers が同一コードポイントを重複して含む場合、構文エラー。
二番目の RegularExpressionModifiers が同一コードポイントを重複して含む場合、構文エラー。
最初の RegularExpressionModifiers に含まれる任意のコードポイントが二番目にも含まれる場合、構文エラー。

AtomEscape

GroupName

GroupSpecifiersThatMatch(GroupName) が空なら構文エラー。

AtomEscape

DecimalEscape

DecimalEscape の CapturingGroupNumber が AtomEscape を含む Pattern 内の CountLeftCapturingParensWithin より大きい場合、構文エラー。

NonemptyClassRanges

ClassAtom

ClassContents

最初または二番目の ClassAtom の IsCharacterClass が true なら構文エラー。
両方の IsCharacterClass が false で、かつ最初の CharacterValue > 二番目の CharacterValue の場合構文エラー。

NonemptyClassRangesNoDash

ClassAtomNoDash

ClassAtom

ClassContents

ClassAtomNoDash または ClassAtom の IsCharacterClass が true なら構文エラー。
両方の IsCharacterClass が false で、ClassAtomNoDash の CharacterValue > ClassAtom の CharacterValue なら構文エラー。

RegExpIdentifierStart

RegExpUnicodeEscapeSequence

RegExpUnicodeEscapeSequence の CharacterValue が IdentifierStartChar 字句文法生成規則でマッチするコードポイント値でない場合構文エラー。

RegExpIdentifierStart

UnicodeLeadSurrogate

UnicodeTrailSurrogate

RegExpIdentifierStart の RegExpIdentifierCodePoint が UnicodeIDStart 生成規則でマッチしない場合構文エラー。

RegExpIdentifierPart

RegExpUnicodeEscapeSequence

RegExpUnicodeEscapeSequence の CharacterValue が IdentifierPartChar でマッチするコードポイント値でない場合構文エラー。

RegExpIdentifierPart

UnicodeLeadSurrogate

UnicodeTrailSurrogate

RegExpIdentifierPart の RegExpIdentifierCodePoint が UnicodeIDContinue でマッチしない場合構文エラー。

UnicodePropertyValueExpression

UnicodePropertyName

UnicodePropertyValue

UnicodePropertyName にマッチしたソーステキストが Table 67 の「Property name and aliases」列に記載されている Unicode property name またはプロパティエイリアスでない場合、これは構文エラーとなる。
UnicodePropertyValue にマッチしたソーステキストが、UnicodePropertyName にマッチしたソーステキストで指定された Unicode プロパティまたはプロパティエイリアスのプロパティ値またはプロパティ値エイリアスとして PropertyValueAliases.txt に記載されていない場合、これは構文エラーとなる。

UnicodePropertyValueExpression

LoneUnicodePropertyNameOrValue

LoneUnicodePropertyNameOrValue にマッチしたソーステキストが PropertyValueAliases.txt に記載されている General_Category (gc) プロパティの Unicode プロパティ値またはプロパティ値エイリアスでもなく、「Property name and aliases」列の Table 68 に記載されたバイナリプロパティまたはバイナリプロパティエイリアスでもなく、また Table 69 の「Property name」列に記載されている文字列のバイナリプロパティでもない場合、これは構文エラーとなる。
包含する Pattern が _{[UnicodeSetsMode]} パラメータを持たず、LoneUnicodePropertyNameOrValue にマッチしたソーステキストが Table 69 の「Property name」列に記載されている文字列のバイナリプロパティである場合、これは構文エラーとなる。

CharacterClassEscape

UnicodePropertyValueExpression

}

UnicodePropertyValueExpression の MayContainStrings が true なら構文エラー。

CharacterClass

ClassContents

]

ClassContents の MayContainStrings が true なら構文エラー。

NestedClass

ClassContents

]

ClassContents の MayContainStrings が true なら構文エラー。

ClassSetRange

ClassSetCharacter

最初の ClassSetCharacter の CharacterValue が 2 番目より大きい場合構文エラー。

22.2.1.2 静的セマンティクス: CountLeftCapturingParensWithin ( `node`: a Parse Node, ): 非負整数

The abstract operation UNKNOWN takes UNPARSEABLE ARGUMENTS. node 内の左捕捉括弧の個数を返す。左捕捉括弧とは Atom :: ( GroupSpecifieropt Disjunction ) 生成規則の ( 終端にマッチする任意の ( パターン文字である。

Note

この節は B.1.2.2 で修正される。

It performs the following steps when called:

アサート: node は RegExp パターン文法の生成規則インスタンスである。
node に含まれる Atom :: ( GroupSpecifieropt Disjunction ) の構文木ノード数を返す。

22.2.1.3 静的セマンティクス: CountLeftCapturingParensBefore ( `node`: a Parse Node, ): 非負整数

The abstract operation UNKNOWN takes UNPARSEABLE ARGUMENTS. 外側のパターン内で node の左側に現れる左捕捉括弧の数を返す。

Note

この節は B.1.2.2 で修正される。

It performs the following steps when called:

アサート: node は RegExp パターン文法の生成規則インスタンスである。
pattern を node を含む Pattern とする。
pattern 内で node より前に出現するか、node を包含する Atom :: ( GroupSpecifieropt Disjunction ) ノードの数を返す。

22.2.1.4 静的セマンティクス: MightBothParticipate ( `x`: a Parse Node, `y`: a Parse Node, ): Boolean

The abstract operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It performs the following steps when called:

アサート: x と y は同じ外側の Pattern を持つ。
外側の Pattern が Disjunction :: Alternative | Disjunction ノードを含み、x が Alternative 内に、y が派生した Disjunction 内（または逆）に含まれる場合 false を返す。
true を返す。

22.2.1.5 静的セマンティクス: CapturingGroupNumber : 正の整数

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS.

Note

この節は B.1.2.1 で修正される。

It is defined piecewise over the following productions:

DecimalEscape

NonZeroDigit

NonZeroDigit の MV を返す。

DecimalEscape

NonZeroDigit

DecimalDigits

n を DecimalDigits のコードポイント数とする。
(NonZeroDigit の MV × 10ⁿ + DecimalDigits の MV) を返す。

“NonZeroDigit の MV” と “DecimalDigits の MV” の定義は 12.9.3 にある。

22.2.1.6 静的セマンティクス: IsCharacterClass : Boolean

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS.

Note

この節は B.1.2.3 で修正される。

It is defined piecewise over the following productions:

ClassAtom

ClassAtomNoDash

SourceCharacter

but not one of \ or ] or -

ClassEscape

CharacterEscape

false を返す。

ClassEscape

CharacterClassEscape

true を返す。

22.2.1.7 静的セマンティクス: CharacterValue : 非負整数

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS.

Note 1

この節は B.1.2.4 で修正される。

It is defined piecewise over the following productions:

ClassAtom

U+002D (HYPHEN-MINUS) の数値を返す。

ClassAtomNoDash

SourceCharacter

but not one of \ or ] or -

ch を SourceCharacter にマッチしたコードポイントとする。
ch の数値を返す。

ClassEscape

U+0008 (BACKSPACE) の数値を返す。

ClassEscape

U+002D (HYPHEN-MINUS) の数値を返す。

CharacterEscape

ControlEscape

Table 65 に従う数値を返す。

Table 65: ControlEscape コードポイント値

ControlEscape	数値	Code Point	Unicode 名	記号
`t`	9	`U+0009`	CHARACTER TABULATION	<HT>
`n`	10	`U+000A`	LINE FEED (LF)	<LF>
`v`	11	`U+000B`	LINE TABULATION	<VT>
`f`	12	`U+000C`	FORM FEED (FF)	<FF>
`r`	13	`U+000D`	CARRIAGE RETURN (CR)	<CR>

CharacterEscape

AsciiLetter

ch を AsciiLetter にマッチしたコードポイントとする。
i を ch の数値とする。
i を 32 で割った余りを返す。

CharacterEscape

[lookahead ∉ DecimalDigit]

U+0000 (NULL) の数値を返す。

Note 2

\0 は <NUL> 文字を表し、その後に 10 進数字を続けることはできない。

CharacterEscape

HexEscapeSequence

HexEscapeSequence の MV を返す。

RegExpUnicodeEscapeSequence

HexLeadSurrogate

HexTrailSurrogate

lead を HexLeadSurrogate の CharacterValue とする。
trail を HexTrailSurrogate の CharacterValue とする。
cp を UTF16SurrogatePairToCodePoint(lead, trail) とする。
cp の数値を返す。

RegExpUnicodeEscapeSequence

Hex4Digits

Hex4Digits の MV を返す。

RegExpUnicodeEscapeSequence

CodePoint

}

CodePoint の MV を返す。

Hex4Digits の MV を返す。

CharacterEscape

IdentityEscape

ch を IdentityEscape にマッチしたコードポイントとする。
ch の数値を返す。

ClassSetCharacter

SourceCharacter

but not ClassSetSyntaxCharacter

ch を SourceCharacter にマッチしたコードポイントとする。
ch の数値を返す。

ClassSetCharacter

ClassSetReservedPunctuator

ch を ClassSetReservedPunctuator にマッチしたコードポイントとする。
ch の数値を返す。

ClassSetCharacter

U+0008 (BACKSPACE) の数値を返す。

22.2.1.8 静的セマンティクス: MayContainStrings : Boolean

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It is defined piecewise over the following productions:

CharacterClassEscape

UnicodePropertyValueExpression

}

UnicodePropertyValueExpression

]

[empty]

false を返す。

UnicodePropertyValueExpression

LoneUnicodePropertyNameOrValue

LoneUnicodePropertyNameOrValue にマッチしたソーステキストが Table 69 の「Property name」列に記載されている文字列のバイナリプロパティである場合、true を返す。
false を返す。

ClassUnion

ClassSetRange

ClassUnion

opt

ClassUnion が存在するならその MayContainStrings を返す。
false を返す。

ClassUnion

ClassSetOperand

ClassUnion

opt

ClassSetOperand の MayContainStrings が true なら true を返す。
ClassUnion が存在するならその MayContainStrings を返す。
false を返す。

ClassIntersection

ClassSetOperand

最初の ClassSetOperand の MayContainStrings が false なら false。
二番目の ClassSetOperand の MayContainStrings が false なら false。
true を返す。

ClassIntersection

ClassSetOperand

ClassIntersection の MayContainStrings が false なら false。
ClassSetOperand の MayContainStrings が false なら false。
true を返す。

ClassSubtraction

ClassSetOperand

最初の ClassSetOperand の MayContainStrings を返す。

ClassSubtraction

ClassSetOperand

ClassSubtraction の MayContainStrings を返す。

ClassStringDisjunctionContents

ClassString

ClassStringDisjunctionContents

ClassString の MayContainStrings が true なら true。
ClassStringDisjunctionContents の MayContainStrings を返す。

ClassString

[empty]

true を返す。

ClassString

NonEmptyClassString

NonEmptyClassString の MayContainStrings を返す。

NonEmptyClassString

ClassSetCharacter

NonEmptyClassString

opt

NonEmptyClassString が存在するなら true。
false を返す。

22.2.1.9 静的セマンティクス: GroupSpecifiersThatMatch ( `thisGroupName`: a GroupName Parse Node, ): GroupSpecifier 構文ノードのリスト

The abstract operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It performs the following steps when called:

name を thisGroupName の CapturingGroupName とする。
pattern を thisGroupName を含む Pattern とする。
result を新しい空リストとする。
pattern が含む各 GroupSpecifier gs について
1. gs の CapturingGroupName が name なら
  1. gs を result に追加。
result を返す。

22.2.1.10 静的セマンティクス: CapturingGroupName : String

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It is defined piecewise over the following productions:

GroupName

RegExpIdentifierName

idTextUnescaped を RegExpIdentifierName の RegExpIdentifierCodePoints とする。
CodePointsToString(idTextUnescaped) を返す。

22.2.1.11 静的セマンティクス: RegExpIdentifierCodePoints : コードポイントのリスト

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It is defined piecewise over the following productions:

RegExpIdentifierName

RegExpIdentifierStart

cp を RegExpIdentifierStart の RegExpIdentifierCodePoint とする。
« cp » を返す。

RegExpIdentifierName

RegExpIdentifierPart

cps を派生した RegExpIdentifierName の RegExpIdentifierCodePoints とする。
cp を RegExpIdentifierPart の RegExpIdentifierCodePoint とする。
cps に « cp » を連結したリストを返す。

22.2.1.12 静的セマンティクス: RegExpIdentifierCodePoint : コードポイント

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It is defined piecewise over the following productions:

RegExpIdentifierStart

IdentifierStartChar

IdentifierStartChar にマッチしたコードポイントを返す。

RegExpIdentifierPart

IdentifierPartChar

IdentifierPartChar にマッチしたコードポイントを返す。

RegExpIdentifierStart

RegExpUnicodeEscapeSequence

RegExpIdentifierPart

RegExpUnicodeEscapeSequence

RegExpUnicodeEscapeSequence の CharacterValue を数値とするコードポイントを返す。

RegExpIdentifierStart

UnicodeLeadSurrogate

UnicodeTrailSurrogate

RegExpIdentifierPart

UnicodeLeadSurrogate

UnicodeTrailSurrogate

lead を UnicodeLeadSurrogate にマッチしたコードポイントの数値値を数値とするコードユニットとする。
trail を UnicodeTrailSurrogate にマッチしたコードポイントの数値値を数値とするコードユニットとする。
UTF16SurrogatePairToCodePoint(lead, trail) を返す。

22.2.2 パターンのセマンティクス

正規表現パターンは以下で記述される手順を用いて抽象クロージャ (Abstract Closure) に変換される。実装は、結果が同一である限り、以下に挙げるものより効率的なアルゴリズムを用いることが推奨される。この抽象クロージャは RegExp オブジェクトの [[RegExpMatcher]] 内部スロットの値として使われる。

Pattern は、その関連フラグに u も v も含まない場合 BMP パターンである。そうでなければ Unicode パターンである。BMP パターンは、基本多言語面 (BMP) の範囲内の Unicode コードポイントから成る 16 ビット値列として解釈される String に対してマッチを行う。Unicode パターンは UTF-16 でエンコードされた Unicode コードポイント列として解釈される String に対してマッチを行う。BMP パターンの挙動を記述する文脈では「文字」は単一の 16 ビット Unicode BMP コードポイントを意味する。Unicode パターンの挙動を記述する文脈では「文字」は UTF-16 でエンコードされたコードポイント (6.1.4) を意味する。いずれの文脈でも「character value」は対応する非エンコードなコードポイントの数値を意味する。

Pattern の構文とセマンティクスは、そのソーステキストが SourceCharacter 値の List であり、各 SourceCharacter が Unicode コードポイントに対応するとして定義される。BMP パターンが非 BMP の SourceCharacter を含む場合、パターン全体は UTF-16 でエンコードされ、そのエンコーディングの個々のコードユニットが List の要素として用いられる。

Note

例えば、ソーステキスト中で単一の非 BMP 文字 U+1D11E (MUSICAL SYMBOL G CLEF) で表されたパターンを考える。Unicode パターンとして解釈されると、それは単一コードポイント U+1D11E を要素とする 1 要素 (1 文字) の List となる。しかし BMP パターンとして解釈される場合、まず UTF-16 にエンコードされ、コードユニット 0xD834 と 0xDD1E から成る 2 要素の List となる。

パターンは非 BMP 文字が UTF-16 エンコードされた ECMAScript の String 値として RegExp コンストラクターに渡される。例えば単一文字 MUSICAL SYMBOL G CLEF のパターンは、長さ 2 の String であり、その要素はコードユニット 0xD834 と 0xDD1E であった。したがって、2 つのパターン文字から成る BMP パターンとして処理するためにこれ以上の変換は不要である。しかし Unicode パターンとして処理するには UTF16SurrogatePairToCodePoint を用いて、その唯一の要素が単一パターン文字 (コードポイント U+1D11E) である List を生成しなければならない。

実装は実際に UTF-16 との間のこのような変換を行わないかもしれないが、本仕様のセマンティクスは、パターンマッチングの結果があたかもそのような変換が行われたかのようであることを要求する。

22.2.2.1 表記

以下の記述では次の内部データ構造を用いる:

CharSetElement は次の 2 種類のいずれかである:
- rer.[[UnicodeSets]] が false の場合、CharSetElement は上記「パターンのセマンティクス」における意味での文字。
- rer.[[UnicodeSets]] が true の場合、CharSetElement は上記「パターンのセマンティクス」における意味での文字列（要素がそのような文字である列）。これには空列、1 文字列、複数文字列が含まれる。利便性のため、この種の CharSetElement を扱う際、単一文字は 1 文字列と同一視して扱う。
CharSet は CharSetElement の数学的集合。
CaptureRange は { [[StartIndex]], [[EndIndex]] } という Record で、キャプチャに含まれる文字の範囲を表す。[[StartIndex]] は Input 内での開始インデックス (含む) を表す整数、[[EndIndex]] は Input 内での終了インデックス (含まない) を表す整数である。任意の CaptureRange について、これらのインデックスは [[StartIndex]] ≤ [[EndIndex]] という不変条件を満たさなければならない。
MatchState は { [[Input]], [[EndIndex]], [[Captures]] } という Record で、[[Input]] はマッチ対象の String を表す文字の List、[[EndIndex]] は整数、[[Captures]] はパターン中の各左捕捉括弧に対応する値の List である。MatchState は正規表現マッチングアルゴリズム中の部分的なマッチ状態を表す。[[EndIndex]] はこれまでにパターンがマッチした最後の入力文字のインデックス + 1 を表し、[[Captures]] は捕捉括弧の結果を保持する。[[Captures]] の n^th 要素は n 番目の捕捉括弧が捕捉した文字範囲を表す CaptureRange か、まだ到達していない場合 undefined である。バックトラッキングのため、マッチング過程の任意時点で多数の MatchState が使用され得る。
MatcherContinuation は 1 つの MatchState 引数を取り、MatchState または failure を返す抽象クロージャ。MatcherContinuation はパターンの残り部分（クロージャが捕捉した値で特定される）を、引数の MatchState が示す中間状態から Input に対してマッチさせようとする。成功すれば最終的な MatchState を返し、失敗すれば failure を返す。
Matcher は 2 つの引数（MatchState と MatcherContinuation）を取り、MatchState または failure を返す抽象クロージャ。Matcher はパターンの中間サブパターン（クロージャが捕捉した値で特定される）をその MatchState の [[Input]] に対し、引数の MatchState が示す中間状態からマッチさせる。MatcherContinuation 引数は残りのパターンをマッチさせるクロージャであるべき。サブパターンをマッチさせて新しい MatchState を得た後、Matcher はその新しい MatchState に対して MatcherContinuation を呼び、残りのパターンがマッチできるか確認する。できれば Matcher は MatcherContinuation が返した MatchState を返し、できなければ選択点での別の選択を試み、成功するか全可能性が尽きるまで MatcherContinuation を繰り返し呼ぶ。

22.2.2.1.1 RegExp レコード

RegExp Record は、コンパイル中および必要に応じてマッチング中に RegExp について必要となる情報を保持するために用いられる Record 値である。

次のフィールドを持つ:

Table 66: RegExp Record Fields

Field Name	Value	Meaning
`[[IgnoreCase]]`	Boolean	フラグに "i" が現れるか
`[[Multiline]]`	Boolean	フラグに "m" が現れるか
`[[DotAll]]`	Boolean	フラグに "s" が現れるか
`[[Unicode]]`	Boolean	フラグに "u" が現れるか
`[[UnicodeSets]]`	Boolean	フラグに "v" が現れるか
`[[CapturingGroupsCount]]`	非負整数	パターン内の左捕捉括弧の数

22.2.2.2 実行時セマンティクス: CompilePattern : 文字の List と非負整数を取り MatchState か failure を返す抽象クロージャ

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It is defined piecewise over the following productions:

Pattern

Disjunction

m を Disjunction の CompileSubpattern (引数 rer, forward) とする。
rer と m を捕捉し、(Input, index) を引数に取り呼び出し時に以下を行う新しい抽象クロージャを返す:
1. アサート: Input は文字の List。
2. アサート: 0 ≤ index ≤ Input の要素数。
3. c を (y) を引数に取り以下を行う新しい MatcherContinuation（何も捕捉しない）とする:
  1. アサート: y は MatchState。
  2. y を返す。
4. cap を rer.[[CapturingGroupsCount]] 個の undefined を 1 から rer.[[CapturingGroupsCount]] で索引付けした List とする。
5. x を MatchState { [[Input]]: Input, [[EndIndex]]: index, [[Captures]]: cap } とする。
6. m(x, c) を返す。

Note

Pattern は抽象クロージャ値へコンパイルされる。RegExpBuiltinExec はその後、この手続を文字 List とその List 内のオフセットへ適用し、そのパターンがそのオフセットで正確にマッチするか、マッチするなら捕捉括弧の値が何であるかを決定できる。22.2.2 のアルゴリズムは、パターンのコンパイル時に SyntaxError 例外を投げ得るよう設計されている。一方で、一度成功裏にコンパイルされた後、得られる抽象クロージャを用いて文字 List 内でマッチを探索する際には（メモリ不足など実装定義の例外を除き）例外は投げられない。

22.2.2.3 実行時セマンティクス: CompileSubpattern : Matcher

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS.

Note 1

この節は B.1.2.5 で修正される。

It is defined piecewise over the following productions:

Disjunction

Alternative

Disjunction

m1 を Alternative の CompileSubpattern (引数 rer, direction) とする。
m2 を Disjunction の CompileSubpattern (引数 rer, direction) とする。
MatchTwoAlternatives(m1, m2) を返す。

Note 2

| 演算子は 2 つの選択肢を分離する。まず左側の Alternative（および正規表現の後続）へのマッチを試み、失敗したら右側の Disjunction（および後続）を試みる。左 Alternative、右 Disjunction、後続がいずれも選択点を持つ場合、左 Alternative の次の選択へ進む前に後続内の全ての選択が試される。左 Alternative の選択が尽きたら、左 Alternative の代わりに右 Disjunction が試される。| によりスキップされたパターン部分内の捕捉括弧は undefined を生成する。例:

/a|ab/.exec("abc")

は結果 "a" を返し "ab" ではない。また

/((a)|(ab))((c)|(bc))/.exec("abc")

は配列

["abc", "a", "a", undefined, "bc", undefined, "bc"]

を返し、

["abc", "ab", undefined, "ab", "c", "c", undefined]

ではない。2 つの選択肢を試す順序は direction の値と無関係。

Alternative

[empty]

EmptyMatcher() を返す。

Alternative

Term

m1 を Alternative の CompileSubpattern (引数 rer, direction) とする。
m2 を Term の CompileSubpattern (引数 rer, direction) とする。
MatchSequence(m1, m2, direction) を返す。

Note 3

連続する Term は Input の連続部分に同時にマッチを試みる。direction が forward のとき、左 Alternative、右 Term、後続がいずれも選択点を持つ場合、右 Term の次の選択へ進む前に後続内の全選択が試され、左 Alternative の次の選択へ進む前に右 Term の全選択が試される。direction が backward のとき、Alternative と Term の評価順序は逆転する。

Term

Assertion

Assertion の CompileAssertion (引数 rer) を返す。

Note 4

得られる Matcher は direction に依存しない。

Term

Atom

Atom の CompileAtom (引数 rer, direction) を返す。

Term

Atom

Quantifier

m を Atom の CompileAtom (引数 rer, direction) とする。
q を Quantifier の CompileQuantifier とする。
アサート: q.[[Min]] ≤ q.[[Max]].
parenIndex を CountLeftCapturingParensBefore(Term) とする。
parenCount を CountLeftCapturingParensWithin(Atom) とする。
(x, c) を引数に取り m, q, parenIndex, parenCount を捕捉し以下を行う新しい Matcher を返す:
1. アサート: x は MatchState。
2. アサート: c は MatcherContinuation。
3. RepeatMatcher(m, q.[[Min]], q.[[Max]], q.[[Greedy]], x, c, parenIndex, parenCount) を返す。

22.2.2.3.1 RepeatMatcher ( `m`, `min`, `max`, `greedy`, `x`, `c`, `parenIndex`, `parenCount` )

The abstract operation RepeatMatcher takes arguments m (a Matcher), min (非負整数), max (非負整数または +∞), greedy (Boolean), x (MatchState), c (MatcherContinuation), parenIndex (非負整数), and parenCount (非負整数) and returns MatchState または failure. It performs the following steps when called:

max = 0 なら c(x) を返す。
(y) を引数に取り m, min, max, greedy, x, c, parenIndex, parenCount を捕捉し以下を行う新しい MatcherContinuation d を作る:
1. アサート: y は MatchState。
2. min = 0 かつ y.[[EndIndex]] = x.[[EndIndex]] なら failure を返す。
3. min = 0 なら min2 を 0、そうでなければ min - 1。
4. max = +∞ なら max2 を +∞、そうでなければ max - 1。
5. RepeatMatcher(m, min2, max2, greedy, y, c, parenIndex, parenCount) を返す。
cap を x.[[Captures]] のコピーとする。
parenIndex + 1 から parenIndex + parenCount までの各整数 k について cap[k] に undefined を設定する。
Input を x.[[Input]] とする。
e を x.[[EndIndex]] とする。
xr を MatchState { [[Input]]: Input, [[EndIndex]]: e, [[Captures]]: cap } とする。
min ≠ 0 なら m(xr, d) を返す。
greedy が false なら
1. z を c(x) とする。
2. z が failure でなければ z を返す。
3. m(xr, d) を返す。
z を m(xr, d) とする。
z が failure でなければ z を返す。
c(x) を返す。

Note 1

Atom に Quantifier が続く場合、Quantifier に指定された回数だけ繰り返される。Quantifier は非貪欲 (non-greedy) の場合、後続にマッチしつつ可能な限り少ない回数繰り返され、貪欲 (greedy) の場合、後続にマッチしつつ可能な限り多く繰り返される。繰り返されるのは入力文字列ではなく Atom パターンであるため、各反復で異なる入力部分文字列にマッチし得る。

Note 2

Atom と後続の正規表現がいずれも選択点を持つ場合、まず Atom は可能な限り多く (非貪欲なら少なく) マッチする。後続内の全選択が試されてから、Atom の最後の反復で次の選択へ進む。最後 (n 回目) の反復の全選択が試されてから (n - 1) 回目の反復で次の選択へ進む。その時点で Atom の反復回数を増減できる可能性があり（再度、少ないか多いかから開始）、それらが尽きてから (n - 1) 回目の反復で次の選択へ進む……。

比較:

/a[a-z]{2,4}/.exec("abcdefghi")

は "abcde" を返し、

/a[a-z]{2,4}?/.exec("abcdefghi")

は "abc" を返す。

さらに:

/(aa|aabaac|ba|b|c)*/.exec("aabaac")

は選択点の順序により配列

["aaba", "ba"]

を返し、以下ではない:

["aabaac", "aabaac"]
["aabaac", "c"]

この選択点の順序は、単項表記の 2 つの数の最大公約数 (GCD) を計算する正規表現を書くのに利用できる。以下は 10 と 15 の gcd を計算する例:

"aaaaaaaaaa,aaaaaaaaaaaaaaa".replace(/^(a+)\1*,\1+$/, "$1")

結果は単項表記の "aaaaa"。

Note 3

RepeatMatcher のステップは Atom が繰り返されるたびにその捕捉をクリアする。次の正規表現で挙動が分かる:

/(z)((a+)?(b+)?(c))*/.exec("zaacbbbcac")

これは配列

["zaacbbbcac", "z", "ac", "a", undefined, "c"]

を返し、

["zaacbbbcac", "z", "ac", "a", "bbb", "c"]

ではない。これは外側の * の各反復が量指定された Atom に含まれる全捕捉文字列（ここでは 2, 3, 4, 5 番）をクリアするためである。

Note 4

RepeatMatcher のステップは、最小反復回数が満たされた後、空文字列にマッチする Atom のさらなる展開は追加反復として考慮しないと述べる。これは以下のようなパターンで無限ループに陥るのを防ぐ:

/(a*)*/.exec("b")

またはやや複雑な:

/(a*)b\1+/.exec("baaaac")

これは配列

["b", ""]

を返す。

22.2.2.3.2 EmptyMatcher ( )

The abstract operation EmptyMatcher takes no arguments and returns Matcher. It performs the following steps when called:

(x, c) を引数に取り何も捕捉せず以下を行う新しい Matcher を返す:
1. アサート: x は MatchState。
2. アサート: c は MatcherContinuation。
3. c(x) を返す。

22.2.2.3.3 MatchTwoAlternatives ( `m1`, `m2` )

The abstract operation MatchTwoAlternatives takes arguments m1 (Matcher) and m2 (Matcher) and returns Matcher. It performs the following steps when called:

(x, c) を引数に取り m1, m2 を捕捉し以下を行う新しい Matcher を返す:
1. アサート: x は MatchState。
2. アサート: c は MatcherContinuation。
3. r を m1(x, c) とする。
4. r が failure でなければ r を返す。
5. m2(x, c) を返す。

22.2.2.3.4 MatchSequence ( `m1`, `m2`, `direction` )

The abstract operation MatchSequence takes arguments m1 (Matcher), m2 (Matcher), and direction (forward または backward) and returns Matcher. It performs the following steps when called:

direction が forward なら
1. (x, c) を引数に取り m1, m2 を捕捉し以下を行う新しい Matcher を返す:
  1. アサート: x は MatchState。
  2. アサート: c は MatcherContinuation。
  3. (y) を引数に取り c, m2 を捕捉し以下を行う新しい MatcherContinuation d を作る:
    1. アサート: y は MatchState。
    2. m2(y, c) を返す。
  4. m1(x, d) を返す。
それ以外
1. アサート: direction は backward。
2. (x, c) を引数に取り m1, m2 を捕捉し以下を行う新しい Matcher を返す:
  1. アサート: x は MatchState。
  2. アサート: c は MatcherContinuation。
  3. (y) を引数に取り c, m1 を捕捉し以下を行う新しい MatcherContinuation d を作る:
    1. アサート: y は MatchState。
    2. m1(y, c) を返す。
  4. m2(x, d) を返す。

22.2.2.4 実行時セマンティクス: CompileAssertion : Matcher

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS.

Note 1

この節は B.1.2.6 で修正される。

It is defined piecewise over the following productions:

Assertion

(x, c) を引数に取り rer を捕捉し以下を行う新しい Matcher を返す:
1. アサート: x は MatchState。
2. アサート: c は MatcherContinuation。
3. Input を x.[[Input]] とする。
4. e を x.[[EndIndex]] とする。
5. e = 0 または rer.[[Multiline]] が true かつ文字 Input[e - 1] が LineTerminator にマッチするなら
  1. c(x) を返す。
6. failure を返す。

Note 2

y フラグがパターンに使われている場合でも、^ は常に Input の先頭、または (rer.[[Multiline]] が true の場合) 行頭にのみマッチする。

Assertion

(x, c) を引数に取り rer を捕捉し以下を行う新しい Matcher を返す:
1. アサート: x は MatchState。
2. アサート: c は MatcherContinuation。
3. Input を x.[[Input]] とする。
4. e を x.[[EndIndex]] とする。
5. InputLength を Input の要素数とする。
6. e = InputLength または rer.[[Multiline]] が true かつ文字 Input[e] が LineTerminator にマッチするなら
  1. c(x) を返す。
7. failure を返す。

Assertion

(x, c) を引数に取り rer を捕捉し以下を行う新しい Matcher を返す:
1. アサート: x は MatchState。
2. アサート: c は MatcherContinuation。
3. Input を x.[[Input]] とする。
4. e を x.[[EndIndex]] とする。
5. a を IsWordChar(rer, Input, e - 1) とする。
6. b を IsWordChar(rer, Input, e) とする。
7. (a が true かつ b が false) または (a が false かつ b が true) なら c(x) を返す。
8. failure を返す。

Assertion

(x, c) を引数に取り rer を捕捉し以下を行う新しい Matcher を返す:
1. アサート: x は MatchState。
2. アサート: c は MatcherContinuation。
3. Input を x.[[Input]] とする。
4. e を x.[[EndIndex]] とする。
5. a を IsWordChar(rer, Input, e - 1) とする。
6. b を IsWordChar(rer, Input, e) とする。
7. (a が true かつ b が true) または (a が false かつ b が false) なら c(x) を返す。
8. failure を返す。

Assertion

(?=

Disjunction

)

m を Disjunction の CompileSubpattern (引数 rer, forward) とする。
(x, c) を引数に取り m を捕捉し以下を行う新しい Matcher を返す:
1. アサート: x は MatchState。
2. アサート: c は MatcherContinuation。
3. (y) を引数に取り何も捕捉しない新しい MatcherContinuation d を作る:
  1. アサート: y は MatchState。
  2. y を返す。
4. r を m(x, d) とする。
5. r が failure なら failure を返す。
6. アサート: r は MatchState。
7. cap を r.[[Captures]] とする。
8. Input を x.[[Input]] とする。
9. xe を x.[[EndIndex]] とする。
10. z を MatchState { [[Input]]: Input, [[EndIndex]]: xe, [[Captures]]: cap } とする。
11. c(z) を返す。

Note 3

(?= Disjunction ) 形式はゼロ幅正の先読み。成功するには Disjunction 内のパターンが現在位置でマッチしなければならないが、後続をマッチする前に現在位置は進まない。Disjunction が現在位置で複数のマッチ方法を持つ場合、最初の 1 つのみ試す。他の演算子と異なり、(?= 形式内へのバックトラッキングは行われない (Perl 由来)。これは Disjunction が捕捉括弧を含み、パターン後続がそれらへの後方参照を含む場合のみ影響する。

例:

/(?=(a+))/.exec("baaabac")

は最初の b の直後で空文字列にマッチし、配列:

["", "aaa"]

を返す。先読み内へのバックトラッキング欠如を示すため:

/(?=(a+))a*b\1/.exec("baaabac")

は

["aba", "a"]

を返し、

["aaaba", "a"]

ではない。

Assertion

(?!

Disjunction

)

m を Disjunction の CompileSubpattern (引数 rer, forward) とする。
(x, c) を引数に取り m を捕捉し以下を行う新しい Matcher を返す:
1. アサート: x は MatchState。
2. アサート: c は MatcherContinuation。
3. (y) を引数に取り何も捕捉しない MatcherContinuation d を作る:
  1. アサート: y は MatchState。
  2. y を返す。
4. r を m(x, d) とする。
5. r が failure でなければ failure を返す。
6. c(x) を返す。

Note 4

(?! Disjunction ) 形式はゼロ幅負の先読み。成功には Disjunction 内のパターンが現在位置でマッチに失敗する必要がある。現在位置は後続をマッチする前に進まない。Disjunction は捕捉括弧を含み得るが、それらへの後方参照は Disjunction 内部でのみ意味を持つ。この負の先読みが成功するには失敗が必要であり、負の先読み外からのその捕捉への後方参照は常に undefined を返す。例:

/(.*?)a(?!(a+)b\2c)\2(.*)/.exec("baaabaac")

は「a の直後に n (>0) 個の a、b、さらに n 個の a (\2)、c」が続かない a を探す。2 つ目の \2 は負の先読みの外なので undefined にマッチし常に成功する。式は配列:

["baaabaac", "ba", undefined, "abaac"]

を返す。

Assertion

(?<=

Disjunction

)

m を Disjunction の CompileSubpattern (引数 rer, backward) とする。
(x, c) を引数に取り m を捕捉し以下を行う新しい Matcher を返す:
1. アサート: x は MatchState。
2. アサート: c は MatcherContinuation。
3. (y) を引数に取り何も捕捉しない MatcherContinuation d を作る:
  1. アサート: y は MatchState。
  2. y を返す。
4. r を m(x, d) とする。
5. r が failure なら failure を返す。
6. アサート: r は MatchState。
7. cap を r.[[Captures]] とする。
8. Input を x.[[Input]] とする。
9. xe を x.[[EndIndex]] とする。
10. z を MatchState { [[Input]]: Input, [[EndIndex]]: xe, [[Captures]]: cap } とする。
11. c(z) を返す。

Assertion

(?<!

Disjunction

)

m を Disjunction の CompileSubpattern (引数 rer, backward) とする。
(x, c) を引数に取り m を捕捉し以下を行う新しい Matcher を返す:
1. アサート: x は MatchState。
2. アサート: c は MatcherContinuation。
3. (y) を引数に取り何も捕捉しない MatcherContinuation d を作る:
  1. アサート: y は MatchState。
  2. y を返す。
4. r を m(x, d) とする。
5. r が failure でなければ failure を返す。
6. c(x) を返す。

22.2.2.4.1 IsWordChar ( `rer`, `Input`, `e` )

The abstract operation IsWordChar takes arguments rer (a RegExp Record), Input (文字の List), and e (整数) and returns Boolean. It performs the following steps when called:

InputLength を Input の要素数とする。
e = -1 または e = InputLength なら false を返す。
c を Input[e] の文字とする。
WordCharacters(rer) に c が含まれるなら true を返す。
false を返す。

22.2.2.5 実行時セマンティクス: CompileQuantifier を持つ Record

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It is defined piecewise over the following productions:

Quantifier

QuantifierPrefix

qp を QuantifierPrefix の CompileQuantifierPrefix とする。
Record { [[Min]]: qp.[[Min]], [[Max]]: qp.[[Max]], [[Greedy]]: true } を返す。

Quantifier

QuantifierPrefix

qp を QuantifierPrefix の CompileQuantifierPrefix とする。
Record { [[Min]]: qp.[[Min]], [[Max]]: qp.[[Max]], [[Greedy]]: false } を返す。

22.2.2.6 実行時セマンティクス: CompileQuantifierPrefix を持つ Record

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It is defined piecewise over the following productions:

QuantifierPrefix

Record { [[Min]]: 0, [[Max]]: +∞ } を返す。

QuantifierPrefix

Record { [[Min]]: 1, [[Max]]: +∞ } を返す。

QuantifierPrefix

Record { [[Min]]: 0, [[Max]]: 1 } を返す。

QuantifierPrefix

{

DecimalDigits

}

i を DecimalDigits の MV (12.9.3 参照) とする。
Record { [[Min]]: i, [[Max]]: i } を返す。

QuantifierPrefix

{

DecimalDigits

i を DecimalDigits の MV とする。
Record { [[Min]]: i, [[Max]]: +∞ } を返す。

QuantifierPrefix

{

DecimalDigits

}

i を最初の DecimalDigits の MV とする。
j を 2 番目の DecimalDigits の MV とする。
Record { [[Min]]: i, [[Max]]: j } を返す。

22.2.2.7 実行時セマンティクス: CompileAtom : Matcher

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS.

Note 1

この節は B.1.2.7 で修正される。

It is defined piecewise over the following productions:

Atom

PatternCharacter

ch を PatternCharacter にマッチした文字とする。
A を文字 ch を含む 1 要素 CharSet とする。
CharacterSetMatcher(rer, A, false, direction) を返す。

Atom

A を AllCharacters(rer) とする。
rer.[[DotAll]] が true でなければ
1. LineTerminator 生成規則右辺のコードポイントに対応する全ての文字を A から除去する。
CharacterSetMatcher(rer, A, false, direction) を返す。

Atom

CharacterClass

cc を CharacterClass の CompileCharacterClass (引数 rer) とする。
cs を cc.[[CharSet]] とする。
rer.[[UnicodeSets]] が false または cs の全 CharSetElement が単一文字（cs が空の場合を含む）から成るなら CharacterSetMatcher(rer, cs, cc.[[Invert]], direction) を返す。
アサート: cc.[[Invert]] は false。
lm を空の Matcher の List とする。
cs 内で 1 文字を超える文字列を含む各 CharSetElement s について長さ降順で:
1. cs2 を s の最後のコードポイントを含む 1 要素 CharSet とする。
2. m2 を CharacterSetMatcher(rer, cs2, false, direction)。
3. s の 2 番目から最後の 1 つ前までの各コードポイント c1 を逆順で:
  1. cs1 を c1 を含む 1 要素 CharSet とする。
  2. m1 を CharacterSetMatcher(rer, cs1, false, direction)。
  3. m2 を MatchSequence(m1, m2, direction) に更新。
4. m2 を lm に追加。
singles を cs のうち単一文字から成る全 CharSetElement を含む CharSet とする。
CharacterSetMatcher(rer, singles, false, direction) を lm に追加。
cs が空文字列を含むなら EmptyMatcher() を lm に追加。
m2 を lm の最後の Matcher とする。
lm の 2 番目から最後の要素を逆順に各 Matcher m1 について
1. m2 を MatchTwoAlternatives(m1, m2) に更新。
m2 を返す。

Atom

(

GroupSpecifier

opt

Disjunction

)

m を Disjunction の CompileSubpattern (引数 rer, direction) とする。
parenIndex を CountLeftCapturingParensBefore(Atom) とする。
(x, c) を引数に取り direction, m, parenIndex を捕捉し以下を行う新しい Matcher を返す:
1. アサート: x は MatchState。
2. アサート: c は MatcherContinuation。
3. (y) を引数に取り x, c, direction, parenIndex を捕捉し以下を行う新しい MatcherContinuation d を作る:
  1. アサート: y は MatchState。
  2. cap を y.[[Captures]] のコピーとする。
  3. Input を x.[[Input]] とする。
  4. xe を x.[[EndIndex]] とする。
  5. ye を y.[[EndIndex]] とする。
  6. direction が forward なら
    1. アサート: xe ≤ ye。
    2. r を CaptureRange { [[StartIndex]]: xe, [[EndIndex]]: ye } とする。
  7. それ以外
    1. アサート: direction は backward。
    2. アサート: ye ≤ xe。
    3. r を CaptureRange { [[StartIndex]]: ye, [[EndIndex]]: xe } とする。
  8. cap[parenIndex + 1] に r を設定。
  9. z を MatchState { [[Input]]: Input, [[EndIndex]]: ye, [[Captures]]: cap } とする。
  10. c(z) を返す。
4. m(x, d) を返す。

Note 2

( Disjunction ) 形式の括弧は Disjunction パターンの構成要素をグループ化し、マッチ結果を保存する。結果は後方参照（\ + 非ゼロ 10 進数）、置換文字列で参照、または正規表現マッチ抽象クロージャが返す配列の一部として利用できる。捕捉挙動を抑止するには (?: Disjunction ) を用いる。

Atom

RegularExpressionModifiers

Disjunction

)

addModifiers を RegularExpressionModifiers にマッチしたソーステキストとする。
removeModifiers を空文字列とする。
modifiedRer を UpdateModifiers(rer, CodePointsToString(addModifiers), removeModifiers) とする。
Disjunction の CompileSubpattern (引数 modifiedRer, direction) を返す。

Atom

RegularExpressionModifiers

Disjunction

)

addModifiers を最初の RegularExpressionModifiers にマッチしたソーステキストとする。
removeModifiers を 2 番目の RegularExpressionModifiers にマッチしたソーステキストとする。
modifiedRer を UpdateModifiers(rer, CodePointsToString(addModifiers), CodePointsToString(removeModifiers)) とする。
Disjunction の CompileSubpattern (引数 modifiedRer, direction) を返す。

AtomEscape

DecimalEscape

n を DecimalEscape の CapturingGroupNumber とする。
アサート: n ≤ rer.[[CapturingGroupsCount]]。
BackreferenceMatcher(rer, « n », direction) を返す。

Note 3

\ に非ゼロ 10 進数 n が続くエスケープは n 番目の捕捉括弧集合の結果にマッチする (22.2.2.1)。正規表現内の捕捉括弧数が n 未満ならエラー。n 以上あるが n 番目が何も捕捉せず undefined なら後方参照は常に成功する。

AtomEscape

CharacterEscape

cv を CharacterEscape の CharacterValue とする。
ch を character value が cv の文字とする。
A を文字 ch を含む 1 要素 CharSet とする。
CharacterSetMatcher(rer, A, false, direction) を返す。

AtomEscape

CharacterClassEscape

cs を CharacterClassEscape の CompileToCharSet (引数 rer) とする。
rer.[[UnicodeSets]] が false または cs の全 CharSetElement が単一文字（cs が空の場合含む）から成るなら CharacterSetMatcher(rer, cs, false, direction) を返す。
lm を空の Matcher の List とする。
cs 内で 1 文字を超える文字列を含む各 CharSetElement s について長さ降順で:
1. cs2 を s の最後のコードポイントを含む 1 要素 CharSet とする。
2. m2 を CharacterSetMatcher(rer, cs2, false, direction)。
3. s の 2 番目から最後の 1 つ前までの各コードポイント c1 を逆順で:
  1. cs1 を c1 を含む 1 要素 CharSet。
  2. m1 を CharacterSetMatcher(rer, cs1, false, direction)。
  3. m2 を MatchSequence(m1, m2, direction) に更新。
4. m2 を lm に追加。
singles を cs のうち単一文字から成る全 CharSetElement を含む CharSet とする。
CharacterSetMatcher(rer, singles, false, direction) を lm に追加。
cs が空文字列を含むなら EmptyMatcher() を lm に追加。
m2 を lm の最後の Matcher に。
lm の 2 番目から最後の要素を逆順に各 m1 について
1. m2 を MatchTwoAlternatives(m1, m2) に更新。
m2 を返す。

AtomEscape

GroupName

matchingGroupSpecifiers を GroupSpecifiersThatMatch(GroupName) とする。
parenIndices を新しい空 List とする。
matchingGroupSpecifiers の各 GroupSpecifier groupSpecifier について
1. parenIndex を CountLeftCapturingParensBefore(groupSpecifier) とする。
2. parenIndex を parenIndices に追加。
BackreferenceMatcher(rer, parenIndices, direction) を返す。

22.2.2.7.1 CharacterSetMatcher ( `rer`, `A`, `invert`, `direction` )

The abstract operation CharacterSetMatcher takes arguments rer (a RegExp Record), A (CharSet), invert (Boolean), and direction (forward または backward) and returns Matcher. It performs the following steps when called:

rer.[[UnicodeSets]] が true なら
1. アサート: invert は false。
2. アサート: A の全 CharSetElement は単一文字。
(x, c) を引数に取り rer, A, invert, direction を捕捉し以下を行う新しい Matcher を返す:
1. アサート: x は MatchState。
2. アサート: c は MatcherContinuation。
3. Input を x.[[Input]] とする。
4. e を x.[[EndIndex]] とする。
5. direction が forward なら f を e + 1 とし、そうでなければ f を e - 1。
6. InputLength を Input の要素数とする。
7. f < 0 または f > InputLength なら failure。
8. index を min(e, f) とする。
9. ch を Input[index] の文字とする。
10. cc を Canonicalize(rer, ch) とする。
11. A 内に正確に 1 文字 a を含む CharSetElement が存在し、Canonicalize(rer, a) が cc なら found を true、そうでなければ false。
12. invert が false かつ found が false なら failure。
13. invert が true かつ found が true なら failure。
14. cap を x.[[Captures]] とする。
15. y を MatchState { [[Input]]: Input, [[EndIndex]]: f, [[Captures]]: cap } とする。
16. c(y) を返す。

22.2.2.7.2 BackreferenceMatcher ( `rer`, `ns`, `direction` )

The abstract operation BackreferenceMatcher takes arguments rer (a RegExp Record), ns (正の整数の List), and direction (forward または backward) and returns Matcher. It performs the following steps when called:

(x, c) を引数に取り rer, ns, direction を捕捉し以下を行う新しい Matcher を返す:
1. アサート: x は MatchState。
2. アサート: c は MatcherContinuation。
3. Input を x.[[Input]] とする。
4. cap を x.[[Captures]] とする。
5. r を undefined とする。
6. 各整数 n ∈ ns について
  1. cap[n] が undefined でなければ
    1. アサート: r は undefined。
    2. r を cap[n] に設定。
7. r が undefined なら c(x) を返す。
8. e を x.[[EndIndex]] とする。
9. rs を r.[[StartIndex]] とする。
10. re を r.[[EndIndex]] とする。
11. len を re - rs とする。
12. direction が forward なら f を e + len、そうでなければ f を e - len。
13. InputLength を Input の要素数とする。
14. f < 0 または f > InputLength なら failure。
15. g を min(e, f) とする。
16. 0 ≤ i < len の整数 i で Canonicalize(rer, Input[rs + i]) ≠ Canonicalize(rer, Input[g + i]) となるものが存在するなら failure。
17. y を MatchState { [[Input]]: Input, [[EndIndex]]: f, [[Captures]]: cap } とする。
18. c(y) を返す。

22.2.2.7.3 Canonicalize ( `rer`, `ch` )

The abstract operation Canonicalize takes arguments rer (a RegExp Record) and ch (文字) and returns 文字. It performs the following steps when called:

HasEitherUnicodeFlag(rer) が true かつ rer.[[IgnoreCase]] が true なら
1. Unicode Character Database の CaseFolding.txt が ch に単純または共通のケースフォールディングを提供するなら、その写像結果を返す。
2. ch を返す。
rer.[[IgnoreCase]] が false なら ch を返す。
アサート: ch は UTF-16 コードユニット。
cp を数値が ch の数値と等しいコードポイントとする。
u を Unicode 既定ケース変換アルゴリズムに従い toUppercase(« cp ») とする。
uStr を CodePointsToString(u) とする。
uStr の長さ ≠ 1 なら ch を返す。
cu を uStr の単一コードユニット要素とする。
ch の数値 ≥ 128 かつ cu の数値 < 128 なら ch を返す。
cu を返す。

Note

HasEitherUnicodeFlag(rer) が true の大文字小文字無視マッチでは、比較直前に全ての文字が Unicode 標準の simple case folding により暗黙にフォールディングされる。simple mapping は常に単一コードポイントへ写像するため ß は ss や SS には写らない。基本ラテンブロック外から内へ写像する場合がある (例: ſ → s, K → k)。これらを含む文字列は /[a-z]/ui などでマッチする。

HasEitherUnicodeFlag(rer) が false の大文字小文字無視マッチでは toCasefold ではなく toUppercase に基づくため差異がある。例: Ω は toUppercase では自身、toCasefold では ω に写るため "\u2126" は /[ω]/ui や /[\u03A9]/ui にマッチするが /[ω]/i や /[\u03A9]/i にはマッチしない。また基本ラテン外から内への写像は行われないので "\u017F ſ", "\u212A K" は /[a-z]/i にマッチしない。

22.2.2.7.4 UpdateModifiers ( `rer`, `add`, `remove` )

The abstract operation UpdateModifiers takes arguments rer (a RegExp Record), add (String), and remove (String) and returns RegExp Record. It performs the following steps when called:

アサート: add と remove は共通要素を持たない。
ignoreCase を rer.[[IgnoreCase]]。
multiline を rer.[[Multiline]]。
dotAll を rer.[[DotAll]]。
unicode を rer.[[Unicode]]。
unicodeSets を rer.[[UnicodeSets]]。
capturingGroupsCount を rer.[[CapturingGroupsCount]]。
remove に "i" が含まれるなら ignoreCase を false に。
そうでなく add に "i" が含まれるなら ignoreCase を true に。
remove に "m" が含まれるなら multiline を false に。
そうでなく add に "m" が含まれるなら multiline を true に。
remove に "s" が含まれるなら dotAll を false に。
そうでなく add に "s" が含まれるなら dotAll を true に。
RegExp Record { [[IgnoreCase]]: ignoreCase, [[Multiline]]: multiline, [[DotAll]]: dotAll, [[Unicode]]: unicode, [[UnicodeSets]]: unicodeSets, [[CapturingGroupsCount]]: capturingGroupsCount } を返す。

22.2.2.8 実行時セマンティクス: CompileCharacterClass を持つ Record

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It is defined piecewise over the following productions:

CharacterClass

[

ClassContents

]

A を ClassContents の CompileToCharSet (引数 rer) とする。
Record { [[CharSet]]: A, [[Invert]]: false } を返す。

CharacterClass

ClassContents

]

A を ClassContents の CompileToCharSet (引数 rer) とする。
rer.[[UnicodeSets]] が true なら
1. Record { [[CharSet]]: CharacterComplement(rer, A), [[Invert]]: false } を返す。
Record { [[CharSet]]: A, [[Invert]]: true } を返す。

22.2.2.9 実行時セマンティクス: CompileToCharSet : CharSet

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS.

Note 1

この節は B.1.2.8 で修正される。

It is defined piecewise over the following productions:

ClassContents

[empty]

空の CharSet を返す。

NonemptyClassRanges

ClassAtom

NonemptyClassRangesNoDash

A を ClassAtom の CompileToCharSet (引数 rer)。
B を NonemptyClassRangesNoDash の CompileToCharSet (引数 rer)。
CharSet A と B の和集合を返す。

NonemptyClassRanges

ClassAtom

ClassContents

A を最初の ClassAtom の CompileToCharSet (引数 rer)。
B を 2 番目の ClassAtom の CompileToCharSet (引数 rer)。
C を ClassContents の CompileToCharSet (引数 rer)。
D を CharacterRange(A, B)。
D と C の和集合を返す。

NonemptyClassRangesNoDash

ClassAtomNoDash

NonemptyClassRangesNoDash

A を ClassAtomNoDash の CompileToCharSet (引数 rer)。
B を NonemptyClassRangesNoDash の CompileToCharSet (引数 rer)。
CharSet A と B の和集合を返す。

NonemptyClassRangesNoDash

ClassAtomNoDash

ClassAtom

ClassContents

A を ClassAtomNoDash の CompileToCharSet (引数 rer)。
B を ClassAtom の CompileToCharSet (引数 rer)。
C を ClassContents の CompileToCharSet (引数 rer)。
D を CharacterRange(A, B)。
D と C の和集合を返す。

Note 2

ClassContents は単一の ClassAtom、およびダッシュで区切られた 2 つの ClassAtom の範囲になり得る。後者の場合、ClassContents には第 1 と第 2 の ClassAtom 間（含む）の全ての文字が含まれる。どちらかの ClassAtom が単一文字を表さない (例: \w) 場合、または第 1 の ClassAtom の character value が第 2 のそれより大きい場合はエラー。

Note 3

パターンが大文字小文字を無視する場合でも、範囲両端の大文字小文字は範囲に含まれる文字を決定する上で重要。例: /[E-F]/i は E, F, e, f のみ、/[E-f]/i は Unicode Basic Latin ブロックの全大文字小文字および [, \, ], ^, _, ` にマッチ。

Note 4

- は文字通りにも範囲指定にも使える。ClassContents の先頭または末尾、範囲指定の開始/終了端、または範囲指定直後に現れる場合はリテラルとして扱われる。

ClassAtom

単一文字 - U+002D (HYPHEN-MINUS) を含む CharSet を返す。

ClassAtomNoDash

SourceCharacter

but not one of \ or ] or -

SourceCharacter にマッチした文字を含む CharSet を返す。

ClassEscape

CharacterEscape

cv をこの ClassEscape の CharacterValue。
c を character value が cv の文字。
c を含む 1 要素 CharSet を返す。

Note 5

ClassAtom 内では、\b, \B, 後方参照を除く正規表現中で許されるエスケープを利用できる。CharacterClass 内では \b はバックスペース文字、\B と後方参照はエラー。ClassAtom 内で後方参照を用いるとエラー。

CharacterClassEscape

文字 0〜9 を含む 10 要素 CharSet を返す。

CharacterClassEscape

S を CharacterClassEscape :: d の返す CharSet とする。
CharacterComplement(rer, S) を返す。

CharacterClassEscape

WhiteSpace または LineTerminator 生成規則右辺のコードポイントに対応する全ての文字を含む CharSet を返す。

CharacterClassEscape

S を CharacterClassEscape :: s の返す CharSet とする。
CharacterComplement(rer, S) を返す。

CharacterClassEscape

MaybeSimpleCaseFolding(rer, WordCharacters(rer)) を返す。

CharacterClassEscape

S を CharacterClassEscape :: w の返す CharSet とする。
CharacterComplement(rer, S) を返す。

CharacterClassEscape

UnicodePropertyValueExpression

}

UnicodePropertyValueExpression の CompileToCharSet (引数 rer) を返す。

CharacterClassEscape

UnicodePropertyValueExpression

}

S を UnicodePropertyValueExpression の CompileToCharSet (引数 rer) とする。
アサート: S は単一コードポイントのみ含む。
CharacterComplement(rer, S) を返す。

UnicodePropertyValueExpression

UnicodePropertyName

UnicodePropertyValue

ps を UnicodePropertyName にマッチしたソーステキストとする。
p を UnicodeMatchProperty(rer, ps) とする。
ただし、p は Table 67 の「Property name and aliases」列に記載された Unicode property name またはプロパティエイリアスであることを保証する。
vs を UnicodePropertyValue にマッチしたソーステキストとする。
v を UnicodeMatchPropertyValue(p, vs) とする。
A を、プロパティ p が値 v を持つという文字データベース定義を含むすべての Unicode コードポイントからなる CharSet とする。
MaybeSimpleCaseFolding(rer, A) を返す。

UnicodePropertyValueExpression

LoneUnicodePropertyNameOrValue

s を LoneUnicodePropertyNameOrValue にマッチしたソーステキストとする。
UnicodeMatchPropertyValue(General_Category, s) が PropertyValueAliases.txt に記載された General_Category (gc) プロパティの Unicode プロパティ値またはプロパティ値エイリアスである場合、
1. プロパティ “General_Category” が値 s を持つという文字データベース定義を含むすべての Unicode コードポイントからなる CharSet を返す。
p を UnicodeMatchProperty(rer, s) とする。
ただし、p は Table 68 の「Property name and aliases」列に記載されたバイナリ Unicode プロパティまたはバイナリプロパティエイリアス、または Table 69 の「Property name」列に記載された文字列のバイナリ Unicode プロパティであることを保証する。
A を、プロパティ p が値 “True” を持つという文字データベース定義を含むすべての CharSetElement からなる CharSet とする。
MaybeSimpleCaseFolding(rer, A) を返す。

ClassUnion

ClassSetRange

ClassUnion

opt

A を ClassSetRange の CompileToCharSet (引数 rer)。
ClassUnion が存在するなら
1. B を ClassUnion の CompileToCharSet (引数 rer)。
2. CharSet A と B の和集合を返す。
A を返す。

ClassUnion

ClassSetOperand

ClassUnion

opt

A を ClassSetOperand の CompileToCharSet (引数 rer)。
ClassUnion が存在するなら
1. B を ClassUnion の CompileToCharSet (引数 rer)。
2. CharSet A と B の和集合を返す。
A を返す。

ClassIntersection

ClassSetOperand

A を最初の ClassSetOperand の CompileToCharSet (引数 rer)。
B を 2 番目の ClassSetOperand の CompileToCharSet (引数 rer)。
CharSet A と B の共通部分を返す。

ClassIntersection

ClassSetOperand

A を ClassIntersection の CompileToCharSet (引数 rer)。
B を ClassSetOperand の CompileToCharSet (引数 rer)。
CharSet A と B の共通部分を返す。

ClassSubtraction

ClassSetOperand

最初の ClassSetOperand の CompileToCharSet (引数 rer) を A。
2 番目の ClassSetOperand の CompileToCharSet (引数 rer) を B。
A のうち B でない CharSetElement を含む CharSet を返す。

ClassSubtraction

ClassSetOperand

A を ClassSubtraction の CompileToCharSet (引数 rer)。
B を ClassSetOperand の CompileToCharSet (引数 rer)。
A のうち B でない CharSetElement を含む CharSet を返す。

ClassSetRange

ClassSetCharacter

A を最初の ClassSetCharacter の CompileToCharSet (引数 rer)。
B を 2 番目の ClassSetCharacter の CompileToCharSet (引数 rer)。
MaybeSimpleCaseFolding(rer, CharacterRange(A, B)) を返す。

Note 6

結果はしばしば 2 個以上の範囲で構成される。UnicodeSets が true かつ IgnoreCase が true のとき、MaybeSimpleCaseFolding(rer, [Ā-č]) はその範囲の奇数番コードポイントのみを含む。

ClassSetOperand

ClassSetCharacter

A を ClassSetCharacter の CompileToCharSet (引数 rer)。
MaybeSimpleCaseFolding(rer, A) を返す。

ClassSetOperand

ClassStringDisjunction

A を ClassStringDisjunction の CompileToCharSet (引数 rer)。
MaybeSimpleCaseFolding(rer, A) を返す。

ClassSetOperand

NestedClass

NestedClass の CompileToCharSet (引数 rer) を返す。

NestedClass

[

ClassContents

]

ClassContents の CompileToCharSet (引数 rer) を返す。

NestedClass

ClassContents

]

A を ClassContents の CompileToCharSet (引数 rer)。
CharacterComplement(rer, A) を返す。

NestedClass

CharacterClassEscape

CharacterClassEscape の CompileToCharSet (引数 rer) を返す。

ClassStringDisjunction

\q{

ClassStringDisjunctionContents

}

ClassStringDisjunctionContents の CompileToCharSet (引数 rer) を返す。

ClassStringDisjunctionContents

ClassString

s を ClassString の CompileClassSetString (引数 rer)。
文字列 s を 1 つだけ含む CharSet を返す。

ClassStringDisjunctionContents

ClassString

ClassStringDisjunctionContents

s を ClassString の CompileClassSetString (引数 rer)。
A を文字列 s を 1 つ含む CharSet。
B を ClassStringDisjunctionContents の CompileToCharSet (引数 rer)。
CharSet A と B の和集合を返す。

ClassSetCharacter

SourceCharacter

but not ClassSetSyntaxCharacter

CharacterEscape

ClassSetReservedPunctuator

cv をこの ClassSetCharacter の CharacterValue。
c を character value が cv の文字。
c を含む 1 要素 CharSet を返す。

ClassSetCharacter

U+0008 (BACKSPACE) を含む 1 要素 CharSet を返す。

22.2.2.9.1 CharacterRange ( `A`, `B` )

The abstract operation CharacterRange takes arguments A (CharSet) and B (CharSet) and returns CharSet. It performs the following steps when called:

アサート: A, B はそれぞれ正確に 1 文字を含む。
a を CharSet A の唯一の文字。
b を CharSet B の唯一の文字。
i を文字 a の character value。
j を文字 b の character value。
アサート: i ≤ j。
i から j まで（含む）の character value を持つ全ての文字を含む CharSet を返す。

22.2.2.9.2 HasEitherUnicodeFlag ( `rer` )

The abstract operation HasEitherUnicodeFlag takes argument rer (a RegExp Record) and returns Boolean. It performs the following steps when called:

rer.[[Unicode]] が true または rer.[[UnicodeSets]] が true なら
1. true を返す。
false を返す。

22.2.2.9.3 WordCharacters ( `rer` )

The abstract operation WordCharacters takes argument rer (a RegExp Record) and returns CharSet. \b, \B, \w, \W のために「word characters」と見なされる文字を含む CharSet を返す。 It performs the following steps when called:

basicWordChars を ASCII の word characters 全てを含む CharSet。
extraWordChars を、basicWordChars には含まれないが Canonicalize(rer, c) が basicWordChars に含まれる全ての文字 c を含む CharSet。
アサート: HasEitherUnicodeFlag(rer) が true かつ rer.[[IgnoreCase]] が true でない限り extraWordChars は空。
basicWordChars と extraWordChars の和集合を返す。

22.2.2.9.4 AllCharacters ( `rer` )

The abstract operation AllCharacters takes argument rer (a RegExp Record) and returns CharSet. 正規表現フラグに従う「全ての文字」の集合を返す。 It performs the following steps when called:

rer.[[UnicodeSets]] が true かつ rer.[[IgnoreCase]] が true なら
1. Simple Case Folding を持たない (scf(c) = c) 全 Unicode コードポイント c を含む CharSet を返す。
それ以外で HasEitherUnicodeFlag(rer) が true なら
1. 全コードポイント値を含む CharSet を返す。
それ以外
1. 全コードユニット値を含む CharSet を返す。

22.2.2.9.5 MaybeSimpleCaseFolding ( `rer`, `A` )

The abstract operation MaybeSimpleCaseFolding takes arguments rer (a RegExp Record) and A (CharSet) and returns CharSet. rer.[[UnicodeSets]] が false または rer.[[IgnoreCase]] が false なら A を返す。そうでなければ Simple Case Folding (scf(cp)) の定義 (CaseFolding.txt) を用い、A の各 CharSetElement を文字ごとに正規化して得られる CharSet を返す。 It performs the following steps when called:

rer.[[UnicodeSets]] が false または rer.[[IgnoreCase]] が false なら A を返す。
B を新しい空の CharSet。
A の各 CharSetElement s について
1. t を空の文字列シーケンス。
2. s 内の各単一コードポイント cp について
  1. scf(cp) を t に追加。
3. t を B に追加。
B を返す。

22.2.2.9.6 CharacterComplement ( `rer`, `S` )

The abstract operation CharacterComplement takes arguments rer (a RegExp Record) and S (CharSet) and returns CharSet. It performs the following steps when called:

A を AllCharacters(rer)。
A のうち S に含まれない CharSetElement を含む CharSet を返す。

22.2.2.9.7 UnicodeMatchProperty ( `rer`, `p` )

The abstract operation UnicodeMatchProperty takes arguments rer (RegExp レコード) and p (ECMAScript ソーステキスト) and returns Unicode property name. It performs the following steps when called:

rer.[[UnicodeSets]] が true であり、かつ p が Table 69 の「Property name」列に記載されている Unicode property name である場合、
1. Unicode コードポイント p のリストを返す。
p は Table 67 または Table 68 の「Property name and aliases」列に記載されている Unicode property name またはプロパティエイリアスであることを保証する。
c を該当する行の「Canonical property name」列に記載されている p の正規 property name とする。
Unicode コードポイント c のリストを返す。

実装は、Table 67、Table 68、および Table 69 に記載されている Unicode property names およびエイリアスをサポートしなければならない。相互運用性を確保するため、他のproperty namesやエイリアスはサポートしてはならない。

Note 1

例えば、Script_Extensions（property name）や scx（プロパティエイリアス）は有効だが、script_extensions や Scx は無効である。

Note 2

掲載されているプロパティは UTS18 RL1.2 の要件の上位集合である。

Note 3

これらの表のエントリのスペル（大文字小文字を含む）は、Unicode Character Database の PropertyAliases.txt ファイルで使われているスペルと一致する。そのファイル内の正確なスペルは安定性が保証されている。

Table 67: Non-binary Unicode property aliases and their canonical property names

Property name and aliases	Canonical property name
`General_Category`	`General_Category`
`gc`	`General_Category`
`Script`	`Script`
`sc`	`Script`
`Script_Extensions`	`Script_Extensions`
`scx`	`Script_Extensions`

Table 68: Binary Unicode property aliases and their canonical property names

Property name and aliases	Canonical property name
`ASCII`	`ASCII`
`ASCII_Hex_Digit`	`ASCII_Hex_Digit`
`AHex`	`ASCII_Hex_Digit`
`Alphabetic`	`Alphabetic`
`Alpha`	`Alphabetic`
`Any`	`Any`
`Assigned`	`Assigned`
`Bidi_Control`	`Bidi_Control`
`Bidi_C`	`Bidi_Control`
`Bidi_Mirrored`	`Bidi_Mirrored`
`Bidi_M`	`Bidi_Mirrored`
`Case_Ignorable`	`Case_Ignorable`
`CI`	`Case_Ignorable`
`Cased`	`Cased`
`Changes_When_Casefolded`	`Changes_When_Casefolded`
`CWCF`	`Changes_When_Casefolded`
`Changes_When_Casemapped`	`Changes_When_Casemapped`
`CWCM`	`Changes_When_Casemapped`
`Changes_When_Lowercased`	`Changes_When_Lowercased`
`CWL`	`Changes_When_Lowercased`
`Changes_When_NFKC_Casefolded`	`Changes_When_NFKC_Casefolded`
`CWKCF`	`Changes_When_NFKC_Casefolded`
`Changes_When_Titlecased`	`Changes_When_Titlecased`
`CWT`	`Changes_When_Titlecased`
`Changes_When_Uppercased`	`Changes_When_Uppercased`
`CWU`	`Changes_When_Uppercased`
`Dash`	`Dash`
`Default_Ignorable_Code_Point`	`Default_Ignorable_Code_Point`
`DI`	`Default_Ignorable_Code_Point`
`Deprecated`	`Deprecated`
`Dep`	`Deprecated`
`Diacritic`	`Diacritic`
`Dia`	`Diacritic`
`Emoji`	`Emoji`
`Emoji_Component`	`Emoji_Component`
`EComp`	`Emoji_Component`
`Emoji_Modifier`	`Emoji_Modifier`
`EMod`	`Emoji_Modifier`
`Emoji_Modifier_Base`	`Emoji_Modifier_Base`
`EBase`	`Emoji_Modifier_Base`
`Emoji_Presentation`	`Emoji_Presentation`
`EPres`	`Emoji_Presentation`
`Extended_Pictographic`	`Extended_Pictographic`
`ExtPict`	`Extended_Pictographic`
`Extender`	`Extender`
`Ext`	`Extender`
`Grapheme_Base`	`Grapheme_Base`
`Gr_Base`	`Grapheme_Base`
`Grapheme_Extend`	`Grapheme_Extend`
`Gr_Ext`	`Grapheme_Extend`
`Hex_Digit`	`Hex_Digit`
`Hex`	`Hex_Digit`
`IDS_Binary_Operator`	`IDS_Binary_Operator`
`IDSB`	`IDS_Binary_Operator`
`IDS_Trinary_Operator`	`IDS_Trinary_Operator`
`IDST`	`IDS_Trinary_Operator`
`ID_Continue`	`ID_Continue`
`IDC`	`ID_Continue`
`ID_Start`	`ID_Start`
`IDS`	`ID_Start`
`Ideographic`	`Ideographic`
`Ideo`	`Ideographic`
`Join_Control`	`Join_Control`
`Join_C`	`Join_Control`
`Logical_Order_Exception`	`Logical_Order_Exception`
`LOE`	`Logical_Order_Exception`
`Lowercase`	`Lowercase`
`Lower`	`Lowercase`
`Math`	`Math`
`Noncharacter_Code_Point`	`Noncharacter_Code_Point`
`NChar`	`Noncharacter_Code_Point`
`Pattern_Syntax`	`Pattern_Syntax`
`Pat_Syn`	`Pattern_Syntax`
`Pattern_White_Space`	`Pattern_White_Space`
`Pat_WS`	`Pattern_White_Space`
`Quotation_Mark`	`Quotation_Mark`
`QMark`	`Quotation_Mark`
`Radical`	`Radical`
`Regional_Indicator`	`Regional_Indicator`
`RI`	`Regional_Indicator`
`Sentence_Terminal`	`Sentence_Terminal`
`STerm`	`Sentence_Terminal`
`Soft_Dotted`	`Soft_Dotted`
`SD`	`Soft_Dotted`
`Terminal_Punctuation`	`Terminal_Punctuation`
`Term`	`Terminal_Punctuation`
`Unified_Ideograph`	`Unified_Ideograph`
`UIdeo`	`Unified_Ideograph`
`Uppercase`	`Uppercase`
`Upper`	`Uppercase`
`Variation_Selector`	`Variation_Selector`
`VS`	`Variation_Selector`
`White_Space`	`White_Space`
`space`	`White_Space`
`XID_Continue`	`XID_Continue`
`XIDC`	`XID_Continue`
`XID_Start`	`XID_Start`
`XIDS`	`XID_Start`

Table 69: Binary Unicode properties of strings

Property name
`Basic_Emoji`
`Emoji_Keycap_Sequence`
`RGI_Emoji_Modifier_Sequence`
`RGI_Emoji_Flag_Sequence`
`RGI_Emoji_Tag_Sequence`
`RGI_Emoji_ZWJ_Sequence`
`RGI_Emoji`

22.2.2.9.8 UnicodeMatchPropertyValue ( `p`, `v` )

The abstract operation UnicodeMatchPropertyValue takes arguments p (ECMAScript ソーステキスト) and v (ECMAScript ソーステキスト) and returns Unicode プロパティ値. It performs the following steps when called:

p が Table 67 の「Canonical property name」列に記載されている正規・非エイリアスの Unicode property name であることを保証する。
v が PropertyValueAliases.txt に記載されている Unicode プロパティ p のプロパティ値またはプロパティ値エイリアスであることを保証する。
value を該当する行の「Canonical property value」列に記載されている v の正規プロパティ値とする。
Unicode コードポイント value のリストを返す。

実装は、Table 67 に記載されたプロパティについて、PropertyValueAliases.txt に記載されている Unicode プロパティ値およびプロパティ値エイリアスをサポートしなければならない。相互運用性を確保するため、他のプロパティ値やプロパティ値エイリアスはサポートしてはならない。

Note 1

例えば、Xpeo や Old_Persian は Script_Extensions の有効な値だが、xpeo や Old Persian は無効である。

Note 2

このアルゴリズムは UAX44 で記載されている記号値のマッチングルールとは異なる：大文字・小文字、空白、U+002D（ハイフンマイナス）、および U+005F（アンダースコア）は無視されず、Is プレフィックスもサポートされない。

22.2.2.10 実行時セマンティクス: CompileClassSetString : 文字列シーケンス

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It is defined piecewise over the following productions:

ClassString

[empty]

空の文字列シーケンスを返す。

ClassString

NonEmptyClassString

NonEmptyClassString の CompileClassSetString (引数 rer) を返す。

NonEmptyClassString

ClassSetCharacter

NonEmptyClassString

opt

cs を ClassSetCharacter の CompileToCharSet (引数 rer) とする。
s1 を cs の単一 CharSetElement である文字列シーケンスとする。
NonEmptyClassString が存在するなら
1. s2 を NonEmptyClassString の CompileClassSetString (引数 rer)。
2. s1 と s2 の連結を返す。
s1 を返す。

22.2.3 RegExp 生成のための抽象操作

22.2.3.1 RegExpCreate ( `P`, `F` )

The abstract operation RegExpCreate takes arguments P (ECMAScript 言語値) and F (String または undefined) and returns オブジェクトを含む通常完了または throw 完了. It performs the following steps when called:

obj を ! RegExpAlloc(%RegExp%) とする。
? RegExpInitialize(obj, P, F) を返す。

22.2.3.2 RegExpAlloc ( `newTarget` )

The abstract operation RegExpAlloc takes argument newTarget (constructor) and returns オブジェクトを含む通常完了または throw 完了. It performs the following steps when called:

obj を ? OrdinaryCreateFromConstructor(newTarget, "%RegExp.prototype%", « [[OriginalSource]], [[OriginalFlags]], [[RegExpRecord]], [[RegExpMatcher]] ») とする。
! DefinePropertyOrThrow(obj, "lastIndex", PropertyDescriptor { [[Writable]]: true, [[Enumerable]]: false, [[Configurable]]: false }) を実行する。
obj を返す。

22.2.3.3 RegExpInitialize ( `obj`, `pattern`, `flags` )

The abstract operation RegExpInitialize takes arguments obj (オブジェクト), pattern (ECMAScript 言語値), and flags (ECMAScript 言語値) and returns オブジェクトを含む通常完了または throw 完了. It performs the following steps when called:

pattern が undefined なら P を空文字列とする。
そうでなければ P を ? ToString(pattern) とする。
flags が undefined なら F を空文字列とする。
そうでなければ F を ? ToString(flags) とする。
F が "d", "g", "i", "m", "s", "u", "v", "y" 以外のコードユニットを含むか、あるいは同じコードユニットを複数回含むなら SyntaxError 例外を投げる。
F が "i" を含むなら i を true、そうでなければ false とする。
F が "m" を含むなら m を true、そうでなければ false とする。
F が "s" を含むなら s を true、そうでなければ false とする。
F が "u" を含むなら u を true、そうでなければ false とする。
F が "v" を含むなら v を true、そうでなければ false とする。
u が true もしくは v が true の場合
1. patternText を StringToCodePoints(P) とする。
そうでなければ
1. patternText を P の各 16-bit 要素を Unicode BMP コードポイントとして解釈した結果とする (UTF-16 デコードは行わない)。
parseResult を ParsePattern(patternText, u, v) とする。
parseResult が空でない SyntaxError オブジェクトの List なら SyntaxError 例外を投げる。
アサート: parseResult は Pattern パースノードである。
obj.[[OriginalSource]] に P を設定する。
obj.[[OriginalFlags]] に F を設定する。
capturingGroupsCount を CountLeftCapturingParensWithin(parseResult) とする。
rer を RegExp Record { [[IgnoreCase]]: i, [[Multiline]]: m, [[DotAll]]: s, [[Unicode]]: u, [[UnicodeSets]]: v, [[CapturingGroupsCount]]: capturingGroupsCount } とする。
obj.[[RegExpRecord]] に rer を設定する。
obj.[[RegExpMatcher]] に引数 rer で parseResult の CompilePattern を設定する。
? Set(obj, "lastIndex", +0_𝔽, true) を実行する。
obj を返す。

22.2.3.4 静的セマンティクス: ParsePattern ( `patternText`: Unicode コードポイント列, `u`: Boolean, `v`: Boolean, ): パースノードまたは空でない SyntaxError オブジェクト List

The abstract operation UNKNOWN takes UNPARSEABLE ARGUMENTS.

Note

この節は B.1.2.9 で修正される。

It performs the following steps when called:

v が true かつ u が true なら
1. parseResult を 1 個以上の SyntaxError オブジェクトを含む List とする。
そうでなく v が true なら
1. parseResult を ParseText(patternText, Pattern[+UnicodeMode, +UnicodeSetsMode, +NamedCaptureGroups]) とする。
そうでなく u が true なら
1. parseResult を ParseText(patternText, Pattern[+UnicodeMode, ~UnicodeSetsMode, +NamedCaptureGroups]) とする。
それ以外
1. parseResult を ParseText(patternText, Pattern[~UnicodeMode, ~UnicodeSetsMode, +NamedCaptureGroups]) とする。
parseResult を返す。

22.2.4 RegExp コンストラクター

RegExp コンストラクター:

%RegExp% である。
グローバルオブジェクトの "RegExp" プロパティの初期値である。
コンストラクターとして呼び出されたとき新しい RegExp オブジェクトを生成し初期化する。
関数として呼び出された場合、新しい RegExp オブジェクトを返すか、引数が RegExp オブジェクト 1 つのみならその引数自体を返す。
クラス定義の extends 句の値として使用できる。指定された RegExp の挙動を継承するサブクラスのコンストラクターは、必要な内部スロットを持つサブクラスインスタンスを生成・初期化するため RegExp コンストラクターへの super 呼び出しを含めなければならない。

22.2.4.1 RegExp ( `pattern`, `flags` )

この関数は呼び出し時に以下を行う:

patternIsRegExp を ? IsRegExp(pattern) とする。
NewTarget が undefined なら
1. newTarget をアクティブな関数オブジェクトとする。
2. patternIsRegExp が true かつ flags が undefined なら
  1. patternConstructor を ? Get(pattern, "constructor") とする。
  2. SameValue(newTarget, patternConstructor) が true なら pattern を返す。
そうでなければ
1. newTarget を NewTarget とする。
pattern がオブジェクトで [[RegExpMatcher]] 内部スロットを持つなら
1. P を pattern.[[OriginalSource]] とする。
2. flags が undefined なら F を pattern.[[OriginalFlags]] とし、そうでなければ F を flags とする。
そうでなく patternIsRegExp が true なら
1. P を ? Get(pattern, "source") とする。
2. flags が undefined なら
  1. F を ? Get(pattern, "flags") とする。
3. そうでなければ
  1. F を flags とする。
それ以外
1. P を pattern とする。
2. F を flags とする。
O を ? RegExpAlloc(newTarget) とする。
? RegExpInitialize(O, P, F) を返す。

Note

pattern が StringLiteral で与えられる場合、通常のエスケープシーケンス置換が本関数で処理される前に適用される。pattern がこの関数に認識させるためにエスケープシーケンスを含む必要があるなら、StringLiteral 内で U+005C (REVERSE SOLIDUS) は削除されないようエスケープされなければならない。

22.2.5 RegExp コンストラクターのプロパティ

RegExp コンストラクター:

値 %Function.prototype% の [[Prototype]] 内部スロットを持つ。
以下のプロパティを持つ:

22.2.5.1 RegExp.escape ( `S` )

この関数は、正規表現 Pattern 内で特別な意味を持ち得る文字が等価なエスケープシーケンスに置換された S のコピーを返す。

呼び出し時に以下を行う:

S が String でなければ TypeError 例外を投げる。
escaped を空文字列とする。
cpList を StringToCodePoints(S) とする。
cpList の各コードポイント cp について
1. escaped が空文字列でかつ cp が DecimalDigit または AsciiLetter のいずれかにマッチするなら
  1. 注記: 先頭の数字をエスケープすることで、\0 や \1 などの DecimalEscape の後で文字列 S をマッチさせる際、前のエスケープシーケンス拡張と解釈されるのを防ぐ。先頭の ASCII 文字も \c の後の文脈で同様。
  2. numericValue を cp の数値とする。
  3. hex を Number::toString(𝔽(numericValue), 16) とする。
  4. アサート: hex の長さは 2。
  5. escaped を 0x005C (REVERSE SOLIDUS), "x", hex の連結に設定する。
2. そうでなければ
  1. escaped を escaped と EncodeForRegExpEscape(cp) の連結に設定する。
escaped を返す。

Note

名前が類似していても EscapeRegExpPattern と RegExp.escape は異なる働きをする。前者はパターンを文字列表現へエスケープし、後者は文字列をパターン内表現のためにエスケープする。

22.2.5.1.1 EncodeForRegExpEscape ( `cp` )

The abstract operation EncodeForRegExpEscape takes argument cp (コードポイント) and returns String. cp をマッチする Pattern 表現用 String を返す。cp が空白または ASCII 句読文字なら返値はエスケープシーケンス、それ以外は cp 自身の String 表現。 It performs the following steps when called:

cp が SyntaxCharacter にマッチするか、cp が U+002F (SOLIDUS) なら
1. 0x005C (REVERSE SOLIDUS) と UTF16EncodeCodePoint(cp) の連結を返す。
そうでなく cp が Table 65 の “Code Point” 列に列挙されるコードポイントなら
1. 0x005C (REVERSE SOLIDUS) と対応行の “ControlEscape” 列の文字列との連結を返す。
otherPunctuators を ",-=<>#&!%:;@~'`" とコードユニット 0x0022 (QUOTATION MARK) の連結とする。
toEscape を StringToCodePoints(otherPunctuators) とする。
toEscape が cp を含む、または cp が WhiteSpace もしくは LineTerminator にマッチする、または cp が先行サロゲートまたは後続サロゲートと同じ数値を持つなら
1. cpNum を cp の数値とする。
2. cpNum ≤ 0xFF なら
  1. hex を Number::toString(𝔽(cpNum), 16) とする。
  2. 0x005C (REVERSE SOLIDUS), "x", StringPad(hex, 2, "0", start) の連結を返す。
3. escaped を空文字列とする。
4. codeUnits を UTF16EncodeCodePoint(cp) とする。
5. 各コードユニット cu について
  1. escaped を escaped と UnicodeEscape(cu) の連結にする。
6. escaped を返す。
UTF16EncodeCodePoint(cp) を返す。

22.2.5.2 RegExp.prototype

RegExp.prototype の初期値は RegExp プロトタイプオブジェクトである。

このプロパティの属性は { [[Writable]]: false, [[Enumerable]]: false, [[Configurable]]: false } である。

22.2.5.3 get RegExp [ %Symbol.species% ]

RegExp[%Symbol.species%] は set アクセサが undefined のアクセサプロパティであり、get アクセサは呼び出し時以下を行う:

this 値を返す。

この関数の "name" プロパティ値は "get [Symbol.species]" である。

Note

RegExp プロトタイプメソッドは通常 this 値の constructor を用いて派生オブジェクトを生成する。サブクラスの constructor は %Symbol.species% プロパティを再定義することで既定挙動を上書きできる。

22.2.6 RegExp プロトタイプオブジェクトのプロパティ

RegExp プロトタイプオブジェクト:

%RegExp.prototype% である。
通常のオブジェクトである。
RegExp インスタンスではなく [[RegExpMatcher]] 内部スロットや他の RegExp インスタンス内部スロットを持たない。
[[Prototype]] 内部スロットの値は %Object.prototype% である。

Note

RegExp プロトタイプオブジェクトは自身の "valueOf" プロパティを持たないが、Object プロトタイプオブジェクトから継承する。

22.2.6.1 RegExp.prototype.constructor

RegExp.prototype.constructor の初期値は %RegExp% である。

22.2.6.2 RegExp.prototype.exec ( `string` )

このメソッドは string 内で正規表現パターンの出現を検索し、マッチ結果を含む Array を返す。マッチしなければ null を返す。

呼び出し時に以下を行う:

R を this 値とする。
? RequireInternalSlot(R, [[RegExpMatcher]]) を実行する。
S を ? ToString(string) とする。
? RegExpBuiltinExec(R, S) を返す。

22.2.6.3 get RegExp.prototype.dotAll

RegExp.prototype.dotAll は set アクセサが undefined のアクセサプロパティであり、get アクセサは以下を行う:

R を this 値とする。
cu をコードユニット 0x0073 (LATIN SMALL LETTER S) とする。
? RegExpHasFlag(R, cu) を返す。

22.2.6.4 get RegExp.prototype.flags

RegExp.prototype.flags は set アクセサが undefined のアクセサプロパティであり、get アクセサは以下を行う:

R を this 値とする。
R がオブジェクトでなければ TypeError 例外を投げる。
codeUnits を空 List とする。
hasIndices を ToBoolean(? Get(R, "hasIndices")) とする。
hasIndices が true ならコードユニット 0x0064 (LATIN SMALL LETTER D) を codeUnits に追加。
global を ToBoolean(? Get(R, "global")) とする。
global が true なら 0x0067 (LATIN SMALL LETTER G) を追加。
ignoreCase を ToBoolean(? Get(R, "ignoreCase")) とする。
ignoreCase が true なら 0x0069 (LATIN SMALL LETTER I) を追加。
multiline を ToBoolean(? Get(R, "multiline")) とする。
multiline が true なら 0x006D (LATIN SMALL LETTER M) を追加。
dotAll を ToBoolean(? Get(R, "dotAll")) とする。
dotAll が true なら 0x0073 (LATIN SMALL LETTER S) を追加。
unicode を ToBoolean(? Get(R, "unicode")) とする。
unicode が true なら 0x0075 (LATIN SMALL LETTER U) を追加。
unicodeSets を ToBoolean(? Get(R, "unicodeSets")) とする。
unicodeSets が true なら 0x0076 (LATIN SMALL LETTER V) を追加。
sticky を ToBoolean(? Get(R, "sticky")) とする。
sticky が true なら 0x0079 (LATIN SMALL LETTER Y) を追加。
codeUnits の要素をコードユニットとする String 値を返す。要素が無ければ空文字列を返す。

22.2.6.4.1 RegExpHasFlag ( `R`, `codeUnit` )

The abstract operation RegExpHasFlag takes arguments R (ECMAScript 言語値) and codeUnit (コードユニット) and returns Boolean または undefined を含む通常完了または throw 完了. It performs the following steps when called:

R がオブジェクトでなければ TypeError 例外。
R が [[OriginalFlags]] 内部スロットを持たないなら
1. SameValue(R, %RegExp.prototype%) が true なら undefined を返す。
2. そうでなければ TypeError 例外。
flags を R.[[OriginalFlags]] とする。
flags が codeUnit を含むなら true を返す。
false を返す。

22.2.6.5 get RegExp.prototype.global

RegExp.prototype.global は set アクセサが undefined のアクセサプロパティであり、get アクセサは以下を行う:

R を this 値。
cu を 0x0067 (LATIN SMALL LETTER G)。
? RegExpHasFlag(R, cu) を返す。

22.2.6.6 get RegExp.prototype.hasIndices

RegExp.prototype.hasIndices は set アクセサが undefined のアクセサプロパティであり、get アクセサは以下を行う:

R を this 値。
cu を 0x0064 (LATIN SMALL LETTER D)。
? RegExpHasFlag(R, cu) を返す。

22.2.6.7 get RegExp.prototype.ignoreCase

RegExp.prototype.ignoreCase は set アクセサが undefined のアクセサプロパティであり、get アクセサは以下を行う:

R を this 値。
cu を 0x0069 (LATIN SMALL LETTER I)。
? RegExpHasFlag(R, cu) を返す。

22.2.6.8 RegExp.prototype [ %Symbol.match% ] ( `string` )

このメソッドは呼び出し時以下を行う:

rx を this 値とする。
rx がオブジェクトでなければ TypeError 例外。
S を ? ToString(string) とする。
flags を ? ToString(? Get(rx, "flags" )) とする。
flags が "g" を含まなければ
1. ? RegExpExec(rx, S) を返す。
そうでなければ
1. flags が "u" または "v" を含むなら fullUnicode を true、そうでなければ false とする。
2. ? Set(rx, "lastIndex", +0_𝔽, true) を実行する。
3. A を ! ArrayCreate(0) とする。
4. n を 0 とする。
5. 繰り返し、
  1. result を ? RegExpExec(rx, S) とする。
  2. result が null なら
    1. n = 0 なら null を返す。
    2. A を返す。
  3. そうでなければ
    1. matchStr を ? ToString(? Get(result, "0" )) とする。
    2. ! CreateDataPropertyOrThrow(A, ! ToString(𝔽(n)), matchStr) を実行。
    3. matchStr が空文字列なら
      1. thisIndex を ℝ(? ToLength(? Get(rx, "lastIndex"))) とする。
      2. nextIndex を AdvanceStringIndex(S, thisIndex, fullUnicode) とする。
      3. ? Set(rx, "lastIndex", 𝔽(nextIndex), true) を実行。
    4. n を n + 1 にする。

このメソッドの "name" プロパティ値は "[Symbol.match]" である。

Note

%Symbol.match% プロパティは IsRegExp 抽象操作が正規表現基本挙動を持つオブジェクトを識別するのに用いられる。%Symbol.match% が存在しないか、その値が真へ強制されない場合、そのオブジェクトは正規表現オブジェクトとして意図されない。

22.2.6.9 RegExp.prototype [ %Symbol.matchAll% ] ( `string` )

このメソッドは呼び出し時以下を行う:

R を this 値とする。
R がオブジェクトでなければ TypeError 例外。
S を ? ToString(string) とする。
C を ? SpeciesConstructor(R, %RegExp%) とする。
flags を ? ToString(? Get(R, "flags" )) とする。
matcher を ? Construct(C, « R, flags ») とする。
lastIndex を ? ToLength(? Get(R, "lastIndex" )) とする。
? Set(matcher, "lastIndex", lastIndex, true) を実行。
flags が "g" を含むなら global を true、そうでなければ false。
flags が "u" または "v" を含むなら fullUnicode を true、そうでなければ false。
CreateRegExpStringIterator(matcher, S, global, fullUnicode) を返す。

このメソッドの "name" プロパティ値は "[Symbol.matchAll]" である。

22.2.6.10 get RegExp.prototype.multiline

RegExp.prototype.multiline は set アクセサが undefined のアクセサプロパティであり、get アクセサは以下を行う:

R を this 値。
cu を 0x006D (LATIN SMALL LETTER M)。
? RegExpHasFlag(R, cu) を返す。

22.2.6.11 RegExp.prototype [ %Symbol.replace% ] ( `string`, `replaceValue` )

このメソッドは呼び出し時以下を行う:

rx を this 値とする。
rx がオブジェクトでなければ TypeError 例外。
S を ? ToString(string) とする。
lengthS を S の長さとする。
functionalReplace を IsCallable(replaceValue) とする。
functionalReplace が false なら
1. replaceValue を ? ToString(replaceValue) とする。
flags を ? ToString(? Get(rx, "flags" )) とする。
flags が "g" を含むなら global を true、そうでなければ false。
global が true なら
1. ? Set(rx, "lastIndex", +0_𝔽, true) を実行。
results を空 List とする。
done を false とする。
done が false の間繰り返し、
1. result を ? RegExpExec(rx, S) とする。
2. result が null なら
  1. done を true にする。
3. そうでなければ
  1. result を results に追加。
  2. global が false なら
    1. done を true にする。
  3. そうでなければ
    1. matchStr を ? ToString(? Get(result, "0" )) とする。
    2. matchStr が空文字列なら
      1. thisIndex を ℝ(? ToLength(? Get(rx, "lastIndex"))) とする。
      2. flags が "u" または "v" を含むなら fullUnicode を true、そうでなければ false。
      3. nextIndex を AdvanceStringIndex(S, thisIndex, fullUnicode) とする。
      4. ? Set(rx, "lastIndex", 𝔽(nextIndex), true) を実行。
accumulatedResult を空文字列。
nextSourcePosition を 0。
各 result ∈ results について
1. resultLength を ? LengthOfArrayLike(result)。
2. nCaptures を max(resultLength - 1, 0)。
3. matched を ? ToString(? Get(result, "0" ))。
4. matchLength を matched の長さ。
5. position を ? ToIntegerOrInfinity(? Get(result, "index" ))。
6. position を 0 と lengthS の間にクランプ。
7. captures を新しい空 List。
8. n を 1。
9. n ≤ nCaptures の間繰り返し、
  1. capN を ? Get(result, ! ToString(𝔽(n)))。
  2. capN が undefined でなければ
    1. capN を ? ToString(capN)。
  3. capN を captures に追加。
  4. 注記: n = 1 のとき最初のキャプチャが captures[0] に入る。一般に n 番目のキャプチャは captures[n - 1]。
  5. n を n + 1。
10. namedCaptures を ? Get(result, "groups")。
11. functionalReplace が true なら
  1. replacerArgs を « matched » と captures と « 𝔽(position), S » のリスト連結とする。
  2. namedCaptures が undefined でなければ
    1. replacerArgs に namedCaptures を追加。
  3. replacementValue を ? Call(replaceValue, undefined, replacerArgs)。
  4. replacementString を ? ToString(replacementValue)。
12. そうでなければ
  1. namedCaptures が undefined でなければ
    1. namedCaptures を ? ToObject(namedCaptures)。
  2. replacementString を ? GetSubstitution(matched, S, position, captures, namedCaptures, replaceValue)。
13. position ≥ nextSourcePosition なら
  1. 注記: position が後退するのは通常想定外であり、不正な RegExp サブクラス動作や副作用でフラグ等を変更した兆候である。その場合対応置換は無視される。
  2. accumulatedResult を accumulatedResult と S の nextSourcePosition から position まで、および replacementString の連結にする。
  3. nextSourcePosition を position + matchLength にする。
nextSourcePosition ≥ lengthS なら accumulatedResult を返す。
accumulatedResult と S の nextSourcePosition から末尾までの部分文字列の連結を返す。

このメソッドの "name" プロパティ値は "[Symbol.replace]" である。

22.2.6.12 RegExp.prototype [ %Symbol.search% ] ( `string` )

このメソッドは呼び出し時以下を行う:

rx を this 値。
rx がオブジェクトでなければ TypeError 例外。
S を ? ToString(string)。
previousLastIndex を ? Get(rx, "lastIndex" )。
previousLastIndex が +0_𝔽 でなければ
1. ? Set(rx, "lastIndex", +0_𝔽, true) を実行。
result を ? RegExpExec(rx, S)。
currentLastIndex を ? Get(rx, "lastIndex" )。
SameValue(currentLastIndex, previousLastIndex) が false なら
1. ? Set(rx, "lastIndex", previousLastIndex, true) を実行。
result が null なら -1_𝔽 を返す。
? Get(result, "index") を返す。

このメソッドの "name" プロパティ値は "[Symbol.search]" である。

Note

検索時この RegExp オブジェクトの "lastIndex" と "global" プロパティは無視される。"lastIndex" は変更されない。

22.2.6.13 get RegExp.prototype.source

RegExp.prototype.source は set アクセサが undefined のアクセサプロパティであり、get アクセサは以下を行う:

R を this 値。
R がオブジェクトでなければ TypeError 例外。
R が [[OriginalSource]] 内部スロットを持たないなら
1. SameValue(R, %RegExp.prototype%) が true なら "(?:)" を返す。
2. そうでなければ TypeError 例外。
アサート: R は [[OriginalFlags]] 内部スロットを持つ。
src を R.[[OriginalSource]]。
flags を R.[[OriginalFlags]]。
EscapeRegExpPattern(src, flags) を返す。

22.2.6.13.1 EscapeRegExpPattern ( `P`, `F` )

The abstract operation EscapeRegExpPattern takes arguments P (String) and F (String) and returns String. It performs the following steps when called:

F が "v" を含むなら
1. patternSymbol を Pattern[+UnicodeMode, +UnicodeSetsMode] とする。
そうでなく F が "u" を含むなら
1. patternSymbol を Pattern[+UnicodeMode, ~UnicodeSetsMode] とする。
それ以外
1. patternSymbol を Pattern[~UnicodeMode, ~UnicodeSetsMode] とする。
S を、特定コードポイントが下記のようにエスケープされた、P (UTF-16 エンコードされた Unicode コードポイントと解釈) に等価な patternSymbol 形式の String とする。S は P と同一である場合と異なる場合があるが、S を patternSymbol として評価して得られる抽象クロージャは生成オブジェクトの [[RegExpMatcher]] 内部スロットの抽象クロージャと同一に振る舞わなければならない。同じ P, F での複数回呼び出しは同一結果を生成しなければならない。
パターンに現れる / または任意の LineTerminator コードポイントは、"/", S, "/", F の連結が (適切な字句文脈で) 同一に振る舞う RegularExpressionLiteral としてパース可能となるよう S 内で必要に応じエスケープされる。例: P が "/" の場合 S は "\/" や "\u002F" 等が許されるが "/" は不可 ( /// + F が SingleLineComment と解釈されるため )。 P が空文字列なら S を "(?:)" として要件を満たせる。
S を返す。

Note

名前が類似していても RegExp.escape と EscapeRegExpPattern は異なる。前者は文字列をパターン内部表現用にエスケープし、後者はパターンを文字列表現用にエスケープする。

22.2.6.14 RegExp.prototype [ %Symbol.split% ] ( `string`, `limit` )

Note 1

このメソッドは string を String に変換した結果の部分文字列を格納した配列を返す。部分文字列は this 値である正規表現のマッチを左から右に探索して決定され、マッチ位置自体は結果配列要素には含まれず文字列を区切る役割をする。

this 値は空の正規表現、または空文字列にマッチする正規表現であり得る。その場合、入力文字列の先頭・末尾、前の区切りマッチ末尾における空 substring にはマッチしない。（例: 正規表現が空文字列にマッチするなら文字列は各コードユニット要素に分割され、結果配列長は文字列長に等しく、各 substring は 1 コードユニット。）あるインデックスで考慮されるマッチは最初の一つのみで、バックトラッキングにより非空マッチが得られても再考しない。（例: /a*?/[Symbol.split]("ab") は ["a","b"]、/a*/[Symbol.split]("ab") は ["","b"]。）

string が（または変換後）空文字列の場合、正規表現が空文字列にマッチ可能かどうかで結果が異なる。マッチ可能なら結果配列は空、そうでなければ空文字列 1 要素を含む。

正規表現が捕捉括弧を含むとき、separator がマッチする毎にその結果（undefined を含む）が出力配列に挿入される。例:

/<(\/)?([^<>]+)>/[Symbol.split]("A<B>bold</B>and<CODE>coded</CODE>")

は配列

["A", undefined, "B", "bold", "/", "B", "and", undefined, "CODE", "coded", "/", "CODE", ""]

を生成する。

limit が undefined でなければ、出力配列は limit 要素を超えないよう切り詰められる。

このメソッドは呼び出し時以下を行う:

rx を this 値。
rx がオブジェクトでなければ TypeError 例外。
S を ? ToString(string)。
C を ? SpeciesConstructor(rx, %RegExp%)。
flags を ? ToString(? Get(rx, "flags" ))。
flags が "u" または "v" を含むなら unicodeMatching を true、そうでなければ false。
flags が "y" を含むなら newFlags を flags、そうでなければ newFlags を flags と "y" の連結とする。
splitter を ? Construct(C, « rx, newFlags »)。
A を ! ArrayCreate(0)。
lengthA を 0。
limit が undefined なら lim を 2³² - 1、そうでなければ ℝ(? ToUint32(limit))。
lim = 0 なら A を返す。
S が空文字列なら
1. z を ? RegExpExec(splitter, S)。
2. z が null でなければ A を返す。
3. ! CreateDataPropertyOrThrow(A, "0", S)。
4. A を返す。
size を S の長さ。
p を 0。
q を p。
q < size の間繰り返し、
1. ? Set(splitter, "lastIndex", 𝔽(q), true)。
2. z を ? RegExpExec(splitter, S)。
3. z が null なら
  1. q を AdvanceStringIndex(S, q, unicodeMatching) とする。
4. そうでなければ
  1. e を ℝ(? ToLength(? Get(splitter, "lastIndex")))。
  2. e を min(e, size)。
  3. e = p なら
    1. q を AdvanceStringIndex(S, q, unicodeMatching)。
  4. そうでなければ
    1. T を S の p から q の部分文字列。
    2. ! CreateDataPropertyOrThrow(A, ! ToString(𝔽(lengthA)), T)。
    3. lengthA を lengthA + 1。
    4. lengthA = lim なら A を返す。
    5. p を e。
    6. numberOfCaptures を ? LengthOfArrayLike(z)。
    7. numberOfCaptures を max(numberOfCaptures - 1, 0)。
    8. i を 1。
    9. i ≤ numberOfCaptures の間繰り返し、
      1. nextCapture を ? Get(z, ! ToString(𝔽(i)))。
      2. ! CreateDataPropertyOrThrow(A, ! ToString(𝔽(lengthA)), nextCapture)。
      3. i を i + 1。
      4. lengthA を lengthA + 1。
      5. lengthA = lim なら A を返す。
    10. q を p に設定。
T を S の p から size の部分文字列。
! CreateDataPropertyOrThrow(A, ! ToString(𝔽(lengthA)), T)。
A を返す。

このメソッドの "name" プロパティ値は "[Symbol.split]" である。

Note 2

このメソッドは RegExp オブジェクトの "global" および "sticky" プロパティ値を無視する。

22.2.6.15 get RegExp.prototype.sticky

RegExp.prototype.sticky は set アクセサが undefined のアクセサプロパティであり、get アクセサは以下を行う:

R を this 値。
cu を 0x0079 (LATIN SMALL LETTER Y)。
? RegExpHasFlag(R, cu) を返す。

22.2.6.16 RegExp.prototype.test ( `S` )

このメソッドは呼び出し時以下を行う:

R を this 値。
R がオブジェクトでなければ TypeError 例外。
string を ? ToString(S)。
match を ? RegExpExec(R, string)。
match が null でなければ true、そうでなければ false を返す。

22.2.6.17 RegExp.prototype.toString ( )

R を this 値。
R がオブジェクトでなければ TypeError 例外。
pattern を ? ToString(? Get(R, "source" ))。
flags を ? ToString(? Get(R, "flags" ))。
result を "/", pattern, "/", flags の連結とする。
result を返す。

Note

返される String は RegularExpressionLiteral の形式であり、同じ挙動の別 RegExp オブジェクトを評価する。

22.2.6.18 get RegExp.prototype.unicode

RegExp.prototype.unicode は set アクセサが undefined のアクセサプロパティであり、get アクセサは以下を行う:

R を this 値。
cu を 0x0075 (LATIN SMALL LETTER U)。
? RegExpHasFlag(R, cu) を返す。

22.2.6.19 get RegExp.prototype.unicodeSets

RegExp.prototype.unicodeSets は set アクセサが undefined のアクセサプロパティであり、get アクセサは以下を行う:

R を this 値。
cu を 0x0076 (LATIN SMALL LETTER V)。
? RegExpHasFlag(R, cu) を返す。

22.2.7 RegExp マッチングのための抽象操作

22.2.7.1 RegExpExec ( `R`, `S` )

The abstract operation RegExpExec takes arguments R (オブジェクト) and S (String) and returns オブジェクトまたは null を含む通常完了、または throw 完了. It performs the following steps when called:

exec を ? Get(R, "exec") とする。
IsCallable(exec) が true なら
1. result を ? Call(exec, R, « S ») とする。
2. result がオブジェクトでも null でもなければ TypeError 例外。
3. result を返す。
? RequireInternalSlot(R, [[RegExpMatcher]]) を実行。
? RegExpBuiltinExec(R, S) を返す。

Note

呼び出し可能な "exec" プロパティが見つからない場合、このアルゴリズムは組み込み正規表現マッチングアルゴリズムにフォールバックする。これは過去版でほとんどの組み込みアルゴリズムが "exec" の動的プロパティ参照を行わなかったコードとの互換性を提供する。

22.2.7.2 RegExpBuiltinExec ( `R`, `S` )

The abstract operation RegExpBuiltinExec takes arguments R (初期化済み RegExp インスタンス) and S (String) and returns Array エキゾチックオブジェクトまたは null を含む通常完了、または throw 完了. It performs the following steps when called:

length を S の長さ。
lastIndex を ℝ(? ToLength(! Get(R, "lastIndex" )))。
flags を R.[[OriginalFlags]]。
flags が "g" を含むなら global を true、そうでなければ false。
flags が "y" を含むなら sticky を true、そうでなければ false。
flags が "d" を含むなら hasIndices を true、そうでなければ false。
global が false かつ sticky が false なら lastIndex を 0 に設定。
matcher を R.[[RegExpMatcher]]。
flags が "u" または "v" を含むなら fullUnicode を true、そうでなければ false。
matchSucceeded を false。
fullUnicode が true なら input を StringToCodePoints(S)、そうでなければ input を S のコードユニット列 List とする。
注記: input の各要素は文字と見なす。
matchSucceeded が false の間繰り返し、
1. lastIndex > length なら
  1. global または sticky が true なら
    1. ? Set(R, "lastIndex", +0_𝔽, true) を実行。
  2. null を返す。
2. inputIndex を S の lastIndex 番目要素から得た文字の input 内インデックスとする。
3. r を matcher(input, inputIndex) とする。
4. r が failure なら
  1. sticky が true なら
    1. ? Set(R, "lastIndex", +0_𝔽, true)。
    2. null を返す。
  2. lastIndex を AdvanceStringIndex(S, lastIndex, fullUnicode) に設定。
5. そうでなければ
  1. アサート: r は MatchState。
  2. matchSucceeded を true にする。
e を r.[[EndIndex]]。
fullUnicode が true なら e を GetStringIndex(S, e) に設定。
global または sticky が true なら
1. ? Set(R, "lastIndex", 𝔽(e), true)。
n を r.[[Captures]] の要素数。
アサート: n = R.[[RegExpRecord]].[[CapturingGroupsCount]]。
アサート: n < 2³² - 1。
A を ! ArrayCreate(n + 1)。
アサート: A."length" の数学的値は n + 1。
! CreateDataPropertyOrThrow(A, "index", 𝔽(lastIndex))。
! CreateDataPropertyOrThrow(A, "input", S)。
match を Match Record { [[StartIndex]]: lastIndex, [[EndIndex]]: e }。
indices を空 List。
groupNames を空 List。
indices に match を追加。
matchedSubstr を GetMatchString(S, match)。
! CreateDataPropertyOrThrow(A, "0", matchedSubstr)。
R が GroupName を含むなら
1. groups を OrdinaryObjectCreate(null)。
2. hasGroups を true。
そうでなければ
1. groups を undefined。
2. hasGroups を false。
! CreateDataPropertyOrThrow(A, "groups", groups)。
matchedGroupNames を空 List。
1 ≤ i ≤ n を昇順で各 i について
1. captureI を r.[[Captures]] の i 番目要素。
2. captureI が undefined なら
  1. capturedValue を undefined。
  2. indices に undefined を追加。
3. そうでなければ
  1. captureStart を captureI.[[StartIndex]]。
  2. captureEnd を captureI.[[EndIndex]]。
  3. fullUnicode が true なら
    1. captureStart を GetStringIndex(S, captureStart)。
    2. captureEnd を GetStringIndex(S, captureEnd)。
  4. capture を Match Record { [[StartIndex]]: captureStart, [[EndIndex]]: captureEnd }。
  5. capturedValue を GetMatchString(S, capture)。
  6. indices に capture を追加。
4. ! CreateDataPropertyOrThrow(A, ! ToString(𝔽(i)), capturedValue)。
5. i 番目のキャプチャが GroupName で定義されているなら
  1. s をその GroupName の CapturingGroupName とする。
  2. matchedGroupNames が s を含むなら
    1. アサート: capturedValue は undefined。
    2. groupNames に undefined を追加。
  3. そうでなければ
    1. capturedValue が undefined でなければ s を matchedGroupNames に追加。
    2. 注記: 同名グループが複数ある場合 groups に既に s プロパティが存在することがあるが、すべて可書きデータプロパティなので CreateDataPropertyOrThrow は成功する。
    3. ! CreateDataPropertyOrThrow(groups, s, capturedValue)。
    4. groupNames に s を追加。
6. そうでなければ
  1. groupNames に undefined を追加。
hasIndices が true なら
1. indicesArray を MakeMatchIndicesIndexPairArray(S, indices, groupNames, hasGroups) とする。
2. ! CreateDataPropertyOrThrow(A, "indices", indicesArray)。
A を返す。

22.2.7.3 AdvanceStringIndex ( `S`, `index`, `unicode` )

The abstract operation AdvanceStringIndex takes arguments S (String), index (非負整数), and unicode (Boolean) and returns 整数. It performs the following steps when called:

アサート: index ≤ 2⁵³ - 1。
unicode が false なら index + 1 を返す。
length を S の長さ。
index + 1 ≥ length なら index + 1 を返す。
cp を CodePointAt(S, index)。
index + cp.[[CodeUnitCount]] を返す。

22.2.7.4 GetStringIndex ( `S`, `codePointIndex` )

The abstract operation GetStringIndex takes arguments S (String) and codePointIndex (非負整数) and returns 非負整数. S を UTF-16 エンコードされたコードポイント列として解釈し (6.1.4)、codePointIndex に対応するコードユニットインデックスが存在すればそれを返し、存在しなければ S の長さを返す。 It performs the following steps when called:

S が空文字列なら 0 を返す。
len を S の長さ。
codeUnitCount を 0。
codePointCount を 0。
codeUnitCount < len の間繰り返し、
1. codePointCount = codePointIndex なら codeUnitCount を返す。
2. cp を CodePointAt(S, codeUnitCount)。
3. codeUnitCount を codeUnitCount + cp.[[CodeUnitCount]] に。
4. codePointCount を codePointCount + 1 に。
len を返す。

22.2.7.5 Match レコード

Match Record は正規表現のマッチまたはキャプチャの開始・終了インデックスを保持する Record 値である。

Match Record は Table 70 に列挙するフィールドを持つ。

Table 70: Match Record Fields

Field Name	Value	Meaning
`[[StartIndex]]`	非負整数	マッチ開始位置 (含む) までのコードユニット数。
`[[EndIndex]]`	`[[StartIndex]]` 以上の整数	マッチ終了位置 (含まない) までのコードユニット数。

22.2.7.6 GetMatchString ( `S`, `match` )

The abstract operation GetMatchString takes arguments S (String) and match (Match Record) and returns String. It performs the following steps when called:

アサート: match.[[StartIndex]] ≤ match.[[EndIndex]] ≤ S の長さ。
S の match.[[StartIndex]] から match.[[EndIndex]] までの部分文字列を返す。

22.2.7.7 GetMatchIndexPair ( `S`, `match` )

The abstract operation GetMatchIndexPair takes arguments S (String) and match (Match Record) and returns Array. It performs the following steps when called:

アサート: match.[[StartIndex]] ≤ match.[[EndIndex]] ≤ S の長さ。
CreateArrayFromList(« 𝔽(match.[[StartIndex]]), 𝔽(match.[[EndIndex]]) ») を返す。

22.2.7.8 MakeMatchIndicesIndexPairArray ( `S`, `indices`, `groupNames`, `hasGroups` )

The abstract operation MakeMatchIndicesIndexPairArray takes arguments S (String), indices (Match Record または undefined の List), groupNames (String または undefined の List), and hasGroups (Boolean) and returns Array. It performs the following steps when called:

n を indices の要素数。
アサート: n < 2³² - 1。
アサート: groupNames は n - 1 要素を持つ。
注記: groupNames の要素は indices[1] から整列。
A を ! ArrayCreate(n)。
hasGroups が true なら
1. groups を OrdinaryObjectCreate(null)。
そうでなければ
1. groups を undefined。
! CreateDataPropertyOrThrow(A, "groups", groups)。
0 ≤ i < n を昇順で各 i について
1. matchIndices を indices[i]。
2. matchIndices が undefined でなければ
  1. matchIndexPair を GetMatchIndexPair(S, matchIndices)。
3. そうでなければ
  1. matchIndexPair を undefined。
4. ! CreateDataPropertyOrThrow(A, ! ToString(𝔽(i)), matchIndexPair)。
5. i > 0 なら
  1. s を groupNames[i - 1]。
  2. s が undefined でなければ
    1. アサート: groups は undefined でない。
    2. 注記: 同名グループが複数ある場合でも groups は通常オブジェクトなので再作成は成功する。
    3. ! CreateDataPropertyOrThrow(groups, s, matchIndexPair)。
A を返す。

22.2.8 RegExp インスタンスのプロパティ

RegExp インスタンスは RegExp プロトタイプオブジェクトからプロパティを継承する通常オブジェクトである。RegExp インスタンスは内部スロット [[OriginalSource]], [[OriginalFlags]], [[RegExpRecord]], [[RegExpMatcher]] を持つ。[[RegExpMatcher]] 内部スロットの値は RegExp オブジェクトの Pattern の抽象クロージャ表現である。

Note

ECMAScript 2015 以前は RegExp インスタンスは独自データプロパティ "source", "global", "ignoreCase", "multiline" を持つと規定されていた。これらは現在 RegExp.prototype のアクセサプロパティとして規定される。

RegExp インスタンスは次のプロパティも持つ:

22.2.8.1 lastIndex

"lastIndex" プロパティの値は次のマッチを開始する String インデックスを指定する。使用時に整数 Number へ強制される (22.2.7.2 参照)。属性は { [[Writable]]: true, [[Enumerable]]: false, [[Configurable]]: false } とする。

22.2.9 RegExp 文字列イテレータオブジェクト

RegExp String Iterator は、特定の RegExp インスタンスオブジェクトに対して、特定の String インスタンスオブジェクト上の反復処理を表すオブジェクトである。RegExp String Iterator オブジェクトに対する名前付きコンストラクターは存在しない。代わりに、RegExp インスタンスオブジェクトの特定メソッド呼び出しによって生成される。

22.2.9.1 CreateRegExpStringIterator ( `R`, `S`, `global`, `fullUnicode` )

The abstract operation CreateRegExpStringIterator takes arguments R (オブジェクト), S (String), global (Boolean), and fullUnicode (Boolean) and returns オブジェクト. It performs the following steps when called:

iterator を OrdinaryObjectCreate(%RegExpStringIteratorPrototype%, « [[IteratingRegExp]], [[IteratedString]], [[Global]], [[Unicode]], [[Done]] ») とする。
iterator.[[IteratingRegExp]] に R を設定する。
iterator.[[IteratedString]] に S を設定する。
iterator.[[Global]] に global を設定する。
iterator.[[Unicode]] に fullUnicode を設定する。
iterator.[[Done]] に false を設定する。
iterator を返す。

22.2.9.2 %RegExpStringIteratorPrototype% オブジェクト

%RegExpStringIteratorPrototype% オブジェクト:

全ての RegExp String Iterator オブジェクトに継承されるプロパティを持つ。
通常のオブジェクトである。
[[Prototype]] 内部スロットの値は %Iterator.prototype% である。
以下のプロパティを持つ:

22.2.9.2.1 %RegExpStringIteratorPrototype%.next ( )

O を this 値とする。
O がオブジェクトでなければ TypeError 例外を投げる。
O が RegExp String Iterator Object Instance の全内部スロット（22.2.9.3 参照）を持たなければ TypeError 例外を投げる。
O.[[Done]] が true なら
1. CreateIteratorResultObject(undefined, true) を返す。
R を O.[[IteratingRegExp]] とする。
S を O.[[IteratedString]] とする。
global を O.[[Global]] とする。
fullUnicode を O.[[Unicode]] とする。
match を ? RegExpExec(R, S) とする。
match が null なら
1. O.[[Done]] に true を設定する。
2. CreateIteratorResultObject(undefined, true) を返す。
global が false なら
1. O.[[Done]] に true を設定する。
2. CreateIteratorResultObject(match, false) を返す。
matchStr を ? ToString(? Get(match, "0" )) とする。
matchStr が空文字列なら
1. thisIndex を ℝ(? ToLength(? Get(R, "lastIndex"))) とする。
2. nextIndex を AdvanceStringIndex(S, thisIndex, fullUnicode) とする。
3. ? Set(R, "lastIndex", 𝔽(nextIndex), true) を実行する。
CreateIteratorResultObject(match, false) を返す。

22.2.9.2.2 %RegExpStringIteratorPrototype% [ %Symbol.toStringTag% ]

%Symbol.toStringTag% プロパティの初期値は文字列 "RegExp String Iterator" である。

このプロパティの属性は { [[Writable]]: false, [[Enumerable]]: false, [[Configurable]]: true } である。

22.2.9.3 RegExp String Iterator インスタンスのプロパティ

RegExp String Iterator インスタンスは %RegExpStringIteratorPrototype% 組込みオブジェクトからプロパティを継承する通常オブジェクトである。RegExp String Iterator インスタンスは初期化時に Table 71 に列挙された内部スロットを持つ。

Table 71: RegExp String Iterator インスタンスの内部スロット

Internal Slot	Type	Description
`[[IteratingRegExp]]`	an Object	反復に使用される正規表現。IsRegExp(`[[IteratingRegExp]]`) は初期状態で true。
`[[IteratedString]]`	a String	反復対象となる String 値。
`[[Global]]`	a Boolean	`[[IteratingRegExp]]` が global かどうかを示す。
`[[Unicode]]`	a Boolean	`[[IteratingRegExp]]` が Unicode モードかどうかを示す。
`[[Done]]`	a Boolean	反復処理が完了しているかどうかを示す。

22 テキスト処理

22.1 String オブジェクト

22.1.1 String コンストラクター

22.1.1.1 String ( value )

22.1.2 String コンストラクターのプロパティ

22.1.2.1 String.fromCharCode ( ...codeUnits )

22.1.2.2 String.fromCodePoint ( ...codePoints )

22.1.2.3 String.prototype

22.1.2.4 String.raw ( template, ...substitutions )

22.1.3 String プロトタイプオブジェクトのプロパティ

22.1.3.1 String.prototype.at ( index )

22.1.3.2 String.prototype.charAt ( pos )

22.1.3.3 String.prototype.charCodeAt ( pos )

22.1.3.4 String.prototype.codePointAt ( pos )

22.1.3.5 String.prototype.concat ( ...args )

22.1.3.6 String.prototype.constructor

22.1.3.7 String.prototype.endsWith ( searchString [ , endPosition ] )

22.1.3.8 String.prototype.includes ( searchString [ , position ] )

22.1.3.9 String.prototype.indexOf ( searchString [ , position ] )

22.1.3.10 String.prototype.isWellFormed ( )

22.1.3.11 String.prototype.lastIndexOf ( searchString [ , position ] )

22.1.3.12 String.prototype.localeCompare ( that [ , reserved1 [ , reserved2 ] ] )

22.1.3.13 String.prototype.match ( regexp )

22.1.3.14 String.prototype.matchAll ( regexp )

22.1.3.15 String.prototype.normalize ( [ form ] )

22.1.3.16 String.prototype.padEnd ( maxLength [ , fillString ] )

22.1.3.17 String.prototype.padStart ( maxLength [ , fillString ] )

22.1.3.17.1 StringPaddingBuiltinsImpl ( O, maxLength, fillString, placement )

22.1.3.17.2 StringPad ( S, maxLength, fillString, placement )

22.1.3.17.3 ToZeroPaddedDecimalString ( n, minLength )

22.1.3.18 String.prototype.repeat ( count )

22.1.3.19 String.prototype.replace ( searchValue, replaceValue )

22.1.3.19.1 GetSubstitution ( matched, str, position, captures, namedCaptures, replacementTemplate )

22.1.3.20 String.prototype.replaceAll ( searchValue, replaceValue )

22.1.3.21 String.prototype.search ( regexp )

22.1.3.22 String.prototype.slice ( start, end )

22.1.3.23 String.prototype.split ( separator, limit )

22.1.3.24 String.prototype.startsWith ( searchString [ , position ] )

22.1.3.25 String.prototype.substring ( start, end )

22.1.3.26 String.prototype.toLocaleLowerCase ( [ reserved1 [ , reserved2 ] ] )

22.1.3.27 String.prototype.toLocaleUpperCase ( [ reserved1 [ , reserved2 ] ] )

22.1.3.28 String.prototype.toLowerCase ( )

22.1.3.29 String.prototype.toString ( )

22.1.3.30 String.prototype.toUpperCase ( )

22.1.3.31 String.prototype.toWellFormed ( )

22.1.3.32 String.prototype.trim ( )

22.1.3.32.1 TrimString ( string, where )

22.1.3.33 String.prototype.trimEnd ( )

22.1.3.34 String.prototype.trimStart ( )

22.1.3.35 String.prototype.valueOf ( )

22.1.3.35.1 ThisStringValue ( value )

22.1.3.36 String.prototype [ %Symbol.iterator% ] ( )

22.1.4 String インスタンスのプロパティ

22.1.4.1 length

22.1.5 String 反復子オブジェクト

22.1.5.1 %StringIteratorPrototype% オブジェクト

22.1.5.1.1 %StringIteratorPrototype%.next ( )

22.1.5.1.2 %StringIteratorPrototype% [ %Symbol.toStringTag% ]

22.2 RegExp (正規表現) オブジェクト

22.2.1 パターン

構文

22.2.1.1 静的セマンティクス: 早期エラー

22.2.1.2 静的セマンティクス: CountLeftCapturingParensWithin ( node: a Parse Node, ): 非負整数

22.2.1.3 静的セマンティクス: CountLeftCapturingParensBefore ( node: a Parse Node, ): 非負整数

22.2.1.4 静的セマンティクス: MightBothParticipate ( x: a Parse Node, y: a Parse Node, ): Boolean

22.2.1.5 静的セマンティクス: CapturingGroupNumber : 正の整数

22.2.1.6 静的セマンティクス: IsCharacterClass : Boolean

22.2.1.7 静的セマンティクス: CharacterValue : 非負整数

22.2.1.8 静的セマンティクス: MayContainStrings : Boolean

22.2.1.9 静的セマンティクス: GroupSpecifiersThatMatch ( thisGroupName: a GroupName Parse Node, ): GroupSpecifier 構文ノードのリスト

22.2.1.10 静的セマンティクス: CapturingGroupName : String

22.2.1.11 静的セマンティクス: RegExpIdentifierCodePoints : コードポイントのリスト

22.2.1.12 静的セマンティクス: RegExpIdentifierCodePoint : コードポイント

22.2.2 パターンのセマンティクス

22.2.2.1 表記

22.2.2.1.1 RegExp レコード

22.2.2.2 実行時セマンティクス: CompilePattern : 文字の List と非負整数を取り MatchState か failure を返す抽象クロージャ

22.2.2.3 実行時セマンティクス: CompileSubpattern : Matcher

22.2.2.3.1 RepeatMatcher ( m, min, max, greedy, x, c, parenIndex, parenCount )

22.2.2.3.2 EmptyMatcher ( )

22.1.1.1 String ( `value` )

22.1.2.1 String.fromCharCode ( ...`codeUnits` )

22.1.2.2 String.fromCodePoint ( ...`codePoints` )

22.1.2.4 String.raw ( `template`, ...`substitutions` )

22.1.3.1 String.prototype.at ( `index` )

22.1.3.2 String.prototype.charAt ( `pos` )

22.1.3.3 String.prototype.charCodeAt ( `pos` )

22.1.3.4 String.prototype.codePointAt ( `pos` )

22.1.3.5 String.prototype.concat ( ...`args` )

22.1.3.7 String.prototype.endsWith ( `searchString` [ , `endPosition` ] )

22.1.3.8 String.prototype.includes ( `searchString` [ , `position` ] )

22.1.3.9 String.prototype.indexOf ( `searchString` [ , `position` ] )

22.1.3.11 String.prototype.lastIndexOf ( `searchString` [ , `position` ] )

22.1.3.12 String.prototype.localeCompare ( `that` [ , `reserved1` [ , `reserved2` ] ] )

22.1.3.13 String.prototype.match ( `regexp` )

22.1.3.14 String.prototype.matchAll ( `regexp` )

22.1.3.15 String.prototype.normalize ( [ `form` ] )

22.1.3.16 String.prototype.padEnd ( `maxLength` [ , `fillString` ] )

22.1.3.17 String.prototype.padStart ( `maxLength` [ , `fillString` ] )

22.1.3.17.1 StringPaddingBuiltinsImpl ( `O`, `maxLength`, `fillString`, `placement` )

22.1.3.17.2 StringPad ( `S`, `maxLength`, `fillString`, `placement` )

22.1.3.17.3 ToZeroPaddedDecimalString ( `n`, `minLength` )

22.1.3.18 String.prototype.repeat ( `count` )

22.1.3.19 String.prototype.replace ( `searchValue`, `replaceValue` )

22.1.3.19.1 GetSubstitution ( `matched`, `str`, `position`, `captures`, `namedCaptures`, `replacementTemplate` )

22.1.3.20 String.prototype.replaceAll ( `searchValue`, `replaceValue` )

22.1.3.21 String.prototype.search ( `regexp` )

22.1.3.22 String.prototype.slice ( `start`, `end` )

22.1.3.23 String.prototype.split ( `separator`, `limit` )

22.1.3.24 String.prototype.startsWith ( `searchString` [ , `position` ] )

22.1.3.25 String.prototype.substring ( `start`, `end` )

22.1.3.26 String.prototype.toLocaleLowerCase ( [ `reserved1` [ , `reserved2` ] ] )

22.1.3.27 String.prototype.toLocaleUpperCase ( [ `reserved1` [ , `reserved2` ] ] )

22.1.3.32.1 TrimString ( `string`, `where` )

22.1.3.35.1 ThisStringValue ( `value` )

22.2.1.2 静的セマンティクス: CountLeftCapturingParensWithin ( `node`: a Parse Node, ): 非負整数

22.2.1.3 静的セマンティクス: CountLeftCapturingParensBefore ( `node`: a Parse Node, ): 非負整数

22.2.1.4 静的セマンティクス: MightBothParticipate ( `x`: a Parse Node, `y`: a Parse Node, ): Boolean

22.2.1.9 静的セマンティクス: GroupSpecifiersThatMatch ( `thisGroupName`: a GroupName Parse Node, ): GroupSpecifier 構文ノードのリスト

22.2.2.3.1 RepeatMatcher ( `m`, `min`, `max`, `greedy`, `x`, `c`, `parenIndex`, `parenCount` )

22.2.2.3.3 MatchTwoAlternatives ( `m1`, `m2` )

22.2.2.3.4 MatchSequence ( `m1`, `m2`, `direction` )

22.2.2.4.1 IsWordChar ( `rer`, `Input`, `e` )

22.2.2.7.1 CharacterSetMatcher ( `rer`, `A`, `invert`, `direction` )

22.2.2.7.2 BackreferenceMatcher ( `rer`, `ns`, `direction` )

22.2.2.7.3 Canonicalize ( `rer`, `ch` )

22.2.2.7.4 UpdateModifiers ( `rer`, `add`, `remove` )

22.2.2.9.1 CharacterRange ( `A`, `B` )

22.2.2.9.2 HasEitherUnicodeFlag ( `rer` )

22.2.2.9.3 WordCharacters ( `rer` )

22.2.2.9.4 AllCharacters ( `rer` )

22.2.2.9.5 MaybeSimpleCaseFolding ( `rer`, `A` )

22.2.2.9.6 CharacterComplement ( `rer`, `S` )

22.2.2.9.7 UnicodeMatchProperty ( `rer`, `p` )

22.2.2.9.8 UnicodeMatchPropertyValue ( `p`, `v` )

22.2.3.1 RegExpCreate ( `P`, `F` )

22.2.3.2 RegExpAlloc ( `newTarget` )

22.2.3.3 RegExpInitialize ( `obj`, `pattern`, `flags` )

22.2.3.4 静的セマンティクス: ParsePattern ( `patternText`: Unicode コードポイント列, `u`: Boolean, `v`: Boolean, ): パースノードまたは空でない SyntaxError オブジェクト List

22.2.4.1 RegExp ( `pattern`, `flags` )

22.2.5.1 RegExp.escape ( `S` )

22.2.5.1.1 EncodeForRegExpEscape ( `cp` )

22.2.6.2 RegExp.prototype.exec ( `string` )

22.2.6.4.1 RegExpHasFlag ( `R`, `codeUnit` )

22.2.6.8 RegExp.prototype [ %Symbol.match% ] ( `string` )

22.2.6.9 RegExp.prototype [ %Symbol.matchAll% ] ( `string` )

22.2.6.11 RegExp.prototype [ %Symbol.replace% ] ( `string`, `replaceValue` )

22.2.6.12 RegExp.prototype [ %Symbol.search% ] ( `string` )

22.2.6.13.1 EscapeRegExpPattern ( `P`, `F` )

22.2.6.14 RegExp.prototype [ %Symbol.split% ] ( `string`, `limit` )

22.2.6.16 RegExp.prototype.test ( `S` )

22.2.7.1 RegExpExec ( `R`, `S` )

22.2.7.2 RegExpBuiltinExec ( `R`, `S` )

22.2.7.3 AdvanceStringIndex ( `S`, `index`, `unicode` )

22.2.7.4 GetStringIndex ( `S`, `codePointIndex` )

22.2.7.6 GetMatchString ( `S`, `match` )

22.2.7.7 GetMatchIndexPair ( `S`, `match` )

22.2.7.8 MakeMatchIndicesIndexPairArray ( `S`, `indices`, `groupNames`, `hasGroups` )

22.2.9.1 CreateRegExpStringIterator ( `R`, `S`, `global`, `fullUnicode` )