12 ECMAScript 言語: 字句文法 (ECMAScript Language: Lexical Grammar)

InputElementRegExpOrTemplateTail

InputElementHashbangOrRegExp

TemplateSubstitutionTail

InputElementTemplateTail

TemplateSubstitutionTail

12.1 Unicode 形式制御文字 (Unicode Format-Control Characters)

Unicode 形式制御文字（すなわち Unicode 文字データベースにおける “Cf” カテゴリ、例えば LEFT-TO-RIGHT MARK や RIGHT-TO-LEFT MARK）は、上位プロトコル（マークアップ言語など）が存在しない場合にテキスト範囲の整形を制御するために使用される制御コードである。

編集および表示を容易にするため、ソーステキスト内で形式制御文字を許可することは有用である。すべての形式制御文字はコメント内、ならびに文字列リテラル、テンプレートリテラル、正規表現リテラル内で使用できる。

U+FEFF (ZERO WIDTH NO-BREAK SPACE) は主としてテキストの冒頭で Unicode であることを示し、テキストのエンコーディングとバイト順を検出するために用いられる形式制御文字である。この目的で用いられる <ZWNBSP> 文字は、ファイル連結の結果などとしてテキスト開始後にも現れることがある。ECMAScript ソーステキストでは、コメント、文字列リテラル、テンプレートリテラル、正規表現リテラルの外側で <ZWNBSP> 符号位置は空白文字として扱われる（12.2 参照）。

12.2 空白 (White Space)

空白符号位置はソーステキストの可読性を高め、トークン（分割不可能な字句単位）を相互に分離するために使用されるが、それ以外の点では意味を持たない。空白符号位置は任意の 2 つのトークンの間および入力の開始・末尾に現れうる。空白符号位置は StringLiteral, RegularExpressionLiteral, Template, TemplateSubstitutionTail の内部に現れ、その場合リテラル値を構成する有意味な符号位置として扱われる。また Comment 内に現れうるが、他の種類のトークン内部には現れない。

ECMAScript の空白符号位置は Table 33 に列挙される。

Table 33: White Space Code Points

Code Points	Name	Abbreviation
`U+0009`	CHARACTER TABULATION	<TAB>
`U+000B`	LINE TABULATION	<VT>
`U+000C`	FORM FEED (FF)	<FF>
`U+FEFF`	ZERO WIDTH NO-BREAK SPACE	<ZWNBSP>
一般カテゴリ “Space_Separator” のいかなる符号位置		<USP>

Note 1

U+0020 (SPACE) と U+00A0 (NO-BREAK SPACE) は <USP> の一部である。

Note 2

Table 33 に列挙される符号位置を除き、ECMAScript WhiteSpace は Unicode “White_Space” プロパティを持つが一般カテゴリ “Space_Separator” (“Zs”) に分類されないすべての符号位置を意図的に除外する。

構文 (Syntax)

WhiteSpace

<TAB>

<VT>

<FF>

<USP>

12.3 行終端子 (Line Terminators)

空白符号位置と同様に、行終端子符号位置はソーステキストの可読性を高め、トークンを相互に分離するために使用される。しかし空白符号位置と異なり、行終端子は構文文法の振る舞いに影響を及ぼす。一般に、行終端子は任意の 2 つのトークン間に現れうるが、構文文法により禁止される箇所がいくつか存在する。行終端子は自動セミコロン挿入（12.10）の過程にも影響する。行終端子は StringLiteral, Template, TemplateSubstitutionTail 以外のいかなるトークン内部にも現れない。<LF> および <CR> 行終端子は LineContinuation の一部である場合を除き StringLiteral トークン内部に現れない。

行終端子は MultiLineComment 内には現れることができるが、SingleLineComment 内には現れない。

行終端子は正規表現において \s クラスによりマッチされる空白符号位置集合に含まれる。

ECMAScript の行終端子符号位置は Table 34 に列挙される。

Table 34: Line Terminator Code Points

Code Point	Unicode Name	Abbreviation
`U+000A`	LINE FEED (LF)	<LF>
`U+000D`	CARRIAGE RETURN (CR)	<CR>
`U+2028`	LINE SEPARATOR	<LS>
`U+2029`	PARAGRAPH SEPARATOR	<PS>

Table 34 に示す Unicode 符号位置のみが行終端子として扱われる。その他の改行または行分割の Unicode 符号位置は行終端子とは扱われないが、Table 33 に記述された要件を満たす場合は空白として扱われる。シーケンス <CR><LF> は行終端子として一般的に使用される。行番号の報告目的においては一つの SourceCharacter と見なされるべきである。

構文 (Syntax)

LineTerminator

<LF>

<CR>

<LS>

<PS>

LineTerminatorSequence

<LF>

<CR>

[lookahead ≠ <LF>]

<LS>

<PS>

<CR>

<LF>

12.4 コメント (Comments)

コメントは単一行または複数行のいずれかである。複数行コメントは入れ子にできない。

単一行コメントは LineTerminator 符号位置以外の任意の Unicode 符号位置を含むことができ、また一般規則としてトークンは常に可能な限り長くなるため、単一行コメントは // マーカーから行末までのすべての符号位置で構成される。ただし行末の LineTerminator は単一行コメントの一部とは見なされず、字句文法により別途認識され構文文法用の入力要素列の一部となる。これは重要な点であり、単一行コメントの有無が自動セミコロン挿入（12.10 参照）に影響しないことを意味する。

コメントは空白のように振る舞い破棄されるが、MultiLineComment に行終端子符号位置が含まれる場合、構文文法によるパース目的ではそのコメント全体が一つの LineTerminator と見なされる。

構文 (Syntax)

opt

MultiLineNotAsteriskChar

MultiLineNotForwardSlashOrAsteriskChar

opt

PostAsteriskCommentChars

opt

PostAsteriskCommentChars

opt

PostAsteriskCommentChars

opt

MultiLineNotAsteriskChar

MultiLineNotForwardSlashOrAsteriskChar

but not *

but not one of / or *

SingleLineComment

opt

SingleLineCommentChar

opt

SingleLineCommentChar

but not LineTerminator

この節のいくつかの生成規則は B.1.1 節で代替定義が与えられる。

12.5 ハッシュバンコメント (Hashbang Comments)

ハッシュバンコメントは位置依存であり、他の種類のコメントと同様に構文文法への入力要素列からは破棄される。

構文 (Syntax)

HashbangComment

opt

12.6 トークン (Tokens)

構文 (Syntax)

Note

DivPunctuator, RegularExpressionLiteral, RightBracePunctuator, TemplateSubstitutionTail の生成規則は CommonToken 生成規則に含まれない追加トークンを導出する。

12.7 名前とキーワード (Names and Keywords)

IdentifierName および ReservedWord は Unicode Standard Annex #31「Identifier and Pattern Syntax」で与えられる既定の識別子構文（わずかな修正付き）に従って解釈されるトークンである。ReservedWord は IdentifierName の列挙された部分集合である。構文文法は Identifier を ReservedWord ではない IdentifierName として定義する。Unicode 識別子文法は Unicode Standard が定義する文字プロパティに基づく。Unicode 標準の最新バージョンで指定カテゴリに属する Unicode 符号位置は、すべての適合 ECMAScript 実装によりそのカテゴリとして扱われなければならない。実装は Unicode 標準の後続版で定義された識別子用符号位置を認識してもよい。

Note 1

この規格は追加の特定符号位置を許可する: U+0024 (DOLLAR SIGN) および U+005F (LOW LINE) は IdentifierName 内の任意の位置で許可される。

構文 (Syntax)

IdentifierPartChar

one of

any Unicode code point with the Unicode property “ID_Start”

UnicodeIDContinue

any Unicode code point with the Unicode property “ID_Continue”

非終端記号 UnicodeEscapeSequence の定義は 12.9.4 に示される。

Note 2

非終端記号 IdentifierPart は UnicodeIDContinue を通じて _ を導出する。

Note 3

Unicode プロパティ “ID_Start” および “ID_Continue” の集合には、それぞれ “Other_ID_Start” および “Other_ID_Continue” プロパティを持つ符号位置が含まれる。

12.7.1 識別子名 (Identifier Names)

Unicode エスケープシーケンスは IdentifierName 内で許可され、その場合 UnicodeEscapeSequence の IdentifierCodePoint に等しい一つの Unicode 符号位置として寄与する。UnicodeEscapeSequence に先行する ` はいかなる符号位置も寄与しない。|UnicodeEscapeSequence| は、それが寄与する符号位置をエスケープ無しで書いた場合に無効となるような符号位置を |IdentifierName| に寄与するためには使用できない。言い換えると、 ` UnicodeEscapeSequence の並びをそれが寄与する SourceCharacter に置換した場合、結果は元の IdentifierName と同一の SourceCharacter 列を持つ有効な IdentifierName でなければならない。本仕様内の IdentifierName の解釈は、特定符号位置がエスケープシーケンスで与えられたかどうかに関わらず実際のコードポイントに基づく。

Unicode 標準に従い正規等価な 2 つの IdentifierName は、各 UnicodeEscapeSequence を置換した後に完全に同じコードポイント列で表されない限り等しくない。

12.7.1.1 静的セマンティクス: 早期エラー (Early Errors)

IdentifierStart

UnicodeEscapeSequence の IdentifierCodePoint が IdentifierStartChar 字句文法生成規則でマッチされる Unicode 符号位置でなければ構文エラー。

UnicodeEscapeSequence の IdentifierCodePoint が IdentifierPartChar 字句文法生成規則でマッチされる Unicode 符号位置でなければ構文エラー。

12.7.1.2 静的セマンティクス: IdentifierCodePoints : 符号位置の List

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It is defined piecewise over the following productions:

IdentifierName

IdentifierStart

cp を IdentifierStart の IdentifierCodePoint とする。
« cp » を返す。

IdentifierName

cps を導出された IdentifierName の IdentifierCodePoints とする。
cp を IdentifierPart の IdentifierCodePoint とする。
cps と « cp » のリスト結合を返す。

12.7.1.3 静的セマンティクス: IdentifierCodePoint : 符号位置

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It is defined piecewise over the following productions:

IdentifierStart

IdentifierStartChar

IdentifierStartChar によりマッチされた符号位置を返す。

IdentifierPartChar

IdentifierPartChar によりマッチされた符号位置を返す。

Hex4Digits

Hex4Digits の MV の数値値を持つ符号位置を返す。

OptionalChainingPunctuator

CodePoint

}

CodePoint の MV の数値値を持つ符号位置を返す。

12.7.2 キーワードと予約語 (Keywords and Reserved Words)

キーワード (keyword) とは IdentifierName にマッチしかつ構文上の用途（生成規則中に等幅フォントで文字通り出現する）を持つトークンである。ECMAScript のキーワードには if, while, async, await など多数が含まれる。

予約語 (reserved word) とは識別子として使用できない IdentifierName である。多くのキーワードは予約語であるが、そうでないものもあり、また特定の文脈でのみ予約されるものもある。if と while は予約語である。await は async 関数およびモジュール内でのみ予約される。async は予約されていないため、変数名やラベルとして制限なく使用できる。

この仕様は文法生成規則および早期エラールールの組み合わせを用いて、どの名前が有効な識別子でどれが予約語かを指定する。下記 ReservedWord 一覧内の await と yield を除くすべてのトークンは無条件に予約される。await と yield の例外は 13.1 でパラメータ化された構文生成規則を用いて指定される。最後に、いくつかの早期エラールールが有効な識別子集合を制限する。13.1.1, 14.3.1.1, 14.7.5.1, 15.7.1 を参照。まとめると識別子名には 5 つの分類がある:

常に識別子として許可されキーワードではないもの（Math, window, toString, _ など）;
決して識別子として許可されないもの（await と yield を除く ReservedWord）;
文脈的に識別子として許可されるもの（await と yield）;
strict mode code で文脈的に識別子として不許可となるもの: let, static, implements, interface, package, private, protected, public;
常に識別子として許可されるが、特定の構文生成規則中で Identifier が許可されない位置にキーワードとして現れるもの: as, async, from, get, meta, of, set, target。

条件付きキーワード (conditional keyword) または文脈的キーワード (contextual keyword) という語がしばしば最後の 3 つのカテゴリに属するキーワードを指し、これらは文脈によって識別子またはキーワードとして使用できる。

構文 (Syntax)

ReservedWord

one of

await

break

case

catch

class

const

continue

debugger

default

delete

else

enum

export

extends

false

finally

for

function

import

instanceof

new

null

return

super

switch

this

throw

true

try

typeof

var

void

while

with

yield

Note 1

5.1.5 に従い、文法内のキーワードは特定の SourceCharacter 列をリテラルにマッチする。キーワード中の符号位置は `` |UnicodeEscapeSequence| で表現できない。

IdentifierName は ` |UnicodeEscapeSequence| を含み得るが、els\u{65}` のように書いて “else” という名前の変数を宣言することはできない。13.1.1 にある早期エラールールが、予約語と同じ StringValue を持つ識別子を除外する。

Note 2

enum は現時点で本仕様においてキーワードとして使用されていない。これは将来の言語拡張でキーワードとして使用するために予約された future reserved word である。

同様に、implements, interface, package, private, protected, public は strict mode code における future reserved words である。

Note 3

arguments および eval はキーワードではないが strict mode code でいくつかの制限を受ける。13.1.1, 8.6.4, 15.2.1, 15.5.1, 15.6.1, 15.8.1 を参照。

12.8 句読点 (Punctuators)

構文 (Syntax)

Punctuator

OtherPunctuator

OptionalChainingPunctuator

[lookahead ∉ DecimalDigit]

OtherPunctuator

one of

{

(

)

[

]

...

;

===

!==

>>>

**=

<<=

>>=

>>>=

&&=

||=

??=

DivPunctuator

RightBracePunctuator

}

12.9 リテラル (Literals)

12.9.1 null リテラル (Null Literals)

構文 (Syntax)

NullLiteral

null

12.9.2 真偽値リテラル (Boolean Literals)

構文 (Syntax)

BooleanLiteral

true

false

12.9.3 数値リテラル (Numeric Literals)

構文 (Syntax)

DecimalLiteral

[+Sep]

[+Sep]

[+Sep]

opt

[+Sep]

[Sep]

[?Sep]

[?Sep]

[?Sep]

[+Sep]

opt

ExponentPart

[+Sep]

opt

[+Sep]

ExponentPart

[+Sep]

opt

ExponentPart

[+Sep]

opt

opt

[+Sep]

[Sep]

[?Sep]

[+Sep]

[+Sep]

one of

one of

[Sep]

[?Sep]

one of

[Sep]

[?Sep]

[?Sep]

[?Sep]

[Sep]

[?Sep]

[?Sep]

[Sep]

[?Sep]

[+Sep]

[+Sep]

one of

[Sep]

[?Sep]

[?Sep]

[Sep]

[?Sep]

[+Sep]

[+Sep]

LegacyOctalLikeDecimalIntegerLiteral

NonOctalDigit

NonOctalDigit

LegacyOctalLikeDecimalIntegerLiteral

DecimalDigit

LegacyOctalLikeDecimalIntegerLiteral

one of

one of

[Sep]

[?Sep]

[?Sep]

[Sep]

[?Sep]

[+Sep]

[+Sep]

one of

NumericLiteral に直続する SourceCharacter は IdentifierStart でも DecimalDigit でもあってはならない。

Note

例えば: 3in はエラーであり、3 と in の 2 つの入力要素ではない。

12.9.3.1 静的セマンティクス: 早期エラー (Early Errors)

IsStrict(this production) が true なら構文エラー。

Note

非 strict コードではこの構文は Legacy である。

12.9.3.2 静的セマンティクス: MV

数値リテラルは Number 型または BigInt 型の値を表す。

DecimalLiteral :: DecimalIntegerLiteral . DecimalDigits の MV は DecimalIntegerLiteral の MV に (DecimalDigits の MV × 10^-n) を加えたもの。ここで n は NumericLiteralSeparator の出現を除いた DecimalDigits の符号位置数。
DecimalLiteral :: DecimalIntegerLiteral . ExponentPart の MV は DecimalIntegerLiteral の MV × 10^e（e は ExponentPart の MV）。
DecimalLiteral :: DecimalIntegerLiteral . DecimalDigits ExponentPart の MV は (DecimalIntegerLiteral の MV + (DecimalDigits の MV × 10^-n)) × 10^e。
DecimalLiteral :: . DecimalDigits の MV は DecimalDigits の MV × 10^-n。
DecimalLiteral :: . DecimalDigits ExponentPart の MV は DecimalDigits の MV × 10^{e - n}。
DecimalLiteral :: DecimalIntegerLiteral ExponentPart の MV は DecimalIntegerLiteral の MV × 10^e。
DecimalIntegerLiteral :: 0 の MV は 0。
DecimalIntegerLiteral :: NonZeroDigit NumericLiteralSeparatoropt DecimalDigits の MV は (NonZeroDigit の MV × 10ⁿ) + DecimalDigits の MV。
DecimalDigits :: DecimalDigits DecimalDigit の MV は (DecimalDigits の MV × 10) + DecimalDigit の MV。
DecimalDigits :: DecimalDigits NumericLiteralSeparator DecimalDigit の MV も (DecimalDigits の MV × 10) + DecimalDigit の MV。
ExponentPart :: ExponentIndicator SignedInteger の MV は SignedInteger の MV。
SignedInteger :: - DecimalDigits の MV は DecimalDigits の MV の負。
DecimalDigit :: 0 / HexDigit :: 0 / OctalDigit :: 0 / LegacyOctalEscapeSequence :: 0 / BinaryDigit :: 0 の MV は 0。
DecimalDigit :: 1 / NonZeroDigit :: 1 / HexDigit :: 1 / OctalDigit :: 1 / BinaryDigit :: 1 の MV は 1。
DecimalDigit :: 2 / NonZeroDigit :: 2 / HexDigit :: 2 / OctalDigit :: 2 の MV は 2。
DecimalDigit :: 3 / ... （以下同様に）9 まで、指定通り 3,4,5,6,7,8,9。
HexDigit :: a / A の MV は 10。
HexDigit :: b / B の MV は 11。
HexDigit :: c / C の MV は 12。
HexDigit :: d / D の MV は 13。
HexDigit :: e / E の MV は 14。
HexDigit :: f / F の MV は 15。
BinaryDigits :: BinaryDigits BinaryDigit の MV は (BinaryDigits の MV × 2) + BinaryDigit の MV。
BinaryDigits :: BinaryDigits NumericLiteralSeparator BinaryDigit も同様。
OctalDigits :: OctalDigits OctalDigit の MV は (OctalDigits の MV × 8) + OctalDigit の MV。
OctalDigits :: OctalDigits NumericLiteralSeparator OctalDigit も同様。
LegacyOctalIntegerLiteral :: LegacyOctalIntegerLiteral OctalDigit の MV は (LegacyOctalIntegerLiteral の MV × 8) + OctalDigit の MV。
NonOctalDecimalIntegerLiteral :: LegacyOctalLikeDecimalIntegerLiteral NonOctalDigit の MV は (LegacyOctalLikeDecimalIntegerLiteral の MV × 10) + NonOctalDigit の MV。
NonOctalDecimalIntegerLiteral :: NonOctalDecimalIntegerLiteral DecimalDigit の MV は (NonOctalDecimalIntegerLiteral の MV × 10) + DecimalDigit の MV。
LegacyOctalLikeDecimalIntegerLiteral :: LegacyOctalLikeDecimalIntegerLiteral OctalDigit の MV は (LegacyOctalLikeDecimalIntegerLiteral の MV × 10) + OctalDigit の MV。
HexDigits :: HexDigits HexDigit の MV は (HexDigits の MV × 16) + HexDigit の MV。
HexDigits :: HexDigits NumericLiteralSeparator HexDigit も同様。

12.9.3.3 静的セマンティクス: NumericValue : Number または BigInt

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It is defined piecewise over the following productions:

DecimalLiteral

RoundMVResult(DecimalLiteral の MV) を返す。

𝔽(NonDecimalIntegerLiteral の MV) を返す。

𝔽(LegacyOctalIntegerLiteral の MV) を返す。

NonDecimalIntegerLiteral の MV に対応する BigInt 値を返す。

0_ℤ を返す。

NonZeroDigit の MV に対応する BigInt 値を返す。

n を NumericLiteralSeparator の出現を除いた DecimalDigits の符号位置数とする。
mv を (NonZeroDigit の MV × 10ⁿ) + DecimalDigits の MV とする。
ℤ(mv) を返す。

12.9.4 文字列リテラル (String Literals)

Note 1

文字列リテラルは単一または二重引用符で囲まれた 0 個以上の Unicode 符号位置である。Unicode 符号位置はエスケープシーケンスで表すこともできる。閉じ引用符、U+005C (REVERSE SOLIDUS), U+000D (CARRIAGE RETURN), U+000A (LINE FEED) 以外のすべての符号位置は文字列リテラル内にリテラルに記述可能である。任意の符号位置はエスケープシーケンスの形で出現可能である。文字列リテラルは ECMAScript String 値へと評価される。これらの String 値を生成する際、Unicode 符号位置は 11.1.1 で定義されるように UTF-16 エンコードされる。基本多言語面に属するコードポイントは文字列の 1 つのコードユニット要素としてエンコードされ、それ以外は 2 つのコードユニット要素としてエンコードされる。

構文 (Syntax)

StringLiteral

DoubleStringCharacters

opt

SingleStringCharacters

opt

DoubleStringCharacters

DoubleStringCharacter

DoubleStringCharacters

opt

SingleStringCharacters

SingleStringCharacter

SingleStringCharacters

opt

DoubleStringCharacter

but not one of " or \ or LineTerminator

<LS>

<PS>

LineContinuation

SingleStringCharacter

but not one of ' or \ or LineTerminator

<LS>

<PS>

LineContinuation

LineTerminatorSequence

LegacyOctalEscapeSequence

CharacterEscapeSequence

[lookahead ∉ DecimalDigit]

NonOctalDecimalEscapeSequence

HexEscapeSequence

CharacterEscapeSequence

SingleEscapeCharacter

NonEscapeCharacter

SingleEscapeCharacter

one of

NonEscapeCharacter

LegacyOctalEscapeSequence

but not one of EscapeCharacter or LineTerminator

EscapeCharacter

SingleEscapeCharacter

DecimalDigit

[lookahead ∈ { 8, 9 }]

NonZeroOctalDigit

[lookahead ∉ OctalDigit]

ZeroToThree

NonOctalDecimalEscapeSequence

[lookahead ∉ OctalDigit]

but not 0

one of

one of

one of

HexEscapeSequence

}

非終端 HexDigit の定義は 12.9.3 に、SourceCharacter は 11.1 にある。

Note 2

<LF> と <CR> は LineContinuation の一部として空の符号位置列を生成する場合を除き文字列リテラル内に現れない。文字列リテラルの String 値にこれらを含める正しい方法は \n や \u000A などのエスケープシーケンスを用いることである。

12.9.4.1 静的セマンティクス: 早期エラー (Early Errors)

LegacyOctalEscapeSequence

NonOctalDecimalEscapeSequence

IsStrict(this production) が true なら構文エラー。

Note 1

非 strict コードではこの構文は Legacy。

Note 2

文字列リテラルは囲むコードを strict mode にする Use Strict ディレクティブより前に現れる可能性があるため、実装はそのようなリテラルに対して上記規則を適用する際注意しなければならない。例えば次のソーステキストは構文エラーを含む:

function invalid() { "\7"; "use strict"; }

12.9.4.2 静的セマンティクス: SV : String

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS.

文字列リテラルは String 型の値を表す。SV は文字列リテラルの各部分に再帰的に適用され String 値を生成する。この過程で、文字列リテラル内の一部の Unicode 符号位置は下記または 12.9.3 に述べるように数学的値を持つものとして解釈される。

StringLiteral :: " " の SV は空文字列。
StringLiteral :: ' ' の SV は空文字列。
DoubleStringCharacters :: DoubleStringCharacter DoubleStringCharacters の SV は DoubleStringCharacter の SV と DoubleStringCharacters の SV の連結。
SingleStringCharacters :: SingleStringCharacter SingleStringCharacters の SV は同様。
DoubleStringCharacter :: SourceCharacter but not one of " or \ or LineTerminator の SV は SourceCharacter によりマッチされたコードポイントに UTF16EncodeCodePoint を行った結果。
DoubleStringCharacter :: <LS> の SV はコードユニット 0x2028。
DoubleStringCharacter :: <PS> の SV はコードユニット 0x2029。
DoubleStringCharacter :: LineContinuation の SV は空文字列。
SingleStringCharacter :: SourceCharacter but not one of ' or \ or LineTerminator の SV も UTF16EncodeCodePoint の結果。
SingleStringCharacter :: <LS> の SV は 0x2028。
SingleStringCharacter :: <PS> の SV は 0x2029。
SingleStringCharacter :: LineContinuation の SV は空文字列。
EscapeSequence :: 0 の SV はコードユニット 0x0000。
CharacterEscapeSequence :: SingleEscapeCharacter の SV は Table 35 に従い決定されるコードユニット値。

Table 35: String Single Character Escape Sequences

Escape Sequence	Code Unit Value	Unicode Character Name	Symbol
`\\b`	`0x0008`	BACKSPACE	<BS>
`\\t`	`0x0009`	CHARACTER TABULATION	<HT>
`\\n`	`0x000A`	LINE FEED (LF)	<LF>
`\\v`	`0x000B`	LINE TABULATION	<VT>
`\\f`	`0x000C`	FORM FEED (FF)	<FF>
`\\r`	`0x000D`	CARRIAGE RETURN (CR)	<CR>
`\\"`	`0x0022`	QUOTATION MARK	`"`
`\\'`	`0x0027`	APOSTROPHE	`'`
`\\\\`	`0x005C`	REVERSE SOLIDUS	`\\`

NonEscapeCharacter :: SourceCharacter but not one of EscapeCharacter or LineTerminator の SV は UTF16EncodeCodePoint の結果。
EscapeSequence :: LegacyOctalEscapeSequence の SV は LegacyOctalEscapeSequence の MV の数値値を持つコードユニット。
NonOctalDecimalEscapeSequence :: 8 の SV は 0x0038。
NonOctalDecimalEscapeSequence :: 9 の SV は 0x0039。
HexEscapeSequence :: x HexDigit HexDigit の SV は HexEscapeSequence の MV を数値値とするコードユニット。
Hex4Digits :: HexDigit HexDigit HexDigit HexDigit の SV は Hex4Digits の MV を数値値とするコードユニット。
UnicodeEscapeSequence :: u{ CodePoint } の SV は CodePoint の MV に UTF16EncodeCodePoint を行った結果。
TemplateEscapeSequence :: 0 の SV は 0x0000。

12.9.4.3 静的セマンティクス: MV

LegacyOctalEscapeSequence :: ZeroToThree OctalDigit の MV は (8 × ZeroToThree の MV) + OctalDigit の MV。
LegacyOctalEscapeSequence :: FourToSeven OctalDigit の MV は (8 × FourToSeven の MV) + OctalDigit の MV。
LegacyOctalEscapeSequence :: ZeroToThree OctalDigit OctalDigit の MV は (64 × ZeroToThree の MV) + (8 × 最初の OctalDigit の MV) + 2 番目の OctalDigit の MV。
ZeroToThree :: 0 の MV は 0。
ZeroToThree :: 1 の MV は 1。
ZeroToThree :: 2 の MV は 2。
ZeroToThree :: 3 の MV は 3。
FourToSeven :: 4 の MV は 4。
FourToSeven :: 5 の MV は 5。
FourToSeven :: 6 の MV は 6。
FourToSeven :: 7 の MV は 7。
HexEscapeSequence :: x HexDigit HexDigit の MV は (16 × 最初の HexDigit の MV) + 2 番目の HexDigit の MV。
Hex4Digits :: HexDigit HexDigit HexDigit HexDigit の MV は (0x1000 × 最初の HexDigit の MV) + (0x100 × 2 番目) + (0x10 × 3 番目) + 4 番目。

12.9.5 正規表現リテラル (Regular Expression Literals)

Note 1

正規表現リテラルは評価のたびに RegExp オブジェクト（22.2 参照）へ変換される入力要素である。プログラム中の 2 つの正規表現リテラルは内容が同一でも === で等しくならない。RegExp オブジェクトは new RegExp またはコンストラクタ呼び出し（22.2.4）で実行時に生成することもできる。

以下の生成規則は正規表現リテラルの構文を記述し、入力要素スキャナが正規表現リテラルの終端を見つけるために用いられる。RegularExpressionBody と RegularExpressionFlags を成すソーステキストは、その後より厳密な ECMAScript 正規表現文法（22.2.1）を用いて再度パースされる。

実装は 22.2.1 で定義される ECMAScript 正規表現文法を拡張してもよいが、下に定義される RegularExpressionBody および RegularExpressionFlags 生成規則、またそれらが使用する生成規則を拡張してはならない。

構文 (Syntax)

RegularExpressionFirstChar

RegularExpressionChars

[empty]

RegularExpressionChars

RegularExpressionChar

RegularExpressionFirstChar

but not one of * or \ or / or [

RegularExpressionClass

RegularExpressionChar

but not one of \ or / or [

RegularExpressionClass

RegularExpressionClassChars

but not LineTerminator

RegularExpressionClass

[

]

RegularExpressionClassChars

[empty]

RegularExpressionClassChars

RegularExpressionClassChar

but not one of ] or \

[empty]

IdentifierPartChar

Note 2

正規表現リテラルは空にできない。空の正規表現リテラルを表す代わりに // は単一行コメントを開始する。空の正規表現を指定するには /(?:)/ を用いる。

12.9.5.1 静的セマンティクス: BodyText : ソーステキスト

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It is defined piecewise over the following productions:

RegularExpressionBody として認識されたソーステキストを返す。

12.9.5.2 静的セマンティクス: FlagText : ソーステキスト

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. It is defined piecewise over the following productions:

RegularExpressionFlags として認識されたソーステキストを返す。

12.9.6 テンプレートリテラルの字句要素 (Template Literal Lexical Components)

構文 (Syntax)

Template

NoSubstitutionTemplate

TemplateHead

NoSubstitutionTemplate

TemplateCharacters

opt

TemplateHead

TemplateCharacters

opt

TemplateSubstitutionTail

}

opt

}

opt

opt

[lookahead ≠ {]

TemplateEscapeSequence

NotEscapeSequence

LineContinuation

LineTerminatorSequence

but not one of ` or \ or $ or LineTerminator

TemplateEscapeSequence

CharacterEscapeSequence

[lookahead ∉ DecimalDigit]

HexEscapeSequence

NotEscapeSequence

DecimalDigit

but not 0

[lookahead ∉ HexDigit]

[lookahead ∉ HexDigit]

[lookahead ≠ {]

[lookahead ∉ HexDigit]

[lookahead ∉ HexDigit]

[lookahead ∉ HexDigit]

{

[lookahead ∉ HexDigit]

{

NotCodePoint

[lookahead ∉ HexDigit]

{

CodePoint

[lookahead ∉ HexDigit]

[lookahead ≠ }]

NotCodePoint

HexDigits

[~Sep]

but only if the MV of HexDigits > 0x10FFFF

CodePoint

HexDigits

[~Sep]

but only if the MV of HexDigits ≤ 0x10FFFF

Note

TemplateSubstitutionTail は InputElementTemplateTail の代替字句目標で用いられる。

12.9.6.1 静的セマンティクス: TV : String または undefined

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. テンプレートリテラル構成要素は TV により String 型の値として解釈される。TV はテンプレートオブジェクトのインデックス付き構成要素（テンプレート値）を構成する。TV ではエスケープシーケンスはその Unicode 符号位置を UTF-16 のコードユニットに置換される。

NoSubstitutionTemplate :: ` ` の TV は空文字列。
TemplateHead :: ` ${ の TV は空文字列。
TemplateMiddle :: } ${ の TV は空文字列。
TemplateTail :: } ` の TV は空文字列。
TemplateCharacters :: TemplateCharacter TemplateCharacters の TV は TemplateCharacter または TemplateCharacters の TV が undefined なら undefined、そうでなければその連結。
TemplateCharacter :: SourceCharacter but not one of ` or \ or $ or LineTerminator の TV は SourceCharacter にマッチしたコードポイントへ UTF16EncodeCodePoint を行った結果。
TemplateCharacter :: $ の TV はコードユニット 0x0024。
TemplateCharacter :: \ TemplateEscapeSequence の TV は TemplateEscapeSequence の SV。
TemplateCharacter :: \ NotEscapeSequence の TV は undefined。
TemplateCharacter :: LineTerminatorSequence の TV は LineTerminatorSequence の TRV。
LineContinuation :: \ LineTerminatorSequence の TV は空文字列。

12.9.6.2 静的セマンティクス: TRV : String

The syntax-directed operation UNKNOWN takes UNPARSEABLE ARGUMENTS. テンプレートリテラル構成要素は TRV により String 型の値として解釈される。TRV はテンプレートオブジェクトの raw 構成要素（テンプレート raw 値）を構築する。TRV は TV と似ているが、TRV ではエスケープシーケンスは字面通りのコード単位として扱われる点が異なる。

NoSubstitutionTemplate :: ` ` の TRV は空文字列。
TemplateHead :: ` ${ の TRV は空文字列。
TemplateMiddle :: } ${ の TRV は空文字列。
TemplateTail :: } ` の TRV は空文字列。
TemplateCharacters :: TemplateCharacter TemplateCharacters の TRV は各 TRV の連結。
TemplateCharacter :: SourceCharacter but not one of ` or \ or $ or LineTerminator の TRV は UTF16EncodeCodePoint の結果。
TemplateCharacter :: $ の TRV は 0x0024。
TemplateCharacter :: \ TemplateEscapeSequence の TRV は 0x005C と TemplateEscapeSequence の TRV の連結。
TemplateCharacter :: \ NotEscapeSequence の TRV は 0x005C と NotEscapeSequence の TRV の連結。
TemplateEscapeSequence :: 0 の TRV は 0x0030。
NotEscapeSequence :: 0 DecimalDigit の TRV は 0x0030 と DecimalDigit の TRV の連結。
NotEscapeSequence :: x [lookahead ∉ HexDigit] の TRV は 0x0078。
NotEscapeSequence :: x HexDigit [lookahead ∉ HexDigit] の TRV は 0x0078 と HexDigit の TRV の連結。
NotEscapeSequence :: u [lookahead ∉ HexDigit] [lookahead ≠ {] の TRV は 0x0075。
NotEscapeSequence :: u HexDigit [lookahead ∉ HexDigit] の TRV は 0x0075 と HexDigit の TRV の連結。
NotEscapeSequence :: u HexDigit HexDigit [lookahead ∉ HexDigit] の TRV は 0x0075 と最初および 2 番目の HexDigit の TRV の連結。
NotEscapeSequence :: u HexDigit HexDigit HexDigit [lookahead ∉ HexDigit] の TRV は 0x0075 と最初,2 番目,3 番目の HexDigit の TRV の連結。
NotEscapeSequence :: u { [lookahead ∉ HexDigit] の TRV は 0x0075 と 0x007B の連結。
NotEscapeSequence :: u { NotCodePoint [lookahead ∉ HexDigit] の TRV は 0x0075, 0x007B, NotCodePoint の TRV の連結。
NotEscapeSequence :: u { CodePoint [lookahead ∉ HexDigit] [lookahead ≠ }] の TRV は 0x0075, 0x007B, CodePoint の TRV の連結。
DecimalDigit :: one of 0 9 の TRV は該当コードポイントを UTF16EncodeCodePoint した結果。
CharacterEscapeSequence :: NonEscapeCharacter の TRV は NonEscapeCharacter の SV。
SingleEscapeCharacter :: one of ' " \ b f n r t v の TRV はそのコードポイントの UTF16EncodeCodePoint 結果。
HexEscapeSequence :: x HexDigit HexDigit の TRV は 0x0078 と 2 つの HexDigit の TRV の連結。
UnicodeEscapeSequence :: u Hex4Digits の TRV は 0x0075 と Hex4Digits の TRV の連結。
UnicodeEscapeSequence :: u{ CodePoint } の TRV は 0x0075, 0x007B, CodePoint の TRV, 0x007D の連結。
Hex4Digits :: HexDigit HexDigit HexDigit HexDigit の TRV は 4 つの HexDigit の TRV の連結。
HexDigits :: HexDigits HexDigit の TRV は HexDigits の TRV と HexDigit の TRV の連結。
HexDigit :: one of 0 9 a f A F の TRV は UTF16EncodeCodePoint の結果。
LineContinuation :: \ LineTerminatorSequence の TRV は 0x005C と LineTerminatorSequence の TRV の連結。
LineTerminatorSequence :: <LF> の TRV は 0x000A。
LineTerminatorSequence :: <CR> の TRV は 0x000A。
LineTerminatorSequence :: <LS> の TRV は 0x2028。
LineTerminatorSequence :: <PS> の TRV は 0x2029。
LineTerminatorSequence :: <CR> <LF> の TRV は 0x000A。

Note

TV は LineContinuation のコードユニットを除外するが TRV は含む。<CR><LF> と <CR> の LineTerminatorSequence は TV と TRV の両方で <LF> に正規化される。<CR> または <CR><LF> を含めるには明示的な TemplateEscapeSequence が必要。

12.10 自動セミコロン挿入 (Automatic Semicolon Insertion)

ほとんどの ECMAScript 文および宣言はセミコロンで終端されなければならない。これらのセミコロンは常に明示的に記述できる。利便性のため、特定の状況ではそれらを省略できる。これらの状況ではソースコードトークン列へ自動的にセミコロンが挿入されると記述される。

12.10.1 自動セミコロン挿入の規則 (Rules of Automatic Semicolon Insertion)

以下の規則において “token” は 12 に述べる現在の字句目標記号を用いて決定される実際に認識された字句トークンを意味する。

セミコロン挿入には 3 つの基本規則がある:

ソーステキストを左から右へパースする際、いかなる文法生成規則でも許可されないトークン（違反トークン）に遭遇したとき、以下のいずれかが真ならその違反トークンの前にセミコロンが自動挿入される:
- 違反トークンが直前のトークンと 1 つ以上の LineTerminator で分離されている。
- 違反トークンが } である。
- 直前のトークンが ) であり、挿入されたセミコロンが do-while 文 (14.7.2) の終端セミコロンとしてパースされる。
ソーステキストを左から右へパースする際、トークン入力列の終端に到達し、構文解析器が入力トークン列を目標非終端の単一インスタンスとしてパースできないなら、入力列末尾にセミコロンが自動挿入される。
ソーステキストを左から右へパースする際、文法生成規則により許可されるトークンだがその生成規則が制限付き生成規則であり、トークンが制限付き生成規則内の “[no LineTerminator here]” 注釈直後に位置する終端または非終端の先頭トークン（= 制限トークン）であり、その制限トークンが直前トークンと 1 つ以上の LineTerminator で分離されているなら、制限トークンの前にセミコロンが自動挿入される。

ただし上記規則には更に支配的な条件がある: セミコロンが自動挿入された結果それが空文としてパースされる場合、またはそのセミコロンが for 文ヘッダ内の 2 つのセミコロンの一つになる場合（14.7.4 参照）、セミコロンは決して自動挿入されない。

Note

以下は文法中の唯一の制限付き生成規則である:

UpdateExpression

[Yield, Await]

LeftHandSideExpression

[?Yield, ?Await]

[no LineTerminator here]

LeftHandSideExpression

[?Yield, ?Await]

[no LineTerminator here]

ContinueStatement

[Yield, Await]

continue

;

continue

[no LineTerminator here]

LabelIdentifier

[?Yield, ?Await]

;

BreakStatement

[Yield, Await]

break

;

break

[no LineTerminator here]

LabelIdentifier

[?Yield, ?Await]

;

ReturnStatement

[Yield, Await]

return

;

return

[no LineTerminator here]

Expression

[+In, ?Yield, ?Await]

;

ThrowStatement

[Yield, Await]

throw

[no LineTerminator here]

Expression

[+In, ?Yield, ?Await]

;

YieldExpression

[In, Await]

yield

[no LineTerminator here]

AssignmentExpression

[?In, +Yield, ?Await]

yield

[no LineTerminator here]

AssignmentExpression

[?In, +Yield, ?Await]

ArrowFunction

[In, Yield, Await]

ArrowParameters

[?Yield, ?Await]

[no LineTerminator here]

ConciseBody

[?In]

AsyncFunctionDeclaration

[Yield, Await, Default]

async

[no LineTerminator here]

function

[?Yield, ?Await]

(

[~Yield, +Await]

)

{

}

[+Default]

async

[no LineTerminator here]

function

(

[~Yield, +Await]

)

{

}

AsyncFunctionExpression

async

[no LineTerminator here]

function

[~Yield, +Await]

opt

(

[~Yield, +Await]

)

{

}

AsyncMethod

[Yield, Await]

async

[no LineTerminator here]

ClassElementName

[?Yield, ?Await]

(

UniqueFormalParameters

[~Yield, +Await]

)

{

AsyncGeneratorDeclaration

}

[Yield, Await, Default]

async

[no LineTerminator here]

function

[?Yield, ?Await]

(

[+Yield, +Await]

)

{

}

[+Default]

async

[no LineTerminator here]

function

(

[+Yield, +Await]

)

{

}

AsyncGeneratorExpression

async

[no LineTerminator here]

function

[+Yield, +Await]

opt

(

[+Yield, +Await]

)

{

}

AsyncGeneratorMethod

[Yield, Await]

async

[no LineTerminator here]

ClassElementName

[?Yield, ?Await]

(

UniqueFormalParameters

[+Yield, +Await]

)

{