- 6. New string features
- 6.1 Overview
- 6.2 Unicode code point escapes
- 6.3 String interpolation, multi-line string literals and raw string literals
- 6.4 Iterating over strings
- 6.5 Numeric values of code points
- 6.6 Checking for inclusion
- 6.7 Repeating strings
- 6.8 String methods that delegate regular expression work to their parameters
- 6.9 Reference: the new string methods
Please support this book: buy it (PDF, EPUB, MOBI) or donate
6. New string features
6.1 Overview
New string methods:
> 'hello'.startsWith('hell')
- true
- > 'hello'.endsWith('ello')
- true
- > 'hello'.includes('ell')
- true
- > 'doo '.repeat(3)
- 'doo doo doo '
ES6 has a new kind of string literal, the template literal:
// String interpolation via template literals (in backticks)
const
first
=
'Jane'
;
const
last
=
'Doe'
;
console
.
log
(
`Hello
${
first
}
${
last
}
!`
);
// Hello Jane Doe!
// Template literals also let you create strings with multiple lines
const
multiLine
=
`
This is
a string
with multiple
lines`
;
6.2 Unicode code point escapes
In ECMAScript 6, there is a new kind of Unicode escape that lets you specify any code point (even those beyond 16 bits):
console
.
log
(
'\u{1F680}'
);
// ES6: single code point
console
.
log
(
'\uD83D\uDE80'
);
// ES5: two code units
More information on escapes is given in the chapter on Unicode.
6.3 String interpolation, multi-line string literals and raw string literals
Template literals are described in depth in their own chapter. They provide three interesting features.
First, template literals support string interpolation:
const
first
=
'Jane'
;
const
last
=
'Doe'
;
console
.
log
(
`Hello
${
first
}
${
last
}
!`
);
// Hello Jane Doe!
Second, template literals can contain multiple lines:
const
multiLine
=
`
This is
a string
with multiple
lines`
;
Third, template literals are “raw” if you prefix them with the tag String.raw
– the backslash is not a special character and escapes such as \n
are not interpreted:
const
str
=
String
.
raw
`Not a newline:
\
n`
;
console
.
log
(
str
===
'Not a newline: \\n'
);
// true
6.4 Iterating over strings
Strings are iterable, which means that you can use for-of
to iterate over their characters:
for
(
const
ch
of
'abc'
)
{
console
.
log
(
ch
);
}
// Output:
// a
// b
// c
And you can use the spread operator (…
) to turn strings into Arrays:
const
chars
=
[...
'abc'
];
// ['a', 'b', 'c']
6.4.1 Iteration honors Unicode code points
The string iterator splits strings along code point boundaries, which means that the strings it returns comprise one or two JavaScript characters:
for
(
const
ch
of
'x\uD83D\uDE80y'
)
{
console
.
log
(
ch
.
length
);
}
// Output:
// 1
// 2
// 1
6.4.2 Counting code points
Iteration gives you a quick way to count the Unicode code points in a string:
> [...'x\uD83D\uDE80y'].length
- 3
6.4.3 Reversing strings with non-BMP code points
Iteration also helps with reversing strings that contain non-BMP code points (which are larger than 16 bit and encoded as two JavaScript characters):
const
str
=
'x\uD83D\uDE80y'
;
// ES5: \uD83D\uDE80 are (incorrectly) reversed
console
.
log
(
str
.
split
(
''
).
reverse
().
join
(
''
));
// 'y\uDE80\uD83Dx'
// ES6: order of \uD83D\uDE80 is preserved
console
.
log
([...
str
].
reverse
().
join
(
''
));
// 'y\uD83D\uDE80x'
6.5 Numeric values of code points
The new method codePointAt()
returns the numeric value of a code point at a given index in a string:
const
str
=
'x\uD83D\uDE80y'
;
console
.
log
(
str
.
codePointAt
(
0
).
toString
(
16
));
// 78
console
.
log
(
str
.
codePointAt
(
1
).
toString
(
16
));
// 1f680
console
.
log
(
str
.
codePointAt
(
3
).
toString
(
16
));
// 79
This method works well when combined with iteration over strings:
for
(
const
ch
of
'x\uD83D\uDE80y'
)
{
console
.
log
(
ch
.
codePointAt
(
0
).
toString
(
16
));
}
// Output:
// 78
// 1f680
// 79
The opposite of codePointAt()
is String.fromCodePoint()
:
> String.fromCodePoint(0x78, 0x1f680, 0x79) === 'x\uD83D\uDE80y'
- true
6.6 Checking for inclusion
Three new methods check whether a string exists within another string:
> 'hello'.startsWith('hell')
- true
- > 'hello'.endsWith('ello')
- true
- > 'hello'.includes('ell')
- true
Each of these methods has a position as an optional second parameter, which specifies where the string to be searched starts or ends:
> 'hello'.startsWith('ello', 1)
- true
- > 'hello'.endsWith('hell', 4)
- true
- > 'hello'.includes('ell', 1)
- true
- > 'hello'.includes('ell', 2)
- false
6.7 Repeating strings
The repeat()
method repeats strings:
> 'doo '.repeat(3)
- 'doo doo doo '
6.8 String methods that delegate regular expression work to their parameters
In ES6, the four string methods that accept regular expression parameters do relatively little. They mainly call methods of their parameters:
String.prototype.match(regexp)
callsregexpSymbol.match
.String.prototype.replace(searchValue, replaceValue)
callssearchValueSymbol.replace
.String.prototype.search(regexp)
callsregexpSymbol.search
.String.prototype.split(separator, limit)
callsseparatorSymbol.split
. The parameters don’t have to be regular expressions, anymore. Any objects with appropriate methods will do.
6.9 Reference: the new string methods
Tagged templates:
String.raw(callSite, …substitutions) : string
Template tag for “raw” content (backslashes are not interpreted):
> String.raw`\n` === '\\n'
- true
Consult the chapter on template literals for more information.
Unicode and code points:
String.fromCodePoint(…codePoints : number[]) : string
Turns numbers denoting Unicode code points into a string.String.prototype.codePointAt(pos) : number
Returns the number of the code point starting at positionpos
(comprising one or two JavaScript characters).String.prototype.normalize(form? : string) : string
Different combinations of code points may look the same. Unicode normalization changes them all to the same value(s), their so-called canonical representation. That helps with comparing and searching for strings. The'NFC'
form is recommended for general text. Finding strings:String.prototype.startsWith(searchString, position=0) : boolean
Does the receiver start withsearchString
?position
lets you specify where the string to be checked starts.String.prototype.endsWith(searchString, endPosition=searchString.length) : boolean
Does the receiver end withsearchString
?endPosition
lets you specify where the string to be checked ends.String.prototype.includes(searchString, position=0) : boolean
Does the receiver containsearchString
?position
lets you specify where the string to be searched starts. Repeating strings:String.prototype.repeat(count) : string
Returns the receiver, concatenatedcount
times.