6. Syntax
6.1. An overview of JavaScript’s syntax
6.1.1. Basic syntax
Comments:
Primitive (atomic) values:
An assertion describes what the result of a computation is expected to look like and throws an exception if those expectations aren’t correct. For example, the following assertion states that the result of the computation 7 plus 1 must be 8:
assert.equal() is a method call (the object is assert, the method is .equal()) with two arguments: the actual result and the expected result. It is part of a Node.js assertion API that is explained later in this book.
Logging to the console of a browser or Node.js:
Operators:
// Operators for booleans
assert.equal(true && false, false); // And
assert.equal(true || false, true); // Or
// Operators for numbers
assert.equal(3 + 4, 7);
assert.equal(5 - 1, 4);
assert.equal(3 * 4, 12);
assert.equal(9 / 3, 3);
// Operators for strings
assert.equal('a' + 'b', 'ab');
assert.equal('I see ' + 3 + ' monkeys', 'I see 3 monkeys');
// Comparison operators
assert.equal(3 < 4, true);
assert.equal(3 <= 4, true);
assert.equal('abc' === 'abc', true);
assert.equal('abc' !== 'def', true);
Declaring variables:
Control flow statements:
Ordinary function declarations:
Arrow function expressions (used especially as arguments of function calls and method calls):
The body of an arrow function is either a code block or an expression (the terms expression and statement are explained later in this chapter).
Objects:
// Creating a plain object via an object literal
const obj = {
  first: 'Jane', // property
  last: 'Doe', // property
  getFullName() { // property (method)
    return this.first + ' ' + this.last;
  },
};
// Getting a property value
assert.equal(obj.first, 'Jane');
// Setting a property value
obj.first = 'Janey';
// Calling the method
assert.equal(obj.getFullName(), 'Janey Doe');
Arrays (Arrays are also objects):
6.1.2. Modules
Each module is a single file. Consider, for example, the following two files with modules in them:
file-tools.js
main.js
The module in file-tools.js exports its function isTextFilePath():
The module in main.js imports the whole module path and the function isTextFilePath():
6.1.3. Legal variable and property names
The grammatical category of variable names and property names is called identifier.
Identifiers are allowed to have the following characters:
- Unicode letters: A–Z, a–z (etc.)
- $, _
- Unicode digits: 0–9 (etc.)
  - Variable names can’t start with a digit
Some words have special meaning in JavaScript and are called reserved. Examples include: if, true, const.
Reserved words can’t be used as variable names:
But they are allowed as names of properties:
6.1.4. Casing styles
Common casing styles for concatenating words are:
- Camel case: threeConcatenatedWords
- Underscore case (also called snake case): three_concatenated_words
- Dash case (also called kebab case): three-concatenated-words
6.1.5. Capitalization of names
In general, JavaScript uses camel case, except for constants.
Lowercase:
- Functions, variables: myFunction
- Methods: obj.myMethod
- CSS:
  - CSS entity: special-class
  - Corresponding JavaScript variable: specialClass
Uppercase:
- Classes: MyClass
- Constants: MY_CONSTANT
  - Constants are also often written in camel case: myConstant
6.1.6. Where to put semicolons?
At the end of a statement:
But not if that statement ends with a curly brace:
However, adding a semicolon after such a statement is not a syntax error – it is interpreted as an empty statement:
6.2. (Advanced)
All remaining sections of this chapter are advanced.
6.3. Identifiers
6.3.1. Valid identifiers (variable names etc.)
First character:
- Unicode letter (including accented characters such as é and ü and characters from non-latin alphabets, such as α)
- $
- _
Subsequent characters:
- Legal first characters
- Unicode digits (including Eastern Arabic numerals)
- Some other Unicode marks and punctuations
Examples:
6.3.2. Reserved words
Reserved words can’t be variable names, but they can be property names.
All JavaScript keywords are reserved words:
await break case catch class const continue debugger default delete do else export extends finally for function if import in instanceof let new return static super switch this throw try typeof var void while with yield
The following tokens are also keywords, but currently not used in the language:
enum implements package protected interface private public
The following literals are reserved words:
true false null
Technically, these words are not reserved, but you should avoid them, too, because they effectively are keywords:
Infinity NaN undefined async
You shouldn’t use the names of global variables (String, Math, etc.) for your own variables and parameters, either.
6.4. Statement vs. expression
In this section, we explore how JavaScript distinguishes two kinds of syntactic constructs: statements and expressions. Afterwards, we’ll see that this distinction can cause problems, because the same syntax can mean different things, depending on where it is used.
6.4.1. Statements
A statement is a piece of code that can be executed and performs some kind of action. For example, if is a statement:
One more example of a statement: a function declaration.
6.4.2. Expressions
An expression is a piece of code that can be evaluated to produce a value. For example, the code between the parentheses is an expression:
The operator ?: used between the parentheses is called the ternary operator. It is the expression version of the if statement.
Let’s look at more examples of expressions. We enter expressions and the REPL evaluates them for us:
6.4.3. What is allowed where?
The current location within JavaScript source code determines which kind of syntactic constructs you are allowed to use:
- The body of a function must be a sequence of statements:
- The arguments of a function call or a method call must be expressions:
However, expressions can be used as statements. Then they are called expression statements. The opposite is not true: when the context requires an expression, you can’t use statements.
The following code demonstrates that any expression bar() can be either expression or statement – it depends on the context:
6.5. Ambiguous syntax
JavaScript has several programming constructs that are syntactically ambiguous: The same syntax is interpreted differently, depending on whether it is used in statement context or in expression context. This section explores the phenomenon and the pitfalls it causes.
6.5.1. Same syntax: function declaration and function expression
A function declaration is a statement:
A function expression is an expression (right-hand side of =):
6.5.2. Same syntax: object literal and block
In the following code, {} is an object literal: an expression that creates an empty object.
This is an empty code block (a statement):
6.5.3. Disambiguation
The ambiguities are only a problem in statement context: If the JavaScript parser encounters ambiguous syntax, it doesn’t know if it’s a plain statement or an expression statement. For example:
- If a statement starts with function: Is it a function declaration or a function expression?
- If a statement starts with {: Is it an object literal or a code block?
To resolve the ambiguity, statements starting with function or { are never interpreted as expressions. If you want an expression statement to start with either one of these tokens, you must wrap it in parentheses:
In this code:
- We first create a function, via a function expression (the part wrapped in parentheses).
- Then we invoke that function: ('abc')
The function expression from the first step is only interpreted as an expression because we wrap it in parentheses. If we didn’t, we would get a syntax error, because then JavaScript expects a function declaration and complains about the missing function name. Additionally, you can’t put a function call immediately after a function declaration.
Later in this book, we’ll see more examples of pitfalls caused by syntactic ambiguity.
6.6. Semicolons
6.6.1. Rule of thumb for semicolons
Each statement is terminated by a semicolon.
Except: statements ending with blocks.
The following case is slightly tricky:
The whole const declaration (a statement) ends with a semicolon, but inside it, there is an arrow function expression. That is: It’s not the statement per se that ends with a curly brace; it’s the embedded arrow function expression. That’s why there is a semicolon at the end.
6.6.2. Semicolons: control statements
The body of a control statement is itself a statement. For example, this is the syntax of the while loop:
The body can be a single statement:
But blocks are also statements and therefore legal bodies of control statements:
If you want a loop to have an empty body, your first option is an empty statement (which is just a semicolon):
Your second option is an empty block:
6.7. Automatic semicolon insertion (ASI)
While I recommend always writing semicolons, most of them are optional in JavaScript. The mechanism that makes this possible is called automatic semicolon insertion (ASI). In a way, it corrects syntax errors.
ASI works as follows. Parsing of a statement continues until there is either:
- A semicolon
- A line terminator followed by an illegal token
In other words, ASI can be seen as inserting semicolons at line breaks. The next subsections cover the pitfalls of ASI.
6.7.1. ASI triggered unexpectedly
The good news about ASI is that – if you don’t rely on it and always write semicolons – there is only one pitfall that you need to be aware of. It is that JavaScript forbids line breaks after some tokens. If you do insert a line break, a semicolon will be inserted, too.
The token where this is most practically relevant is return. Consider, for example, the following code:
This code is parsed as:
That is, an empty return statement, followed by a code block, followed by an empty statement.
Why does JavaScript do this? It protects against accidentally returning a value in a line after a return.
6.7.2. ASI unexpectedly not triggered
In some cases, ASI is not triggered when you think it should be. That makes life more complicated for people who don’t like semicolons, because they need to be aware of those cases. The following are three examples. There are more.
Example 1: Unintended function call.
Parsed as:
Example 2: Unintended division.
Parsed as:
Example 3: Unintended property access.
Executed as:
6.8. Semicolons: best practices
I recommend that you always write semicolons:
- I like the visual structure it gives code – you clearly see when a statement ends.
- There are fewer rules to keep in mind.
The majority of JavaScript programmers use semicolons.
However, there are also many people who don’t like the added visual clutter of semicolons. If you are one of them: code without them is legal. I recommend that you use tools to help you avoid mistakes. The following are two examples:
- The automatic code formatter Prettier can be configured to not use semicolons. It then automatically fixes problems. For example, if it encounters a line that starts with a square bracket, it prefixes that line with a semicolon.
- The static checker ESLint has a rule that you configure with your preferred style (always semicolons or as few semicolons as possible) and that warns you about critical issues.
6.9. Strict mode
Starting with ECMAScript 5, you can optionally execute JavaScript in a so-called strict mode. In that mode, the language is slightly cleaner: a few quirks don’t exist and more exceptions are thrown.
The default (non-strict) mode is also called sloppy mode.
Note that strict mode is switched on by default inside modules and classes, so you don’t really need to know about it when you write modern JavaScript (which is almost always located in modules). In this book, I assume that strict mode is always switched on.
6.9.1. Switching on strict mode
In legacy script files and CommonJS modules, you switch on strict mode for a complete file, by putting the following code in the first line:
The neat thing about this “directive” is that ECMAScript versions before 5 simply ignore it: it’s an expression statement that does nothing.
You can also switch on strict mode for just a single function:
6.9.2. Example: strict mode in action
Let’s look at an example where sloppy mode does something bad that strict mode doesn’t: Changing an unknown variable (that hasn’t been created via let or similar) creates a global variable.
Strict mode does it better:
The assert.throws() demands that its first argument, a function, throws a ReferenceError when it is called.