Form and field validation
Form validation happens when the data is cleaned. If you want to customize this process, there are various places to make changes, each one serving a different purpose. Three types of cleaning methods are run during form processing. These are normally executed when you call the is_valid() method on a form. There are other things that can also trigger cleaning and validation (accessing the errors attribute or calling full_clean() directly), but normally they won't be needed.
In general, any cleaning method can raise ValidationError if there is a problem with the data it is processing, passing the relevant information to the ValidationError constructor. See below for the best practice in raising ValidationError. If no ValidationError is raised, the method should return the cleaned (normalized) data as a Python object.
Most validation can be done using validators, which are helpers that can be reused easily. Validators are functions (or callables) that take a single argument and raise ValidationError on invalid input. Validators are run after the field's to_python and validate methods have been called.
Validation of a form is split into several steps, which can be customized or overridden:
- The to_python() method on a Field is the first step in every validation. It coerces the value to a correct datatype and raises ValidationError if that is not possible. This method accepts the raw value from the widget and returns the converted value. For example, a FloatField will turn the data into a Python float or raise a ValidationError.
- The validate() method on a Field handles field-specific validation that is not suitable for a validator. It takes a value that has been coerced to a correct datatype and raises ValidationError on any error. This method does not return anything and shouldn't alter the value. You should override it to handle validation logic that you can't or don't want to put in a validator.
- The run_validators() method on a Field runs all of the field's validators and aggregates all the errors into a single ValidationError. You shouldn't need to override this method.
- The clean() method on a Field subclass is responsible for running to_python(), validate(), and run_validators() in the correct order and propagating their errors. If, at any time, any of the methods raise ValidationError, the validation stops and that error is raised. This method returns the clean data, which is then inserted into the cleaned_data dictionary of the form.
- The clean_<fieldname>() method is called on a form subclass, where <fieldname> is replaced with the name of the form field attribute. This method does any cleaning that is specific to that particular attribute, unrelated to the type of field that it is. This method is not passed any parameters. You will need to look up the value of the field in self.cleaned_data and remember that it will be a Python object at this point, not the original string submitted in the form (it will be in cleaned_data because the general field clean() method, above, has already cleaned the data once).
For example, if you wanted to validate that the contents of a CharField called serialnumber was unique, clean_serialnumber() would be the right place to do this. You don't need a specific field (it's just a CharField), but you want a formfield-specific piece of validation and, possibly, cleaning/normalizing the data.
The return value of this method replaces the existing value in cleaned_data, so it must be the field's value from cleaned_data (even if this method didn't change it) or a new cleaned value.
- The form subclass's clean() method can perform validation that requires access to multiple form fields. This is where you might put in checks such as "if field A is supplied, field B must contain a valid email address". This method can return a completely different dictionary if it wishes, which will be used as the cleaned_data.
Since the field validation methods have been run by the time clean() is called, you also have access to the form's errors attribute, which contains all the errors raised by cleaning of individual fields.
Note that any errors raised by your Form.clean() override will not be associated with any field in particular. They go into a special "field" (called __all__), which you can access via the non_field_errors() method if you need to. If you want to attach errors to a specific field in the form, you need to call add_error().
Also note that there are special considerations when overriding the clean() method of a ModelForm subclass. (See the ModelForm documentation for more information.)
These methods are run in the order given above, one field at a time. That is, for each field in the form (in the order they are declared in the form definition), the Field.clean() method (or its override) is run, then clean_<fieldname>(). Finally, once those two methods are run for every field, the Form.clean() method, or its override, is executed whether or not the previous methods have raised errors.
Examples of each of these methods are provided below.
As mentioned, any of these methods can raise a ValidationError. For any field, if the Field.clean() method raises a ValidationError, any field-specific cleaning method is not called. However, the cleaning methods for all remaining fields are still executed.
Raising ValidationError
In order to make error messages flexible and easy to override, consider the following guidelines:
- Provide a descriptive error code to the constructor:
      # Good
      ValidationError(_('Invalid value'), code='invalid')
      # Bad
      ValidationError(_('Invalid value'))
- Don't coerce variables into the message; use placeholders and the params argument of the constructor:
      # Good
      ValidationError(
          _('Invalid value: %(value)s'),
          params={'value': '42'},
      )
      # Bad
      ValidationError(_('Invalid value: %s') % value)
- Use mapping keys instead of positional formatting. This enables putting the variables in any order or omitting them altogether when rewriting the message:
      # Good
      ValidationError(
          _('Invalid value: %(value)s'),
          params={'value': '42'},
      )
      # Bad
      ValidationError(
          _('Invalid value: %s'),
          params=('42',),
      )
- Wrap the message with gettext to enable translation:
      # Good
      ValidationError(_('Invalid value'))
      # Bad
      ValidationError('Invalid value')
Putting it all together:
    raise ValidationError(
        _('Invalid value: %(value)s'),
        code='invalid',
        params={'value': '42'},
    )
Following these guidelines is particularly necessary if you write reusable forms, form fields, and model fields.
While not recommended, if you are at the end of the validation chain (i.e. your form clean() method) and you know you will never need to override your error message you can still opt for the less verbose:
    ValidationError(_('Invalid value: %s') % value)
The Form.errors.as_data() and Form.errors.as_json() methods greatly benefit from fully featured ValidationErrors (with a code name and a params dictionary).
Raising multiple errors
If you detect multiple errors during a cleaning method and wish to signal all of them to the form submitter, it is possible to pass a list of errors to the ValidationError constructor.
As above, it is recommended to pass a list of ValidationError instances with codes and params but a list of strings will also work:
    # Good
    raise ValidationError([
        ValidationError(_('Error 1'), code='error1'),
        ValidationError(_('Error 2'), code='error2'),
    ])
    # Bad
    raise ValidationError([
        _('Error 1'),
        _('Error 2'),
    ])
Using validation in practice
The previous sections explained how validation works in general for forms. Since it can sometimes be easier to put things into place by seeing each feature in use, here are a series of small examples that use each of the previous features.
Using validators
Django's form (and model) fields support use of utility functions and classes known as validators. A validator is a callable object or function that takes a value and returns nothing if the value is valid or raises a ValidationError if not. These can be passed to a field's constructor, via the field's validators argument, or defined on the Field class itself with the default_validators attribute.
Simple validators can be used to validate values inside the field. Let's have a look at Django's SlugField:
    from django.core import validators
    from django.forms import CharField

    class SlugField(CharField):
        default_validators = [validators.validate_slug]
As you can see, SlugField is just a CharField with a customized validator that validates that submitted text obeys some character rules. This can also be done on field definition so:
    slug = forms.SlugField()
is equivalent to:
    slug = forms.CharField(validators=[validators.validate_slug])
Common cases such as validating against an email or a regular expression can be handled using existing validator classes available in Django. For example, validators.validate_slug is an instance of a RegexValidator constructed with the first argument being the pattern: ^[-a-zA-Z0-9_]+$. See the section on writing validators to see a list of what is already available and for an example of how to write a validator.
Form field default cleaning
Let's first create a custom form field that validates its input is a string containing comma-separated email addresses. The full class looks like this:
    from django import forms
    from django.core.validators import validate_email

    class MultiEmailField(forms.Field):
        def to_python(self, value):
            """Normalize data to a list of strings."""
            # Return an empty list if no input was given.
            if not value:
                return []
            return value.split(',')

        def validate(self, value):
            """Check if value consists only of valid emails."""
            # Use the parent's handling of required fields, etc.
            super().validate(value)
            for email in value:
                validate_email(email)
Every form that uses this field will have these methods run before anything else can be done with the field's data. This is cleaning that is specific to this type of field, regardless of how it is subsequently used.
Let's create a simple ContactForm to demonstrate how you'd use this field:
    class ContactForm(forms.Form):
        subject = forms.CharField(max_length=100)
        message = forms.CharField()
        sender = forms.EmailField()
        recipients = MultiEmailField()
        cc_myself = forms.BooleanField(required=False)
Simply use MultiEmailField like any other form field. When the is_valid() method is called on the form, the MultiEmailField.clean() method will be run as part of the cleaning process and it will, in turn, call the custom to_python() and validate() methods.
Cleaning a specific field attribute
Continuing on from the previous example, suppose that in our ContactForm, we want to make sure that the recipients field always contains the address "fred@example.com". This is validation that is specific to our form, so we don't want to put it into the general MultiEmailField class. Instead, we write a cleaning method that operates on the recipients field, like so:
    from django import forms

    class ContactForm(forms.Form):
        # Everything as before.
        ...

        def clean_recipients(self):
            data = self.cleaned_data['recipients']
            if "fred@example.com" not in data:
                raise forms.ValidationError("You have forgotten about Fred!")

            # Always return a value to use as the new cleaned data, even if
            # this method didn't change it.
            return data
Cleaning and validating fields that depend on each other
Suppose we add another requirement to our contact form: if the cc_myself field is True, the subject must contain the word "help". We are performing validation on more than one field at a time, so the form's clean() method is a good spot to do this. Notice that we are talking about the clean() method on the form here, whereas earlier we were writing a clean() method on a field. It's important to keep the field and form difference clear when working out where to validate things. Fields are single data points; forms are collections of fields.
By the time the form's clean() method is called, all the individual field clean methods will have been run (the previous two sections), so self.cleaned_data will be populated with any data that has survived so far. You therefore need to allow for the possibility that the fields you want to validate did not survive the initial individual field checks.
There are two ways to report any errors from this step. Probably the most common method is to display the error at the top of the form. To create such an error, you can raise a ValidationError from the clean() method. For example:
    from django import forms

    class ContactForm(forms.Form):
        # Everything as before.
        ...

        def clean(self):
            cleaned_data = super().clean()
            cc_myself = cleaned_data.get("cc_myself")
            subject = cleaned_data.get("subject")

            if cc_myself and subject:
                # Only do something if both fields are valid so far.
                if "help" not in subject:
                    raise forms.ValidationError(
                        "Did not send for 'help' in the subject despite "
                        "CC'ing yourself."
                    )
In this code, if the validation error is raised, the form will display an error message at the top of the form (normally) describing the problem.
The call to super().clean() in the example code ensures that any validation logic in parent classes is maintained. If your form inherits another that doesn't return a cleaned_data dictionary in its clean() method (doing so is optional), then don't assign cleaned_data to the result of the super() call and use self.cleaned_data instead:
    def clean(self):
        super().clean()
        cc_myself = self.cleaned_data.get("cc_myself")
        ...
The second approach for reporting validation errors might involve assigning the error message to one of the fields. In this case, let's assign an error message to both the "subject" and "cc_myself" rows in the form display. Be careful when doing this in practice, since it can lead to confusing form output. We're showing what is possible here and leaving it up to you and your designers to work out what works effectively in your particular situation. Our new code (replacing the previous sample) looks like this:
    from django import forms

    class ContactForm(forms.Form):
        # Everything as before.
        ...

        def clean(self):
            cleaned_data = super().clean()
            cc_myself = cleaned_data.get("cc_myself")
            subject = cleaned_data.get("subject")

            if cc_myself and subject and "help" not in subject:
                msg = "Must put 'help' in subject when cc'ing yourself."
                self.add_error('cc_myself', msg)
                self.add_error('subject', msg)
The second argument of add_error() can be a simple string, or preferably an instance of ValidationError. See Raising ValidationError for more details. Note that add_error() automatically removes the field from cleaned_data.