I18n

When talking about I18n here, we’re going to talk about translation strings in user interfaces and on the frontend. For content in multiple languages or language pickers for users, you will need to install a plugin to provide the editing tools for posts and other content types.

At any point, you can manually set the language WordPress uses by user or overriding the WPLANG option. Older tutorials will recommend the WP_LANG constant, but this has been deprecated

However you will need to make sure the necessary language files are in place in your wp-content/languages folder before the change takes full effect.

Translation work falls under the Polyglots group at contributor days. If you’re interested in translating WordPress Core, you should read the official translators handbook to find out how

Securing Language Files

Language files have 2 major attack vectors:

  • Unescaped translation strings containing javascript tags
  • n-plurals

Embedded Security Risks

It’s important to use the escaping functions with the translation API to verify that dangerous content isn’t inserted. It’s possible to do by placing language files for a particular translation domain inside a WordPress install

You can save typing out echo esc_html( __( '','' ) ) by using the helper functions:

  • esc_html__ instead of __
  • esc_attr__ instead of __
  • etc

There are also an extended set of functions that simplify this further by adding e to the function name:

  • esc_html_e
  • esc_attr_e

These will output on their own, so an echo statement isn’t needed.

Translation API Abuse

The esc_html_e helper functions are sometimes misused as a susbtitute for echo esc_html. Always use the second parameter that sets the translation domain. If you don’t it could have unanticipated side effects as your strings are mistranslated:

  1. // Good:
  2. echo esc_html('date');
  3. // Great:
  4. esc_html_e( 'date', 'mytheme' );
  5. // Bad:
  6. esc_html_e( 'date' );

n-plurals

A relatively unknown part of the translation format is the n-plurals field. This determines the way plural forms work in a language for a particular file.

Because of its complexity, and for performance reasons, WordPress loads this field as a string, wraps it in a function, and passes the result to eval. Because of this, it’s very easy to craft a language file with a primitive PHP shell.

The only way to mitigate this is code review/manual inspection.

Setting the Admin language

On installation WordPress asks you to select a language, but you may want to set different languages for the front end back end. For example a German website ran by an English speaker may want the admin area to be in their native language.

In order to do this, set your sites language to German using the WP_LANG constant mentioned earlier, and add this code to set the admin area language to english:

  1. add_filter('locale', 'wpse27056_setLocale');
  2. function wpse27056_setLocale($locale) {
  3. if ( is_admin() ) {
  4. return 'en_US';
  5. }
  6. return $locale;
  7. }

Foreign Twitter Embeds

Sometimes oembeds come back in an unexpected foreign language, this is because the service being used is looking at your servers request and tracing it back to its origin to determine it’s country. For example, an English website hosted on a German server may result in German twitter embeds.

Further Reading