Composing Mapped Hierarchies with Mixins

A common need when mapping classes using the Declarative style is to share some functionality, such as a set of common columns, some common table options, or other mapped properties, across many classes. The standard Python idioms for this is to have the classes inherit from a superclass which includes these common features.

When using declarative mappings, this idiom is allowed via the usage of mixin classes, as well as via augmenting the declarative base produced by either the registry.generate_base() method or declarative_base() functions.

An example of some commonly mixed-in idioms is below:

  1. from sqlalchemy.orm import declared_attr
  2. class MyMixin(object):
  3. @declared_attr
  4. def __tablename__(cls):
  5. return cls.__name__.lower()
  6. __table_args__ = {'mysql_engine': 'InnoDB'}
  7. __mapper_args__= {'always_refresh': True}
  8. id = Column(Integer, primary_key=True)
  9. class MyModel(MyMixin, Base):
  10. name = Column(String(1000))

Where above, the class MyModel will contain an “id” column as the primary key, a __tablename__ attribute that derives from the name of the class itself, as well as __table_args__ and __mapper_args__ defined by the MyMixin mixin class.

There’s no fixed convention over whether MyMixin precedes Base or not. Normal Python method resolution rules apply, and the above example would work just as well with:

  1. class MyModel(Base, MyMixin):
  2. name = Column(String(1000))

This works because Base here doesn’t define any of the variables that MyMixin defines, i.e. __tablename__, __table_args__, id, etc. If the Base did define an attribute of the same name, the class placed first in the inherits list would determine which attribute is used on the newly defined class.

Augmenting the Base

In addition to using a pure mixin, most of the techniques in this section can also be applied to the base class itself, for patterns that should apply to all classes derived from a particular base. This is achieved using the cls argument of the declarative_base() function:

  1. from sqlalchemy.orm import declared_attr
  2. class Base(object):
  3. @declared_attr
  4. def __tablename__(cls):
  5. return cls.__name__.lower()
  6. __table_args__ = {'mysql_engine': 'InnoDB'}
  7. id = Column(Integer, primary_key=True)
  8. from sqlalchemy.orm import declarative_base
  9. Base = declarative_base(cls=Base)
  10. class MyModel(Base):
  11. name = Column(String(1000))

Where above, MyModel and all other classes that derive from Base will have a table name derived from the class name, an id primary key column, as well as the “InnoDB” engine for MySQL.

Mixing in Columns

The most basic way to specify a column on a mixin is by simple declaration:

  1. class TimestampMixin(object):
  2. created_at = Column(DateTime, default=func.now())
  3. class MyModel(TimestampMixin, Base):
  4. __tablename__ = 'test'
  5. id = Column(Integer, primary_key=True)
  6. name = Column(String(1000))

Where above, all declarative classes that include TimestampMixin will also have a column created_at that applies a timestamp to all row insertions.

Those familiar with the SQLAlchemy expression language know that the object identity of clause elements defines their role in a schema. Two Table objects a and b may both have a column called id, but the way these are differentiated is that a.c.id and b.c.id are two distinct Python objects, referencing their parent tables a and b respectively.

In the case of the mixin column, it seems that only one Column object is explicitly created, yet the ultimate created_at column above must exist as a distinct Python object for each separate destination class. To accomplish this, the declarative extension creates a copy of each Column object encountered on a class that is detected as a mixin.

This copy mechanism is limited to simple columns that have no foreign keys, as a ForeignKey itself contains references to columns which can’t be properly recreated at this level. For columns that have foreign keys, as well as for the variety of mapper-level constructs that require destination-explicit context, the declared_attr decorator is provided so that patterns common to many classes can be defined as callables:

  1. from sqlalchemy.orm import declared_attr
  2. class ReferenceAddressMixin(object):
  3. @declared_attr
  4. def address_id(cls):
  5. return Column(Integer, ForeignKey('address.id'))
  6. class User(ReferenceAddressMixin, Base):
  7. __tablename__ = 'user'
  8. id = Column(Integer, primary_key=True)

Where above, the address_id class-level callable is executed at the point at which the User class is constructed, and the declarative extension can use the resulting Column object as returned by the method without the need to copy it.

Columns generated by declared_attr can also be referenced by __mapper_args__ to a limited degree, currently by polymorphic_on and version_id_col; the declarative extension will resolve them at class construction time:

  1. class MyMixin:
  2. @declared_attr
  3. def type_(cls):
  4. return Column(String(50))
  5. __mapper_args__= {'polymorphic_on':type_}
  6. class MyModel(MyMixin, Base):
  7. __tablename__='test'
  8. id = Column(Integer, primary_key=True)

Mixing in Relationships

Relationships created by relationship() are provided with declarative mixin classes exclusively using the declared_attr approach, eliminating any ambiguity which could arise when copying a relationship and its possibly column-bound contents. Below is an example which combines a foreign key column and a relationship so that two classes Foo and Bar can both be configured to reference a common target class via many-to-one:

  1. class RefTargetMixin(object):
  2. @declared_attr
  3. def target_id(cls):
  4. return Column('target_id', ForeignKey('target.id'))
  5. @declared_attr
  6. def target(cls):
  7. return relationship("Target")
  8. class Foo(RefTargetMixin, Base):
  9. __tablename__ = 'foo'
  10. id = Column(Integer, primary_key=True)
  11. class Bar(RefTargetMixin, Base):
  12. __tablename__ = 'bar'
  13. id = Column(Integer, primary_key=True)
  14. class Target(Base):
  15. __tablename__ = 'target'
  16. id = Column(Integer, primary_key=True)

Using Advanced Relationship Arguments (e.g. primaryjoin, etc.)

relationship() definitions which require explicit primaryjoin, order_by etc. expressions should in all but the most simplistic cases use late bound forms for these arguments, meaning, using either the string form or a function/lambda. The reason for this is that the related Column objects which are to be configured using @declared_attr are not available to another @declared_attr attribute; while the methods will work and return new Column objects, those are not the Column objects that Declarative will be using as it calls the methods on its own, thus using different Column objects.

The canonical example is the primaryjoin condition that depends upon another mixed-in column:

  1. class RefTargetMixin(object):
  2. @declared_attr
  3. def target_id(cls):
  4. return Column('target_id', ForeignKey('target.id'))
  5. @declared_attr
  6. def target(cls):
  7. return relationship(Target,
  8. primaryjoin=Target.id==cls.target_id # this is *incorrect*
  9. )

Mapping a class using the above mixin, we will get an error like:

  1. sqlalchemy.exc.InvalidRequestError: this ForeignKey's parent column is not
  2. yet associated with a Table.

This is because the target_id Column we’ve called upon in our target() method is not the same Column that declarative is actually going to map to our table.

The condition above is resolved using a lambda:

  1. class RefTargetMixin(object):
  2. @declared_attr
  3. def target_id(cls):
  4. return Column('target_id', ForeignKey('target.id'))
  5. @declared_attr
  6. def target(cls):
  7. return relationship(Target,
  8. primaryjoin=lambda: Target.id==cls.target_id
  9. )

or alternatively, the string form (which ultimately generates a lambda):

  1. class RefTargetMixin(object):
  2. @declared_attr
  3. def target_id(cls):
  4. return Column('target_id', ForeignKey('target.id'))
  5. @declared_attr
  6. def target(cls):
  7. return relationship("Target",
  8. primaryjoin="Target.id==%s.target_id" % cls.__name__
  9. )

See also

Late-Evaluation of Relationship Arguments

Mixing in deferred(), column_property(), and other MapperProperty classes

Like relationship(), all MapperProperty subclasses such as deferred(), column_property(), etc. ultimately involve references to columns, and therefore, when used with declarative mixins, have the declared_attr requirement so that no reliance on copying is needed:

  1. class SomethingMixin(object):
  2. @declared_attr
  3. def dprop(cls):
  4. return deferred(Column(Integer))
  5. class Something(SomethingMixin, Base):
  6. __tablename__ = "something"

The column_property() or other construct may refer to other columns from the mixin. These are copied ahead of time before the declared_attr is invoked:

  1. class SomethingMixin(object):
  2. x = Column(Integer)
  3. y = Column(Integer)
  4. @declared_attr
  5. def x_plus_y(cls):
  6. return column_property(cls.x + cls.y)

Changed in version 1.0.0: mixin columns are copied to the final mapped class so that declared_attr methods can access the actual column that will be mapped.

Mixing in Association Proxy and Other Attributes

Mixins can specify user-defined attributes as well as other extension units such as association_proxy(). The usage of declared_attr is required in those cases where the attribute must be tailored specifically to the target subclass. An example is when constructing multiple association_proxy() attributes which each target a different type of child object. Below is an association_proxy() mixin example which provides a scalar list of string values to an implementing class:

  1. from sqlalchemy import Column, Integer, ForeignKey, String
  2. from sqlalchemy.orm import relationship
  3. from sqlalchemy.ext.associationproxy import association_proxy
  4. from sqlalchemy.orm import declarative_base, declared_attr
  5. Base = declarative_base()
  6. class HasStringCollection(object):
  7. @declared_attr
  8. def _strings(cls):
  9. class StringAttribute(Base):
  10. __tablename__ = cls.string_table_name
  11. id = Column(Integer, primary_key=True)
  12. value = Column(String(50), nullable=False)
  13. parent_id = Column(Integer,
  14. ForeignKey('%s.id' % cls.__tablename__),
  15. nullable=False)
  16. def __init__(self, value):
  17. self.value = value
  18. return relationship(StringAttribute)
  19. @declared_attr
  20. def strings(cls):
  21. return association_proxy('_strings', 'value')
  22. class TypeA(HasStringCollection, Base):
  23. __tablename__ = 'type_a'
  24. string_table_name = 'type_a_strings'
  25. id = Column(Integer(), primary_key=True)
  26. class TypeB(HasStringCollection, Base):
  27. __tablename__ = 'type_b'
  28. string_table_name = 'type_b_strings'
  29. id = Column(Integer(), primary_key=True)

Above, the HasStringCollection mixin produces a relationship() which refers to a newly generated class called StringAttribute. The StringAttribute class is generated with its own Table definition which is local to the parent class making usage of the HasStringCollection mixin. It also produces an association_proxy() object which proxies references to the strings attribute onto the value attribute of each StringAttribute instance.

TypeA or TypeB can be instantiated given the constructor argument strings, a list of strings:

  1. ta = TypeA(strings=['foo', 'bar'])
  2. tb = TypeB(strings=['bat', 'bar'])

This list will generate a collection of StringAttribute objects, which are persisted into a table that’s local to either the type_a_strings or type_b_strings table:

  1. >>> print(ta._strings)
  2. [<__main__.StringAttribute object at 0x10151cd90>,
  3. <__main__.StringAttribute object at 0x10151ce10>]

When constructing the association_proxy(), the declared_attr decorator must be used so that a distinct association_proxy() object is created for each of the TypeA and TypeB classes.

Controlling table inheritance with mixins

The __tablename__ attribute may be used to provide a function that will determine the name of the table used for each class in an inheritance hierarchy, as well as whether a class has its own distinct table.

This is achieved using the declared_attr indicator in conjunction with a method named __tablename__(). Declarative will always invoke declared_attr for the special names __tablename__, __mapper_args__ and __table_args__ function for each mapped class in the hierarchy, except if overridden in a subclass. The function therefore needs to expect to receive each class individually and to provide the correct answer for each.

For example, to create a mixin that gives every class a simple table name based on class name:

  1. from sqlalchemy.orm import declared_attr
  2. class Tablename:
  3. @declared_attr
  4. def __tablename__(cls):
  5. return cls.__name__.lower()
  6. class Person(Tablename, Base):
  7. id = Column(Integer, primary_key=True)
  8. discriminator = Column('type', String(50))
  9. __mapper_args__ = {'polymorphic_on': discriminator}
  10. class Engineer(Person):
  11. __tablename__ = None
  12. __mapper_args__ = {'polymorphic_identity': 'engineer'}
  13. primary_language = Column(String(50))

Alternatively, we can modify our __tablename__ function to return None for subclasses, using has_inherited_table(). This has the effect of those subclasses being mapped with single table inheritance against the parent:

  1. from sqlalchemy.orm import declared_attr
  2. from sqlalchemy.orm import has_inherited_table
  3. class Tablename(object):
  4. @declared_attr
  5. def __tablename__(cls):
  6. if has_inherited_table(cls):
  7. return None
  8. return cls.__name__.lower()
  9. class Person(Tablename, Base):
  10. id = Column(Integer, primary_key=True)
  11. discriminator = Column('type', String(50))
  12. __mapper_args__ = {'polymorphic_on': discriminator}
  13. class Engineer(Person):
  14. primary_language = Column(String(50))
  15. __mapper_args__ = {'polymorphic_identity': 'engineer'}

Mixing in Columns in Inheritance Scenarios

In contrast to how __tablename__ and other special names are handled when used with declared_attr, when we mix in columns and properties (e.g. relationships, column properties, etc.), the function is invoked for the base class only in the hierarchy. Below, only the Person class will receive a column called id; the mapping will fail on Engineer, which is not given a primary key:

  1. class HasId(object):
  2. @declared_attr
  3. def id(cls):
  4. return Column('id', Integer, primary_key=True)
  5. class Person(HasId, Base):
  6. __tablename__ = 'person'
  7. discriminator = Column('type', String(50))
  8. __mapper_args__ = {'polymorphic_on': discriminator}
  9. class Engineer(Person):
  10. __tablename__ = 'engineer'
  11. primary_language = Column(String(50))
  12. __mapper_args__ = {'polymorphic_identity': 'engineer'}

It is usually the case in joined-table inheritance that we want distinctly named columns on each subclass. However in this case, we may want to have an id column on every table, and have them refer to each other via foreign key. We can achieve this as a mixin by using the declared_attr.cascading modifier, which indicates that the function should be invoked for each class in the hierarchy, in almost (see warning below) the same way as it does for __tablename__:

  1. class HasIdMixin(object):
  2. @declared_attr.cascading
  3. def id(cls):
  4. if has_inherited_table(cls):
  5. return Column(ForeignKey('person.id'), primary_key=True)
  6. else:
  7. return Column(Integer, primary_key=True)
  8. class Person(HasIdMixin, Base):
  9. __tablename__ = 'person'
  10. discriminator = Column('type', String(50))
  11. __mapper_args__ = {'polymorphic_on': discriminator}
  12. class Engineer(Person):
  13. __tablename__ = 'engineer'
  14. primary_language = Column(String(50))
  15. __mapper_args__ = {'polymorphic_identity': 'engineer'}

Warning

The declared_attr.cascading feature currently does not allow for a subclass to override the attribute with a different function or value. This is a current limitation in the mechanics of how @declared_attr is resolved, and a warning is emitted if this condition is detected. This limitation does not exist for the special attribute names such as __tablename__, which resolve in a different way internally than that of declared_attr.cascading.

New in version 1.0.0: added declared_attr.cascading.

Combining Table/Mapper Arguments from Multiple Mixins

In the case of __table_args__ or __mapper_args__ specified with declarative mixins, you may want to combine some parameters from several mixins with those you wish to define on the class itself. The declared_attr decorator can be used here to create user-defined collation routines that pull from multiple collections:

  1. from sqlalchemy.orm import declared_attr
  2. class MySQLSettings(object):
  3. __table_args__ = {'mysql_engine':'InnoDB'}
  4. class MyOtherMixin(object):
  5. __table_args__ = {'info':'foo'}
  6. class MyModel(MySQLSettings, MyOtherMixin, Base):
  7. __tablename__='my_model'
  8. @declared_attr
  9. def __table_args__(cls):
  10. args = dict()
  11. args.update(MySQLSettings.__table_args__)
  12. args.update(MyOtherMixin.__table_args__)
  13. return args
  14. id = Column(Integer, primary_key=True)

Creating Indexes with Mixins

To define a named, potentially multicolumn Index that applies to all tables derived from a mixin, use the “inline” form of Index and establish it as part of __table_args__:

  1. class MyMixin(object):
  2. a = Column(Integer)
  3. b = Column(Integer)
  4. @declared_attr
  5. def __table_args__(cls):
  6. return (Index('test_idx_%s' % cls.__tablename__, 'a', 'b'),)
  7. class MyModel(MyMixin, Base):
  8. __tablename__ = 'atable'
  9. c = Column(Integer,primary_key=True)