Today I learned about an algorithm that Python uses to sort out inheritance.

From the Wiki article on C3 linearisation, by H2power, shared under the CC BY-SA 4.0 license.

C3 linearisation algorithm

Take the following piece of code, which defines a series of classes where hierarchical dependencies are represented in the diagram above, where arrows indicate which classes inherit from which classes.

class O:
    def __repr__(self):
        return "O"

class A(O):
    def __repr__(self):
        return "A"

class B(O):
    def __repr__(self):
        return "B"

class C(O):
    def __repr__(self):
        return "C"

class D(O):
    def __repr__(self):
        return "D"

class E(O):
    def __repr__(self):
        return "E"

class K1(A, B, C):
    def __repr__(self):
        return "K1"

class K2(D, B, E):
    def __repr__(self):
        return "K2"

class K3(D, A):
    def __repr__(self):
        return "K3"

class Z(K1, K2, K3):
    def __repr__(self):
        return "Z"

z = Z()
print(z)

What's the output of the code above? If you guessed "Z", you got it right!

Now, modify the class Z and remove its __repr__ method, like so:

class Z(K1, K2, K3): pass

z = Z()
print(z)

What's the output now? When we print z, we try to look for a __str__ or __repr__ implementation (read this article to understand why), and the class Z provides none. So, in order to print z, we need to go up the hierarchy and look for a way to print z. Next up in the inheritance hierarchy is K1, so the output is K1.

Modify K1 as well, to remove its __repr__, like so:

class K1(A, B, C): pass

class Z(K1, K2, K3): pass

z = Z()
print(z)

What's the output now?

It should be K2.

And we can keep going over this process of removing one __repr__ at a time, and this shows the hierarchy that Python builds for us, when we have multiple inheritance at stake.

Of course, in order to check the full hierarchy, we could keep doing this by hand, or we could just ask Python for the linearised hierarchy:

>>> Z.__mro__
(<class '__main__.Z'>, <class '__main__.K1'>, <class '__main__.K2'>, <class '__main__.K3'>, <class '__main__.D'>, <class '__main__.A'>, <class '__main__.B'>, <class '__main__.C'>, <class '__main__.E'>, <class '__main__.O'>, <class 'object'>)

This shows that we first look things up on Z, then on K1, then on K2, ...

This linearised hierarchy is built by a straightforward algorithm called the “C3 linearisation algorithm”, that you can find on Wikipedia.

The algorithm isn't that complicated (the description is in the link above) and I might actually implement it in Python in the near future, just for fun.

This came to my attention as I was reading this article that was shared with me on Twitter, in a discussion related to yesterday's TIL article, about the ICPO rule for attribute lookup.

That's it for now! Stay tuned and I'll see you around!

Become a better Python 🐍 developer 🚀

+35 chapters. +400 pages. Hundreds of examples. Over 30,000 readers!

My book “Pydon'ts” teaches you how to write elegant, expressive, and Pythonic code, to help you become a better developer. >>> Download it here 🐍🚀.

Previous Post Next Post

Blog Comments powered by Disqus.