Paul Apivat

Posted on Oct 31, 2020 • Edited on Dec 16, 2020 • Originally published at paulapivat.com

Understanding Object-Oriented Programming with Assert

#datascience #machinelearning #python

Assert

Automated Testing and Assert

One of the many cool things about Data Science from Scratch (by Joel Grus) is his use of assertions as a way to "test" code. This is a software engineering practice (see test-driven development) that may not be as pervasive in data science, but I suspect, will see growth in usage and will soon become best practice, if we're not already there.

While there are testing frameworks that deserve their own chapters, throughout this book, fortunately the author has provided a simple way to test by way of the assert key word, here's an example:

# create function to return the largest value in a list
def largest_item(x):
    return max(x)

# assert that our function is working properly
# we will see 'nothing' if things are working properly
assert largest_item([10, 20, 5, 40, 99]) == 99

# an AssertionError will pop up if any other value is used
assert largest_item([10, 20, 5, 40, 99]) == 40
---------------------------------------------------------------------------
AssertionError                            Traceback (most recent call last)
<ipython-input-21-12dc291d091e> in <module>
----> 1 assert largest_item([10, 20, 5, 40, 99]) == 40

# we can also create an assertion for input values
def largest_item(x):
    assert x, "empty list has no largest value"
    return max(x)

Object-Oriented_Programming

Object-oriented programming could be it's own chapter, so we won't try to shoot for comprehensiveness here. Instead, we'll try to understand it's basics and the assert function is going to help us understand it even better.

First, we'll create a class CountingClicker that initializes at count 0, has several methods including a click method to increment the count, a read method to read the present number of count and a reset method to reset the count back to 0.

Then we'll write some assert statements to test that our class method is working as intended.

You'll note that there are private methods and public methods. Private methods have the double underscore (aka dunder methods), they're generally not called, but python won't stop you. Then we have the more familiar public methods. Also, all the methods have to be written within the scope of the class CountingClicker.

class CountingClicker:
    """A class can/should have a docstring, just like a function"""
    def __init__(self, count = 0):
        self.count = count
    def __repr__(self):
        return f"CountingClicker(count = {self.count})"
    def click(self, num_times = 1):
        """Click the clicker some number of times."""
        self.count += num_times
    def read(self):
        return self.count
    def reset(self):
        self.count = 0

After we've written the class and associated methods, we can write assert statements to test them. You'll want to write the below statements in this order because we're testing the behavior of our CountingClicker class.

clicker = CountingClicker()

assert clicker.read() == 0, "clicker should start with count 0"

clicker.click()

clicker.click()

assert clicker.read() == 2, "after two clicks, clicker should have count of 2"

clicker.reset()

assert clicker.read() == 0, "after reset, clicker should be back to 0"

In summary, we created a class CountingClicker whose methods allow it to display in text (__repr__), click, read and reset.

All these methods belong to the class CountingClicker and will be passed along to new instances of classes - we have yet to see what this will look like as it relates to tasks in data science so we'll revisit this post when we have updates on the applied end.

Then, we tested our class CountingClicker with various assert statements to see if it behaves as intended.

This is one section of a quick python crash course as conveyed from Data Science from Scratch (by Joel Grus), you may be interested in these other posts on: