top of page

Python, Object-oriented Programming, and Data Science: An In-Depth Tutorial



Introduction to Operator Overloading and Object Comparison


In the realm of programming, sometimes we need to redefine or "overload" the default behavior of Python operators. The process is known as operator overloading. Think of it as changing the rules of a board game to suit your own fun way of playing. Similarly, in Python, we can change how the addition operator works by overloading it.


Let's take a brief detour to understand object equality. When you're comparing two apples, what does it mean for them to be equal? They might be of the same variety and look alike, but they're two distinct apples. Python applies a similar logic for object comparison.

class Apple:
    pass

apple1 = Apple()
apple2 = Apple()

print(apple1 == apple2)  # Output: False

In this case, apple1 and apple2 are two different objects, although they are instances of the same class.


This leads us to how Python stores and compares objects in memory. Python assigns each object a unique id during its lifetime, used to compare object identities.

print(id(apple1) == id(apple2))  # Output: False

Thus, apple1 and apple2 are unequal, not just because they're different objects, but because they occupy different memory locations.


Understanding Variables as References


Python assigns variables as references to memory. When you create a variable a = 5, you're essentially placing a tag named a on a memory location storing the value 5.

a = 5
b = a

print(id(a) == id(b))  # Output: True

Here, both a and b refer to the same memory location. They are equal not only in value but also in identity.


However, this isn't the same with mutable data structures like NumPy arrays or pandas DataFrames. Python treats these differently in terms of object comparison.

import numpy as np

arr1 = np.array([1, 2, 3])
arr2 = arr1
arr3 = np.array([1, 2, 3])

print(id(arr1) == id(arr2))  # Output: True
print(id(arr1) == id(arr3))  # Output: False

Although arr1 and arr3 have the same values, they're stored in different memory locations and thus are unequal.


Custom Comparison using __eq__ Method


Imagine you're at a car showroom where every car is equal only if their make, model, and color are exactly the same. Now, how do you enforce this rule in your Python "Car" class? You'd do it by customizing object comparison using the __eq__ method. This special method, when defined in a class, allows us to control the equality logic.


Let's build a Car class and implement the __eq__ method.

class Car:
    def __init__(self, make, model, color):
        self.make = make
        self.model = model
        self.color = color

    def __eq__(self, other):
        if isinstance(other, Car):
            return self.make == other.make and self.model == other.model and self.color == other.color
        return False

In the __eq__ method, we're checking if the other object is a Car instance and if its make, model, and color attributes match with our car. If yes, they're equal; otherwise, they're not.


Comparison of Objects with __eq__


Let's create two car objects and compare them.

car1 = Car('Toyota', 'Corolla', 'Red')
car2 = Car('Toyota', 'Corolla', 'Red')

print(car1 == car2)  # Output: True

As we can see, even though car1 and car2 are different objects, our __eq__ method considers them equal because their attributes are the same.

But what happens when the attributes differ?

car3 = Car('Toyota', 'Camry', 'Red')

print(car1 == car3)  # Output: False

car1 and car3 aren't considered equal because the model attribute differs.

In the next part of the tutorial, we'll look at other special methods for comparison operators and the __hash__ method.


Implementing Other Comparison Operators


Apart from __eq__, Python provides other special methods for different comparison operators like __ne__ (not equal), __ge__ (greater than or equal), and __lt__ (less than).


Let's expand our Car class to include a year attribute and implement the __lt__ method for comparing the age of the cars.

class Car:
    def __init__(self, make, model, color, year):
        self.make = make
        self.model = model
        self.color = color
        self.year = year

    def __eq__(self, other):
        if isinstance(other, Car):
            return self.make == other.make and self.model == other.model and self.color == other.color
        return False

    def __lt__(self, other):
        if isinstance(other, Car):
            return self.year < other.year
        return False

Now, we can compare cars based on their age.

car1 = Car('Toyota', 'Corolla', 'Red', 2019)
car2 = Car('Toyota', 'Camry', 'Blue', 2020)

print(car1 < car2)  # Output: True

The car car1 is older than car2, hence the output is True.


Improving Integration of Custom Classes with Python's Operators


Let's talk about another magic method that can improve how our custom classes integrate with Python's built-in functions. Consider what happens when we print our car object.

print(car1)  # Output: <__main__.Car object at 0x102e35320>

Python's default print function isn't very informative for our custom class. Instead, it would be great if we could print the car's details. Python provides the __str__ and __repr__ methods to achieve this.


Difference between __str__ and __repr__ Methods


Python has two special methods that we can use to define how our objects should be represented as strings: __str__ and __repr__.

The __str__ method in Python represents the class objects as an informal or nicely printable string format which can be used for display. The __repr__ method returns a string that describes how to recreate the object. It's meant to be unambiguous and complete. This is useful for debugging and logging.

Consider this analogy: __str__ is like the name you use in casual conversation, while __repr__ is like your full legal name.


Implementation of __str__ Method


Let's define a __str__ method for our Car class that returns a nice, readable string:

class Car:
    # previous methods omitted for brevity

    def __str__(self):
        return f'{self.color} {self.make} {self.model} ({self.year})'

Now, when we print our car object, we'll see a more informative string:

car1 = Car('Toyota', 'Corolla', 'Red', 2019)
print(car1)  # Output: Red Toyota Corolla (2019)

This is much better than the default string representation we had earlier!


Implementation of __repr__ Method


For __repr__, we'll return a string that shows how to create the object:

class Car:
    # previous methods omitted for brevity

    def __repr__(self):
        return f"Car('{self.make}', '{self.model}', '{self.color}', {self.year})"

Now, when we call repr() on our car object, it'll return a string that we could use to recreate the object:

car1 = Car('Toyota', 'Corolla', 'Red', 2019)
print(repr(car1))  # Output: Car('Toyota', 'Corolla', 'Red', 2019)

By defining both __str__ and __repr__, we're making our class much more user-friendly, and this will be particularly helpful when debugging our code.


Introduction to Exceptions


In programming, "exceptions" refer to events that occur during the execution of programs, disrupting the normal flow of the program's instructions. Python handles exceptions using a special construct: the "try/except" statement.


You can think of exceptions as unexpected visitors. Your program may be running smoothly, just like a well-organized party, and then suddenly, an exception (the uninvited guest) shows up and disrupts everything!


Exception Handling


Python has a number of built-in exceptions, such as TypeError, ValueError, and IndexError, among others. These are automatically raised by Python when errors occur.


We can handle exceptions in Python using the try-except block. The syntax is as follows:

try:
    # Code that might raise an exception
except ExceptionType:
    # Code to execute if an exception of type ExceptionType is raised

For example, consider the following code:

try:
    result = 10 / 0  # This will raise a ZeroDivisionError
except ZeroDivisionError:
    print("Oops! You can't divide by zero.")

When run, this code will print "Oops! You can't divide by zero." instead of crashing with a ZeroDivisionError.

We can also handle multiple exceptions by using multiple except blocks:

try:
    # Some code...
except ZeroDivisionError:
    print("You can't divide by zero!")
except TypeError:
    print("Invalid type!")

Raising Custom Exceptions


Sometimes, Python's built-in exceptions aren't specific enough for our needs. In these cases, we can raise our own exceptions using the raise keyword:

if some_condition:
    raise ValueError("A very specific bad thing happened.")

When run, this code will raise a ValueError with the message "A very specific bad thing happened."


Understanding Exceptions as Classes


In Python, exceptions are nothing more than specially designed classes. Just as we create instances of our own custom classes, when an exception is raised, an instance of that exception class is created.


Imagine exceptions as different types of roadblocks. Each roadblock (exception) has its own characteristics (attributes and methods) defined by the type of class it is. For instance, a ValueError roadblock might indicate a wrong turn (an inappropriate value), while a TypeError roadblock might suggest you're on the wrong road entirely (you've used a wrong type for an operation).


These classes are organized in a hierarchy, with the base class BaseException at the top. Underneath BaseException, there are several more specific exception classes such as Exception, ArithmeticError, and BufferError, among others.


This hierarchy allows us to use a less specific exception type in an except clause to catch multiple types of exceptions. For example, the following code catches all exceptions that inherit from Exception:

try:
    # Some code...
except Exception as e:
    print(f"Caught an exception: {e}")

Creating Custom Exceptions


To create a custom exception, we need to define a new class that inherits from one of Python's built-in exception classes, usually Exception:

class MyCustomException(Exception):
    pass

We can then raise our custom exception like any other exception:

raise MyCustomException("This is a custom exception!")

A custom exception can have its own methods and attributes just like any other class. For example, we might want our custom exception to automatically log the error when it's raised:

class MyCustomException(Exception):
    def __init__(self, message):
        super().__init__(message)
        with open("error_log.txt", "a") as log_file:
            log_file.write(message + "\\\\n")

With this custom exception, every time it's raised, the error message is automatically appended to an error log.


Handling Custom Exceptions


Handling custom exceptions is no different from handling built-in ones. We just need to specify our custom exception type in an except clause:

try:
    raise MyCustomException("An error occurred!")
except MyCustomException as e:
    print(f"Caught a custom exception: {e}")

This will print "Caught a custom exception: An error occurred!".

As a final point, keep in mind that you should aim to use the most specific exception type that fits the error. While it's possible to catch all exceptions with except Exception, it's not generally a good idea because it can make debugging more difficult by masking other errors.


Conclusion


After this deep dive into Python, object-oriented programming, and data science, we now understand the importance and application of operator overloading, which can make our custom classes behave in a way that's intuitive and in line with Python's built-in classes.


We've examined how Python's memory management works, and how variables are references to objects in memory, influencing how comparison operations work. We learned how to customize object comparison via the __eq__ method and other special methods, and how to control the textual representation of our objects with __str__ and __repr__.


In the second part of our tutorial, we introduced the concept of exceptions, explaining how they're used in Python to handle errors and unexpected events during the execution of a program. We also learned how to define and use custom exceptions, providing us with a powerful tool to make our code more robust and easier to debug.


Through understanding and applying these concepts, you'll become a more effective and productive Python programmer, capable of writing code that's not only functional but also clear, efficient, and robust. So, the next time you're faced with a complex problem, remember these lessons and use them to your advantage. Happy coding!


Thank you for joining us on this journey through Python and data science. If you have any questions or suggestions, feel free to share. Keep practicing and exploring new challenges!

bottom of page