top of page

Mastering Documentation and Testing in Python



I. Understanding Documentation in Python


A. Importance of Documentation


Documentation is the blueprint of your code. Just like a map guides a traveler through unknown territory, documentation guides users and future collaborators through your code. It provides clarity on what your code does, how it operates, and how different pieces of your code interact with each other. Documentation also aids in maintaining and debugging the code in the future.


B. Comments in Python


1. Role and use of comments in Python


In Python, comments are created by using the hash (#) symbol before the comment text. They provide short descriptions or annotations for the code segment that follows. For instance:

# This is a comment
print("Hello, World!")

When Python encounters a # symbol, it ignores everything after it on that line.


2. Inline documentation for future collaborators


You can also use comments to leave notes for your future self or collaborators who might work on your code. It's like leaving breadcrumbs in a forest – they guide your way back when you return to your code after some time. Let's say you wrote a piece of code that calculates the Fibonacci sequence:

# Calculating the Fibonacci sequence up to n terms
def fibonacci(n):
    a, b = 0, 1
    for _ in range(n):
        yield a
        a, b = b, a + b


3. Importance of comments for code readability


Comments greatly enhance the readability of your code. They serve as signposts that help navigate complex pieces of code.


C. Effective Commenting


1. Debate on the usefulness of comments


Some argue that comments are a waste of time, that "good code" should be self-

explanatory. However, the reality of coding is more complex. There are moments when the logic isn't straightforward or when business requirements are obscure. In such cases, comments are essential.


2. Importance of knowing the audience


Understanding your audience is key. If your code is used by beginners, then comments explaining the intricacies of your code are beneficial. On the other hand, if your code is intended for experienced developers, then comments on obvious code sections might be unnecessary.


3. Rule of explaining the 'why' not the 'what' of code


Comments should typically explain 'why' something is being done, not 'what' is

being done. The 'what' should be evident from the code itself. If it isn't, then you might want to refactor your code to make it more self-explanatory.


4. Impact of over-commented and under-commented code


Like all good things, moderation is key when it comes to commenting. Too few comments and your code is cryptic; too many and your code is cluttered. Striking a balance is crucial. Always remember, the main focus should be on writing clean, readable code. Comments should supplement your code, not compensate for its shortcomings.


II. Exploring Docstrings


A. Difference between comments and docstrings


While comments are used for inline descriptions, Python has a built-in feature for multi-line comments or documentation, called "docstrings". You might wonder, "Aren't comments enough?" Well, docstrings serve a different purpose. They're associated with Python modules, functions, classes, and methods, and are accessible at runtime.


B. Role of docstrings in Python


1. Output when a user calls help on functions and classes


Docstrings provide a handy way for users to understand how to use different parts of your code. They show up when a user calls the help() function on your functions or classes. For example:

def add(a, b):
    """
    Adds two numbers together.

    Args:
        a (int): The first number.
        b (int): The second number.

    Returns:
        The sum of the two numbers.
    """
    return a + b

help(add)

The output would be:

Help on function add in module __main__:

add(a:int, b:int)
    Adds two numbers together.

    Args:
        a (int): The first number.
        b (int): The second number.

    Returns:
        The sum of the two numbers.


C. Anatomy of a Docstring


1. Documenting the functionality


The first line of your docstring should be a brief summary of the function's purpose. For example, "Adds two numbers together."


2. Documenting parameters and return value


Then, you should explain what the parameters are, their types, and what the function returns.


3. Documenting example usage and expected output


It's also good practice to include an example of how to use the function and what to expect when you do. This isn't always necessary but can be very helpful.


D. Example Docstring and its Output


1. Detailed explanation of a filled out docstring


Let's have a look at a detailed docstring for a function that calculates the nth Fibonacci number:

def fibonacci(n):
    """
    Calculate the nth Fibonacci number.

    Args:
        n (int): The position of the Fibonacci number to calculate.

    Returns:
        int: The nth Fibonacci number.

    Raises:
        ValueError: If n is not a positive integer.

    Example:
        >>> fibonacci(10)
        55
    """
    if not isinstance(n, int) or n <= 0:
        raise ValueError("n must be a positive integer.")
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a


2. The utility of a docstring for users


Calling help(fibonacci) will output a helpful description, including the purpose of the function, its arguments, return value, any exceptions it might raise, and an example of how to use it. Docstrings are a powerful tool for making your code more user-friendly.


III. Code Readability


A. Importance of Readable Code


Readable code is easily understood by others, making it easier to debug, maintain, and update. It's a hallmark of good coding practice and it can save countless hours for you and your team.


B. The Zen of Python


1. Python's focus on readability


Python emphasizes the importance of readable code in its guiding principles, often referred to as "The Zen of Python". One of the most famous lines is, "Readability counts." It promotes the idea that code is read more often than it is written, making readability a top priority.


2. Guidelines for good code in Python


The Zen of Python also gives several other guidelines such as "Explicit is better than implicit", suggesting code should be straightforward and easy to understand. Another one is, "Simple is better than complex", encouraging developers to write simple, clear code.


C. Descriptive Naming


1. Role of good naming in code readability


Good naming is an important aspect of readability. Variable and function names should be descriptive and concise, indicating their role in the program. Let's consider the following example:

def p(x, y):
    return x ** y

This function performs exponentiation but it's not clear from the function name or variable names. A more readable version would be:

def power(base, exponent):
    return base ** exponent


2. Concept of self-documenting code


The above example demonstrates the idea of self-documenting code. The purpose of the code is evident from its structure and naming, reducing the need for separate comments or documentation.


3. Potential pitfalls of over-descriptive names


However, there's a caveat. Names should not be too verbose that they become cumbersome. Finding the right balance is key. For instance, calculateSumOfListOfNumbers could be simplified to sumNumbers.


D. Simplifying Code


1. Importance of simple, maintainable units of code


Another principle to adhere to for enhancing readability is to write simple, maintainable units of code. Large functions or classes can be hard to follow. Break them into smaller, manageable parts each with a clear purpose.


2. Example of a complex code structure


Consider a function that calculates the sum of squares of numbers in a list:

def sumOfSquares(lst):
    total = 0
    for i in range(len(lst)):
        total += lst[i] ** 2
    return total


3. Refactoring for improved readability and functionality


This function can be simplified and made more Pythonic with list comprehension:

def sumOfSquares(lst):
    return sum(i**2 for i in lst)

The refactored function is easier to read and understand. It demonstrates Python's focus on readability and simplicity.


IV. Importance of Testing


A. The Role of Testing in Code Verification


Testing is a crucial part of software development. It's the process of checking your code to ensure it behaves as expected under a variety of conditions. It can catch bugs before your code is deployed or used by others.


B. Benefits of Testing


Testing increases the reliability of your code and provides assurance that changes or additions haven't broken existing functionality. It also simplifies the process of adding new features, as you can confirm that your code is still working as expected after the update.


C. Types of Testing in Python: Doctest and Pytest


Python has built-in modules for testing - doctest and unittest. Another widely used testing library is pytest. Doctest uses examples in docstrings while pytest allows for more comprehensive and complex tests.


1. Using Doctest


a. Working with docstring examples


You can add examples in your docstrings that serve as tests. These examples should demonstrate the expected output of your function for a particular input.

def add(a, b):
    """
    Adds two numbers together.

    Args:
        a (int): The first number.
        b (int): The second number.

    Returns:
        The sum of the two numbers.

    Example:
    >>> add(1, 2)
    3
    """
    return a + b


b. Running doctest


To run doctest, add the following lines at the end of your script:

if __name__ == "__main__":
    import doctest
    doctest.testmod()


2. Using Pytest


a. Recommended pytest structure


In pytest, tests are functions that start with test_. Let's write a test for our add

function:

def test_add():
    assert add(1, 2) == 3


b. Writing unit tests


This is a simple unit test that checks if the add function works correctly for the input (1, 2).


c. Running pytest and understanding its output


To run pytest, simply call pytest from the command line in the directory containing your test file. The output will tell you how many tests passed or failed.


V. Documentation and Testing in Practice


A. Documenting Projects with Sphinx


1. Transformation of docstrings into documentation


When working on larger projects, transforming docstrings into a full-fledged documentation is beneficial. This is where Sphinx comes in. Sphinx is a tool that generates beautiful, detailed documentation from docstrings and markdown files.

Consider a function in a Python file:

def multiply(a, b):
    """
    Multiplies two numbers together.

    Args:
        a (int): The first number.
        b (int): The second number.

    Returns:
        The product of the two numbers.
    """
    return a * b


2. Benefits of Sphinx


Using Sphinx, the docstring above can be transformed into a webpage with sections for the function's purpose, arguments, and return value.

Sphinx also supports cross-referencing between different parts of the documentation, syntax highlighting for code snippets, and integration with version control systems.


3. Documenting Classes


Sphinx works seamlessly with classes, modules, and packages. Consider a Python class:

class Calculator:
    """
    A simple calculator class.
    """

    def add(self, a, b):
        """
        Adds two numbers together.

        Args:
            a (int): The first number.
            b (int): The second number.

        Returns:
            The sum of the two numbers.
        """
        return a + b

Sphinx will generate a page for the Calculator class with a separate subsection for the add method.


B. Continuous Integration Testing


1. Role of Continuous Integration Testing in code testing


Continuous Integration (CI) is a practice in software development where developers integrate code into a shared repository frequently, preferably several times a day. Each integration is verified by an automated build and automated tests to detect integration errors as quickly as possible.


2. Travis CI and its functionalities


Travis CI is a popular tool for continuous integration. It can be configured to run your tests every time you push changes to your online repository.

In Python projects, you could set Travis CI to run pytest every time you push. This way, you can catch and fix errors quickly, ensuring the integrity of your code base.

Here's an example of a basic .travis.yml file for a Python project:

language: python
python:
  - "3.8"
install:
  - pip install -r requirements.txt
script:
  - pytest

This tells Travis CI to set up a Python 3.8 environment, install the dependencies listed in requirements.txt, and run pytest.


Conclusion


Python offers a range of tools and practices to produce well-documented, readable, and tested code. Making use of comments, docstrings, testing tools like doctest and pytest, documentation tools like Sphinx, and continuous integration with Travis CI can significantly enhance the quality and maintainability of your code. As you work on Python projects, big or small, always keep in mind the importance of documentation, readability, and testing. They might seem tedious at first, but the payoff in terms of code reliability, maintainability, and collaboration is definitely worth the effort. Happy coding!

bottom of page