top of page

Python Package Development and Structuring


In this tutorial, we'll take an in-depth look into Python package development, understand the critical aspects, and learn about implementing Python package structure. Our journey will involve learning about scripts, modules, and packages, understanding the directory structure of a Python package, and delving into the importance of documentation. We'll then proceed to explore Python package imports, learning about internal imports, importing between sibling and distant modules, and creating shortcuts for accessing functions.


I. Introduction to Python Packages


A. Importance of Building Packages


Python packages are crucial because they help encapsulate code and allow its reuse across different projects, which fosters DRY (Don't Repeat Yourself) principle. They help manage and organize Python projects on a large scale by providing a hierarchical directory structure.

# A simple example of a package might look like this:
import math

# Here, we're using the "sqrt" function from the "math" package.
print(math.sqrt(16))


Output:


4.0


B. Applications of Python Packages


Python packages find their use in many applications, such as web development, data science, machine learning, game development, and much more. These packages save development time by providing pre-defined solutions for complex tasks.

# A practical example might be using "pandas" in data analysis.
import pandas as pd

# Create a simple dataframe
df = pd.DataFrame({
    'A': [1, 2, 3],
    'B': [4, 5, 6]
})

print(df)

Output:

   A  B
0  1  4
1  2  5
2  3  6


C. Benefits of Packaging Code


Packaging code enhances readability and maintainability, making it more understandable to other developers. It also encourages modular programming, making debugging easier and improving overall code quality.


II. Understanding Python Scripts, Modules, and Packages


A. Definition of a Script


A Python script is a collection of commands in a file designed to be executed like a program. These commands are written in Python language. A script is typically a sequence of instructions that is interpreted and executed one by one by the Python interpreter.

# An example script could be a simple print statement
print("Hello, world!")


Output:

Hello, world!


B. Definition of a Package and Subpackage


A Python package is a way of organizing related modules into a directory hierarchy. Essentially, it is a directory that contains multiple module files and a special __init__.py file. A subpackage is simply a package that exists inside another package.

# Let's assume we have a package structure like this:

# my_package/
#    __init__.py
#    module_a.py
#    subpackage/
#       __init__.py
#       module_b.py


C. Definition of a Module


A module is a file containing Python definitions and statements. The file name is

the module name with the suffix .py.

# An example of a simple Python module could be a file "module_a.py" with the following content:

def greet():
    print("Hello, world!")

# You can use the module by importing it.
import module_a

module_a.greet()


Output:

Hello, world!


D. Explanation of the Term 'Library'


In Python, a library is a collection of modules, but the terms are often used interchangeably. Libraries provide reusable code for common programming tasks such as connecting to a database, reading and writing files, and handling dates and time.

# An example of a library could be the "datetime" library, which provides functions for handling dates and times.

import datetime

print(datetime.datetime.now())


Output:

2023-08-03 12:34:56.789012 (the output will be the current date and time)


III. Directory Structure of a Python Package


A. Basic Directory Structure of a Simple Python Package


Python packages are arranged in a hierarchical directory structure. In its simplest form, a Python package looks like this:

my_package/
    __init__.py
    module.py


In this structure, my_package is the package directory, __init__.py is a special Python file that treats the directory as a Python package, and module.py is a module in the package.


B. Role of __init__.py and module.py in a Package


The __init__.py file serves a special purpose in a Python package. It's responsible for making Python treat the directories as containing packages. In the simplest case, it can just be an empty file.


The module.py file is where you put the functions and classes you want to be included in your module.

# For example, let's say the contents of "module.py" are:
def greet():
    print("Hello, World!")

# And "__init__.py" is an empty file.
# We can import the function from the module like this:
from my_package import module

module.greet()


Output:

Hello, World!


C. Concept of Subpackages and Their Organization


Subpackages are packages that exist within another package. They further help in the organization of code in larger Python projects. Their directory structure follows a similar format to that of regular packages.

my_package/
    __init__.py
    module.py
    subpackage/
        __init__.py
        submodule.py

In this case, subpackage is a subpackage inside my_package, and submodule.py is a module inside subpackage.


IV. Documentation in Python Packages


A. Importance of Documentation


Documentation is vital in Python packages as it describes what the package does, its functions, classes, and modules, and how they are to be used. Good documentation makes your package easier for others to use and contribute to.


B. Writing Documentation for Functions, Classes, and Methods


Documentation in Python is added in the form of comments and docstrings. Docstrings are enclosed in triple quotes and are placed immediately after the definition of a function, class, or method.

def add_numbers(a, b):
    """
    This function adds two numbers and returns the result.

    Parameters:
    a (int): The first number.
    b (int): The second number.

    Returns:
    int: The sum of the two numbers.
    """
    return a + b

print(add_numbers(5, 3))


Output:

8


C. Role of Documentation in Code Usability


Documentation plays a significant role in code usability. A well-documented code can be easily understood, updated, and maintained, not only by the author but also by other programmers who may have to work with the code in the future.


V. Documenting Python Functions


A. Positioning of Documentation within Functions


When documenting functions, the position of the docstring is key. The docstring is placed immediately below the function definition and is enclosed in triple quotes. Here's an example:

def multiply_numbers(a, b):
    """
    This function multiplies two numbers and returns the result.

    Parameters:
    a (int): The first number.
    b (int): The second number.

    Returns:
    int: The product of the two numbers.
    """
    return a * b

print(multiply_numbers(5, 3))


Output:

15


B. How to Write Summary for Documentation


The first line of the docstring is a brief summary of the function's purpose. It should be a concise explanation that answers "what does this function do?"

For our multiply_numbers function, the summary is "This function multiplies two numbers and returns the result." This summary immediately lets a reader know what to expect from the function.


C. Additional Information in Documentation


After the summary, we include more detailed information such as the parameters the function accepts and what it returns. Each parameter is mentioned along with its expected data type and a brief description of its role in the function.

In our multiply_numbers function, we have two parameters a and b, both of which are integers and represent the numbers to be multiplied. The return value is also an integer, which is the product of a and b.


VI. Documentation Styles


A. Importance of Choosing a Documentation Style


Choosing a consistent documentation style is crucial. It ensures that your code is easy to read and understand. A consistent style in your project helps other developers, including your future self, understand and maintain it.


B. Consistency in Documentation Styles


Once you choose a style, it's important to apply it consistently throughout your project. This makes your codebase uniform and predictable, thus improving its readability.


C. Overview of NumPy Documentation Style


One of the commonly used documentation styles in Python, especially for projects involving scientific computing, is the NumPy documentation style. It's known for its clarity and is widely adopted in the Python community.

def numpy_style_docstring(param1, param2):
    """
    Example function with types documented in the docstring.

    Parameters
    ----------
    param1 : int
        The first parameter.
    param2 : str
        The second parameter.

    Returns
    -------
    bool
        True if successful, False otherwise.
    """


D. Other Potential Styles


Other than NumPy, there are different styles like Google style and Sphinx style. Each comes with its own syntax and format. The choice of style depends on your or your team's preferences.


VII. Pyment for Documentation Generation and

Translation


A. Use of Pyment for Generating Template Docstrings


Pyment is a Python tool that can be used to generate or convert docstrings in your Python files. It's helpful when you want to generate template docstrings for your existing code.

# Install Pyment
pip install pyment

# Generate or convert docstrings
pyment myfile.py


B. Use of Pyment for Translating between Documentation Styles


One of the advantages of Pyment is the ability to translate between different docstring styles. This is particularly useful when you want to migrate your project to a different documentation style.


C. Creation and Filling of Documentation Templates


With Pyment, you can generate a template docstring for every method, function, class, or module that lacks a docstring. After generating, you can then fill the templates with the necessary information.


VIII. Documentation at Different Levels


Documentation should be present at all levels of your code, from the highest (packages) to the lowest (functions or methods). Here's how to go about it:


A. Documentation for Modules, Packages, and Subpackages


At the package and subpackage level, the documentation should provide a high-level overview of what the package or subpackage does. This is usually written in the __init__.py file of the package or subpackage.

For a module, the documentation should outline the purpose of the module and provide a brief explanation of the classes and functions it contains. This typically goes at the top of the module file, before any code.

"""
my_module.py

This module contains a function for multiplying two numbers.
"""

def multiply_numbers(a, b):
    """
    This function multiplies two numbers and returns the result.

    Parameters:
    a (int): The first number.
    b (int): The second number.

    Returns:
    int: The product of the two numbers.
    """
    return a * b


B. Placement and Role of Package, Subpackage, and Module Documentation


The placement of the documentation is important for it to serve its purpose. As mentioned earlier, module-level docstrings are placed at the top of the module file, while function or method level docstrings are positioned immediately below their respective definitions.

Package or subpackage level docstrings go in the __init__.py file. This is the first file that gets executed when a package or subpackage is imported, thus, placing the docstring here makes it readily accessible.

The role of these docstrings is to guide the user (and the developer) about the function, role, and usage of the module, function, method, or package. This makes code maintenance and usage easier and more efficient.


Python Package Structuring and Import System


This section of the tutorial will delve into the intricacies of how Python packages are structured, and how the import system works. Here, we will explore topics like implementing internal imports, importing modules, setting up shortcuts, and more. So, without further ado, let's delve in.


I. Introduction to Package Imports


A. Role of Imports in Python Packages


In Python, the term "import" is used when we use code written in one module into another. Modules are simply Python files that contain a collection of functions and global variables defined by a developer.

For example, consider a module named math_operations.py that contains a function to add numbers.

# math_operations.py

def add_numbers(a, b):
    return a + b

You can use this function in another Python script or module using the import statement as follows:

# main.py

import math_operations

print(math_operations.add_numbers(3, 4))

Output:

7


B. Current Status of Package Without Internal Imports


Without internal imports, accessing subpackages and modules within your package can get very verbose. You might have to provide the full path to the function or class you wish to use, which can become quite tedious.

Consider a package named my_package with the following structure:

my_package/
    __init__.py
    math_operations.py
    string_operations.py

If you wanted to use the add_numbers function from math_operations.py in another script outside of my_package, you would have to import it as follows:

from my_package.math_operations import add_numbers

print(add_numbers(3, 4))


C. How to Access Subpackages and Modules Without Internal Imports


While this works perfectly fine, it can become quite verbose when the package structure gets more complicated. That's where internal imports come into play.


Python Package Structuring and Import System


This section of the tutorial will delve into the intricacies of how Python packages are structured, and how the import system works. Here, we will explore topics like implementing internal imports, importing modules, setting up shortcuts, and more. So, without further ado, let's delve in.


I. Introduction to Package Imports


A. Role of Imports in Python Packages


In Python, the term "import" is used when we use code written in one module into another. Modules are simply Python files that contain a collection of functions and global variables defined by a developer.

For example, consider a module named math_operations.py that contains a function to add numbers.

# math_operations.py

def add_numbers(a, b):
    return a + b

You can use this function in another Python script or module using the import statement as follows:

# main.py

import math_operations

print(math_operations.add_numbers(3, 4))

Output:

7


B. Current Status of Package Without Internal Imports


Without internal imports, accessing subpackages and modules within your

package can get very verbose. You might have to provide the full path to the function or class you wish to use, which can become quite tedious.

Consider a package named my_package with the following structure:

my_package/
    __init__.py
    math_operations.py
    string_operations.py

If you wanted to use the add_numbers function from math_operations.py in another script outside of my_package, you would have to import it as follows:

from my_package.math_operations import add_numbers

print(add_numbers(3, 4))


C. How to Access Subpackages and Modules Without Internal Imports


While this works perfectly fine, it can become quite verbose when the package structure gets more complicated. That's where internal imports come into play.


II. Implementing Internal Imports


Understanding how to implement internal imports is key in developing Python packages. It ensures that our code remains clean, manageable, and easy to understand.


A. Concept of Importing Subpackages into Packages


Suppose we have a package named my_package, with a subpackage named my_subpackage. Inside my_subpackage, we have a module named module1 with a function function1. If we want to import function1 into our main package, we could do it as follows:

# Inside my_package/__init__.py

from .my_subpackage.module1 import function1

The . before my_subpackage indicates that it's a relative import - we're importing from a package that is relative to the current one.


B. Difference between Absolute and Relative Imports


Python offers two types of imports - absolute imports and relative imports. Absolute imports involve importing a resource using its full path from the project's root folder.

For example, let's consider a project structure as follows:

my_project/
    my_package/
        __init__.py
        my_subpackage/
            __init__.py
            module1.py

If we wanted to import function1 from module1.py into my_package, an absolute import would look like this:

# Inside my_package/__init__.py

from my_project.my_package.my_subpackage.module1 import function1

On the other hand, relative imports involve importing a resource using its relative path to the current module. The same import as above, but as a relative import would look like this:

# Inside my_package/__init__.py

from .my_subpackage.module1 import function1

The . before my_subpackage indicates that my_subpackage is in the same directory as the current module, which in this case is my_package.


C. Preference for Absolute Imports and Conditions for Using Relative Imports


As a rule of thumb, absolute imports are clearer and easier to understand. They give the full context of what is being imported and where it's coming from. However, in certain cases, using relative imports may be more convenient.

When a package is deeply nested, using absolute imports can become verbose. Relative imports are a useful shortcut in such cases. They also make it easier to change the structure of large projects, since you don't need to update every single import statement when you move files around. However, they can be harder to read since it's less clear what is being imported. As with many things in programming, it's a tradeoff.


III. Importing Modules


Once we understand the concept of importing packages, the next step is to understand how to import modules into packages.


A. How to Import Modules into Subpackages


Suppose we have a subpackage my_subpackage inside our main package my_package. Inside my_subpackage, we have two modules module1 and module2. If we want to import module2 into module1, we could do it like this:

# Inside my_package/my_subpackage/module1.py

from . import module2

This code allows us to use functions and classes from module2 inside module1.


B. Consequences of Not Importing Modules


Importing modules when necessary is crucial for several reasons:

  1. Code Reusability: If we have a function or a class defined in one module and we need to use it in another, we don't have to write the same code again. We can simply import it.

  2. Code Organization: By importing modules where needed, we ensure that our code is better organized and easier to maintain.

  3. Code Readability: Well-organized code is easier to understand and read. When we import only the modules we need, it becomes clear which parts of our program interact with each other.

If we don't import a module where it's needed, Python will raise an error when we try to use any functions or classes from that module. This is because Python doesn't know where those functions or classes are defined.


IV. Setting Up Shortcuts for Accessing Functions


Shortcuts in Python packages are often set up for easier accessibility and convenience. They allow for more intuitive and straightforward calls to functions or classes within a package.


A. Reason for Setting Up Shortcuts


Setting up shortcuts essentially means making certain functions or classes available directly from the package or subpackage level.

For instance, without a shortcut, if we had a function my_func in module1 which is part of my_subpackage in my_package, the function would be called as follows:

from my_package.my_subpackage.module1 import my_func
my_func()

However, with a shortcut, we could simplify this to:

from my_package.my_subpackage import my_func
my_func()


B. Process of Importing Functions into Subpackages


To create such shortcuts, we need to import the desired functions or classes into the __init__.py file of our package or subpackage.

Let's import my_func from module1 into my_subpackage:

# Inside my_package/my_subpackage/__init__.py

from .module1 import my_func

Now, my_func can be directly imported from my_subpackage.


V. Importing Between Sibling Modules


Importing functions between sibling modules refers to the process where a function from one module is made available in another module within the same package or subpackage.


A. Conditions for Splitting Modules into Multiple Files


Before we go into importing, it's crucial to understand when we might need to split modules into multiple files. In a complex system, it's common to have modules that contain a lot of functions or classes.

As your codebase grows, a large module can become unwieldy and hard to maintain. To enhance code readability and maintainability, you might decide to split this module into smaller, more manageable modules (siblings), each with its own specific responsibility. For instance, you might have math_operations module that you decide to split into basic_operations and advanced_operations.


B. Importing Functions Between Sibling Modules


Once you have multiple modules, it's common that a function or class in one module might need to use a function or class in another module.

Let's illustrate this with an example where we have two sibling modules basic_operations and advanced_operations. We have a function add in basic_operations that we want to use in advanced_operations:

# Inside basic_operations.py
def add(a, b):
    return a + b

We want to use add function in advanced_operations.py to implement a function mean that calculates the average of two numbers. We can import add as follows:

# Inside advanced_operations.py
from .basic_operations import add

def mean(a, b):
    return add(a, b) / 2


VI. Importing Across Distant Modules


At times, you might need to import functions or classes from modules that are not siblings, but are housed within the same package or different packages altogether. This is what we refer to as importing across distant modules.


A. Circumstances for Importing Across Distant Modules


The need to import across distant modules may arise in cases where you have functionalities spread across different modules that need to interact with each other. For instance, you may have a utility function housed in a utilities module that's used in several other modules within your package.

To illustrate, let's say we have a utilities module at the root of our package, and we have an advanced_operations module in a subpackage called operations. The structure looks like this:

my_package/
    __init__.py
    utilities.py
    operations/
        __init__.py
        advanced_operations.py


B. Use of Absolute and Relative Imports for Importing Across Distant Modules


Importing across distant modules can be done using either absolute or relative imports.

An absolute import specifies the complete path (from the root of the package) to the module that contains the function or class you wish to import. Continuing with our example, if we want to use a function called log from utilities.py in advanced_operations.py, we would do the following:

# Inside advanced_operations.py
from my_package.utilities import log

def complex_operation(a, b):
    log("Starting operation...")
    # (Rest of function implementation)

Relative imports can also be used here, but keep in mind that they may become more complicated and less readable as the distance between modules increases. A relative import from advanced_operations.py to utilities.py would look like this:

# Inside advanced_operations.py
from ..utilities import log

def complex_operation(a, b):
    log("Starting operation...")
    # (Rest of function implementation)

In this case, .. means "go up one level". If you had to go up multiple levels, you would add more periods, e.g., from ...utilities import log.

As a rule of thumb, it's often best to use absolute imports for clarity and ease of understanding, particularly when dealing with distant modules. However, there may be cases where relative imports are more suitable, depending on the specific structure and requirements of your package.


VII. Cheat Sheet for Relative Imports


Relative imports in Python can sometimes be tricky to handle, especially for beginners. This section provides a cheat sheet and some general guidelines to make using them easier.


A. Guidelines for Using Relative Imports in Python Packages

  1. Level Indicators: Remember, a single period (.) before the import statement refers to the current directory, two periods (..) refer to the parent directory, and so on. This is just like file paths in Unix-based systems.

  2. Importing Sibling Modules: If you're importing a sibling module, use a single period. For example, from .sibling_module import function_name.

  3. Importing from Parent Directory: If you're importing from a parent directory, use two periods. For example, from ..parent_module import function_name.

  4. Importing from Grandparent Directory: If you're importing from a grandparent directory, use three periods. For example, from ...grandparent_module import function_name.

  5. Usage: Relative imports are best used when the exact location of the imported module in the filesystem is unknown or can change. They make your code more flexible in the face of structural changes in your project.

  6. Readability: Be careful when using relative imports across distant modules; this can make your code less readable and more difficult to understand.

Here's a handy visual representation:

my_package/
    __init__.py
    module1.py
    subpackage1/
        __init__.py
        module2.py
    subpackage2/
        __init__.py
        module3.py

  • If module3.py wants to import a function from module2.py (sibling module in a different subpackage), you would write: from ..subpackage1.module2 import function_name.

  • If module2.py wants to import a function from module1.py (a module in the parent directory), you would write: from ..module1 import function_name.

That brings us to the end of our tutorial on Python Package Development and the Python Import System. I hope you've found it insightful and helpful.

In this tutorial, we've learned about the importance of Python packages, understood the difference between scripts, modules, and packages, and explored the directory structure of a package. We've also delved into the importance of documentation, explored the nuances of Python's import system, and understood how to implement internal imports and structure our package accordingly.

Remember, creating a well-structured, well-documented Python package with a good system of internal imports can greatly improve code reuse and readability, making your code easier to maintain and understand. As always, happy coding!

bottom of page