Skip to content

Delve 10: Let's Build a Modern ML Microservice Application - Part 4, Configuration

Banner

"The measure of intelligence is the ability to change." - Albert Einstein

ML Microservices, The Great Env-scape

Hello data delvers! In part three of this series we refactored our application into three separate layers, allowing us to better separate concerns within our codebase. However, if we examine certain parts of our code we can still observe some brittleness:

src/main.py
from fastapi import FastAPI, HTTPException

from provider.met_provider import MetProvider
from service.search_service import SearchService

app = FastAPI()
search_service = SearchService(MetProvider('https://collectionapi.metmuseum.org'))


@app.get('/api/search')
def search(title: str) -> str:
    """Executes a search against the Metropolitan Museum of Art API and returns the url of the primary image of the first search result.

    Args:
        title: The title of the work you wish to search for.

    Returns:
        The url of the primary image of the first search result or 'No results found.' if no search results are found.
    """

    try:
        search_result = search_service.search_by_title(title)
        return search_result.primary_image
    except ValueError:
        raise HTTPException(status_code=404, detail='No results found.')

The problem occurs on line 7, right now we are hardcoding a link to https://collectionapi.metmuseum.org. However, what if this link changes? We would have to change our code! What's the big deal you may ask? I can just update my code when needed right? While yes you could, the problem comes if we want to deploy different versions of our code each with a different url. This can happen in a few different scenarios:

Scenario 1:

Imagine we want to load balance our application. In this setup we have one copy of our application deployed on servers on the east coast and another copy deployed on servers on the west. In addition, the search API has hypothetically deployed two different copies of itself, one in east and one it west each behind two different urls:

  • https://collectionapi-east.metmuseum.org
  • https://collectionapi-west.metmuseum.org

We want to deploy the same code to each region but change the url it is pointing to.

Scenario 2:

Perhaps more common, we are following a Deployment Environment pattern in our codebase and we have 3 separate environments:

  • DEV - Where we deploy changes we are actively working on
  • QA - Where we test that our code is behaving as expected
  • PROD - Where consumers of our application directly interact with it

We'd like to only hit the "real" search service in our PROD environment and hit dummy urls in our lower environments to make sure we don't overwhelm the real search service with our tests.

Note

There are other variations of this pattern with more environments, for this example I'm choosing to use a relatively simple setup with only 3.

There are other scenarios where the url can change but I find these two to be the most frequent and for this delve I'd like to hone in on the second scenario as the deployment environment pattern is extremely common in practice. If you haven't encountered it before I encourage you to read up on it, every enterprise application I've ever worked on has used some variation of the pattern without fail, it is that ubiquitous. So how can we implement this pattern in our codebase?

(Con)figure it Out!

One simple way to handle this is to introduce a configuration file to our application that could hold the url per environment. We could then set a flag (such as an environment variable) the lets the application know which environment it is running in so it can choose which setting to read. To start we need to choose how we define our config file. A few options to choose from that I've seen before include:

  • .env - Probably one of the simplest formats out there, simply stores key value pairs
  • JSON - Just like with our APIs, we can use JSON to specify our app configuration, though it is a bit verbose
  • HOCON - A superset of JSON designed to be more human readable, though not as common in Python projects
  • YAML - Another superset of JSON with additional capabilities, fairly common
  • TOML - A simplified format somewhere between .env and YAML, starting to become more popular in Python projects, especially with the advent of the pyproject.toml standard

You could be successful using any of these formats but I'm going to go with YAML as I like that it is not as verbose as JSON but provides support for more complex structures than TOML.

To begin, let's create a new file to hold our configuration:

src/config/shared/config/config.yaml
1
2
3
4
5
6
7
8
dev: 
  met_api_url: https://collectionapi-dummy.metmuseum.org

qa:
  met_api_url: https://collectionapi-dummy.metmuseum.org

prod:
  met_api_url: https://collectionapi.metmuseum.org

Take note of the filepath, that will come in handy later. We now have a file that accomplishes our objective: having a dummy url in our non-prod environments. However, there's a problem, we're copy pasting the same url multiple times! This isn't a big deal now, but you can probably imagine as our app grows and has more and more settings making sure we copy paste everything correctly could become a challenge. Fortunately, we can take advantage of one of the more advanced features of YAML here to help us: anchors and references. Essentially we can define a separate section of our file that defines the default values a field should take and only override the default when necessary. That looks something like this:

src/config/shared/config/config.yaml
default: &default 
  met_api_url: https://collectionapi-dummy.metmuseum.org

dev: 
  <<: *default 

qa:
  <<: *default

prod:
  <<: *default
  met_api_url: https://collectionapi.metmuseum.org 

In setting up our file in this way, we reduce the amount of duplicate lines in our configuration file we have to maintain! So now that we have a config file, how do we load it into our code?

Loading Time

Fortunately Pydantic comes to the rescue again here with the Pydantic Settings extension which makes loading config files a breeze!

To get started we can add pydantic-settings as a dependency of our project:

uv add pydantic-settings

Next we can create a new module in our python project under src/shared/config called config_loader.py to handle loading our configuration files. To start we can include a simple Pydantic data model to define our project's settings:

src/config/shared/config/config_loader.yaml
1
2
3
4
from pydantic import BaseModel

class Settings(BaseModel):
    met_api_url: str

Note

Don't forget to add an empty __init__.py file in the directory to make the module importable!

Next, we create a model to define our config file's structure, including each environment:

src/config/shared/config/config_loader.yaml
from pydantic import BaseModel

class Settings(BaseModel):
    met_api_url: str

class Config(BaseSettings):
    default: Settings
    dev: Settings
    qa: Settings
    prod: Settings

So far, so good. Now we need to tell Pydantic where the config file is located on disk and specify the logic for loading it in. Admittedly this isn't very clear in the Pydantic documentation so I'll share what I found to work and then break it down (make sure to scroll over to see all the annotations!):

src/config/shared/config/config_loader.yaml
import pathlib
from typing import Tuple, Type
from pydantic import BaseModel
from pydantic_settings import BaseSettings, PydanticBaseSettingsSource, SettingsConfigDict, YamlConfigSettingsSource


class Settings(BaseModel):
    met_api_url: str


class Config(BaseSettings):
    default: Settings
    dev: Settings
    qa: Settings
    prod: Settings
    model_config = SettingsConfigDict(yaml_file=pathlib.Path(__file__).parent.resolve() / 'config.yaml') 

    @classmethod
    def settings_customise_sources(
        cls,
        settings_cls: Type[BaseSettings],
        init_settings: PydanticBaseSettingsSource,
        env_settings: PydanticBaseSettingsSource,
        dotenv_settings: PydanticBaseSettingsSource,
        file_secret_settings: PydanticBaseSettingsSource,
    ) -> Tuple[PydanticBaseSettingsSource, ...]: 
        return (
            init_settings,
            env_settings,
            dotenv_settings,
            file_secret_settings,
            YamlConfigSettingsSource(settings_cls),
        )

Tip

Pydantic Settings supports a wide variety of settings sources including some of the alternative file formats we discussed, so you are free to choose whichever format you like best!

Finally we need to provide a function for loading the config when it is needed:

src/config/shared/config/config_loader.yaml
from functools import lru_cache
import pathlib
from typing import Tuple, Type
from pydantic import BaseModel
from pydantic_settings import BaseSettings, PydanticBaseSettingsSource, SettingsConfigDict, YamlConfigSettingsSource


class Settings(BaseModel):
    met_api_url: str


class Config(BaseSettings):
    default: Settings
    dev: Settings
    qa: Settings
    prod: Settings
    model_config = SettingsConfigDict(yaml_file=pathlib.Path(__file__).parent.resolve() / 'config.yaml')

    @classmethod
    def settings_customise_sources(
        cls,
        settings_cls: Type[BaseSettings],
        init_settings: PydanticBaseSettingsSource,
        env_settings: PydanticBaseSettingsSource,
        dotenv_settings: PydanticBaseSettingsSource,
        file_secret_settings: PydanticBaseSettingsSource,
    ) -> Tuple[PydanticBaseSettingsSource, ...]:
        return (
            init_settings,
            env_settings,
            dotenv_settings,
            file_secret_settings,
            YamlConfigSettingsSource(settings_cls),
        )


@lru_cache 
def load_config_settings(env: str) -> Settings:
    appconfig = Config()  # type: ignore 
    return getattr(appconfig, env) 

Now, we could make this more robust make making env an enum but this will work for now. Now that we have our config loader class we can use it to load our app settings!

Powering our App with Settings!

To use this loader we only need to make a few edits to our main.py function:

src/main.py
import os
from fastapi import FastAPI, HTTPException

from provider.met_provider import MetProvider
from service.search_service import SearchService
from shared.config.config_loader import load_config_settings

app = FastAPI()
app_settings = load_config_settings(os.getenv('ENV', 'dev')) 
search_service = SearchService(MetProvider(app_settings.met_api_url)) 


@app.get('/api/search')
def search(title: str) -> str:
    """Executes a search against the Metropolitan Museum of Art API and returns the url of the primary image of the first search result.

    Args:
        title: The title of the work you wish to search for.

    Returns:
        The url of the primary image of the first search result or 'No results found.' if no search results are found.
    """

    try:
        search_result = search_service.search_by_title(title)
        return search_result.primary_image
    except ValueError:
        raise HTTPException(status_code=404, detail='No results found.')

Now all that's left is to let our code know what environment it is running in and test out the app! An easy way to do this is create a new file called .env in the root of the project with the following contents:

.env
ENV=prod

When you start up a new shell uv is smart enough to load these environment variables for you. Go ahead and try it out! The app should still work as before, try changing the environment to QA and verify that the app breaks when it hits the dummy url! Congratulations, you now have a per-environment configurable app! Full code for this part is available here.

Delve Data

  • Applications often have a need to change variable values per Deployment Environment
  • Hard coding these values makes this difficult
  • Using tools like Pydantic Settings, we can create a configuration file that makes it easy to switch these values per environment