Delve 10: Let's Build a Modern ML Microservice Application - Part 4, Configuration
"The measure of intelligence is the ability to change." - Albert Einstein
ML Microservices, The Great Env-scape
Hello data delvers! In part three of this series we refactored our application into three separate layers, allowing us to better separate concerns within our codebase. However, if we examine certain parts of our code we can still observe some brittleness:
The problem occurs on line 7, right now we are hardcoding a link to https://collectionapi.metmuseum.org
. However, what if this link changes? We would have to change our code! What's the big deal you may ask? I can just update my code when needed right? While yes you could, the problem comes if we want to deploy different versions of our code each with a different url. This can happen in a few different scenarios:
Scenario 1:
Imagine we want to load balance our application. In this setup we have one copy of our application deployed on servers on the east coast and another copy deployed on servers on the west. In addition, the search API has hypothetically deployed two different copies of itself, one in east and one it west each behind two different urls:
- https://collectionapi-east.metmuseum.org
- https://collectionapi-west.metmuseum.org
We want to deploy the same code to each region but change the url it is pointing to.
Scenario 2:
Perhaps more common, we are following a Deployment Environment pattern in our codebase and we have 3 separate environments:
- DEV - Where we deploy changes we are actively working on
- QA - Where we test that our code is behaving as expected
- PROD - Where consumers of our application directly interact with it
We'd like to only hit the "real" search service in our PROD environment and hit dummy urls in our lower environments to make sure we don't overwhelm the real search service with our tests.
Note
There are other variations of this pattern with more environments, for this example I'm choosing to use a relatively simple setup with only 3.
There are other scenarios where the url can change but I find these two to be the most frequent and for this delve I'd like to hone in on the second scenario as the deployment environment pattern is extremely common in practice. If you haven't encountered it before I encourage you to read up on it, every enterprise application I've ever worked on has used some variation of the pattern without fail, it is that ubiquitous. So how can we implement this pattern in our codebase?
(Con)figure it Out!
One simple way to handle this is to introduce a configuration file to our application that could hold the url per environment. We could then set a flag (such as an environment variable) the lets the application know which environment it is running in so it can choose which setting to read. To start we need to choose how we define our config file. A few options to choose from that I've seen before include:
- .env - Probably one of the simplest formats out there, simply stores key value pairs
- JSON - Just like with our APIs, we can use JSON to specify our app configuration, though it is a bit verbose
- HOCON - A superset of JSON designed to be more human readable, though not as common in Python projects
- YAML - Another superset of JSON with additional capabilities, fairly common
- TOML - A simplified format somewhere between .env and YAML, starting to become more popular in Python projects, especially with the advent of the
pyproject.toml
standard
You could be successful using any of these formats but I'm going to go with YAML as I like that it is not as verbose as JSON but provides support for more complex structures than TOML.
To begin, let's create a new file to hold our configuration:
src/config/shared/config/config.yaml | |
---|---|
Take note of the filepath, that will come in handy later. We now have a file that accomplishes our objective: having a dummy url in our non-prod environments. However, there's a problem, we're copy pasting the same url multiple times! This isn't a big deal now, but you can probably imagine as our app grows and has more and more settings making sure we copy paste everything correctly could become a challenge. Fortunately, we can take advantage of one of the more advanced features of YAML here to help us: anchors and references. Essentially we can define a separate section of our file that defines the default values a field should take and only override the default when necessary. That looks something like this:
src/config/shared/config/config.yaml | |
---|---|
In setting up our file in this way, we reduce the amount of duplicate lines in our configuration file we have to maintain! So now that we have a config file, how do we load it into our code?
Loading Time
Fortunately Pydantic comes to the rescue again here with the Pydantic Settings extension which makes loading config files a breeze!
To get started we can add pydantic-settings
as a dependency of our project:
Next we can create a new module in our python project under src/shared/config
called config_loader.py
to handle loading our configuration files. To start we can include a simple Pydantic data model to define our project's settings:
src/config/shared/config/config_loader.yaml | |
---|---|
Note
Don't forget to add an empty __init__.py
file in the directory to make the module importable!
Next, we create a model to define our config file's structure, including each environment:
src/config/shared/config/config_loader.yaml | |
---|---|
So far, so good. Now we need to tell Pydantic where the config file is located on disk and specify the logic for loading it in. Admittedly this isn't very clear in the Pydantic documentation so I'll share what I found to work and then break it down (make sure to scroll over to see all the annotations!):
Tip
Pydantic Settings supports a wide variety of settings sources including some of the alternative file formats we discussed, so you are free to choose whichever format you like best!
Finally we need to provide a function for loading the config when it is needed:
Now, we could make this more robust make making env an enum
but this will work for now. Now that we have our config loader class we can use it to load our app settings!
Powering our App with Settings!
To use this loader we only need to make a few edits to our main.py
function:
Now all that's left is to let our code know what environment it is running in and test out the app! An easy way to do this is create a new file called .env
in the root of the project with the following contents:
.env | |
---|---|
When you start up a new shell uv is smart enough to load these environment variables for you. Go ahead and try it out! The app should still work as before, try changing the environment to QA and verify that the app breaks when it hits the dummy url! Congratulations, you now have a per-environment configurable app! Full code for this part is available here.
Delve Data
- Applications often have a need to change variable values per Deployment Environment
- Hard coding these values makes this difficult
- Using tools like Pydantic Settings, we can create a configuration file that makes it easy to switch these values per environment