Documentation Index
Fetch the complete documentation index at: https://wb-21fd5541-style-guide-models-integrations-20260527-015516.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Overview
Metaflow is a framework created by Netflix for creating and running ML workflows. This integration lets you apply decorators to Metaflow steps and flows to automatically log parameters and artifacts to W&B, so you can track experiments and inspect lineage across the workflows you build with Metaflow without writing custom logging code:- Decorating a step turns logging off or on for certain types within that step.
- Decorating the flow turns logging off or on for every step in the flow.
Quickstart
The following sections walk you through authenticating with W&B, installing the required libraries, and adding thewandb_log decorator to your Metaflow steps and flows.
Sign up and create an API key
An API key authenticates your machine to W&B. You can generate an API key from your user profile.For a more streamlined approach, create an API key by going directly to User Settings. Copy the newly created API key immediately and save it in a secure location such as a password manager.
- Click your user profile icon in the upper right corner.
- Select User Settings, then scroll to the API Keys section.
Install the wandb library and log in
To install the wandb library locally and log in:
For
wandb version 0.19.8 or below, install fastcore version 1.8.0 or below (fastcore<1.8.0) instead of plum-dispatch.- Command line
- Python
- Python notebook
-
Set the
WANDB_API_KEYenvironment variable to your API key. -
Install the
wandblibrary and log in.
Decorate your flows and steps
- Step
- Flow
- Flow and steps
Decorating a step turns logging off or on for certain types within that step.In this example, the integration logs all datasets and models in
start:Access your data programmatically
Once your flows and steps are decorated, runs send parameters and artifacts to W&B each time the flow executes. You can access the captured information in three ways: inside the original Python process being logged using thewandb client library, with the web app UI, or programmatically using the Public API. Parameters are saved to the W&B config and can be found in the Overview tab. datasets, models, and others are saved to W&B Artifacts and can be found in the Artifacts tab. Base python types are saved to the W&B summary dict and can be found in the Overview tab. See the guide to the Public API for details on using the API to get this information programmatically from outside.
Quick reference
| Data | Client library | UI |
|---|---|---|
Parameter(...) | wandb.Run.config | Overview tab, Config |
datasets, models, others | wandb.Run.use_artifact("{var_name}:latest") | Artifacts tab |
Base Python types (dict, list, str, etc.) | wandb.Run.summary | Overview tab, Summary |
wandb_log kwargs
| kwarg | Options |
|---|---|
datasets |
|
models |
|
others |
|
settings |
By default, if:
|
Frequently asked questions
The following sections answer common questions about logging behavior, supported data types, and artifact lineage.What exactly do you log
wandb_log only logs instance variables. Local variables are never logged. This is useful to avoid logging unnecessary data.
Which data types get logged
W&B supports these types:| Logging setting | Type |
|---|---|
| default (always on) |
|
datasets |
|
models |
|
others |
|
Configure logging behavior
| Kind of variable | Behavior | Example | Data type |
|---|---|---|---|
| Instance | Auto-logged | self.accuracy | float |
| Instance | Logged if datasets=True | self.df | pd.DataFrame |
| Instance | Not logged if datasets=False | self.df | pd.DataFrame |
| Local | Never logged | accuracy | float |
| Local | Never logged | df | pd.DataFrame |