AWS Lambda Layer

Workbench Lambda Layer

Run lightweight, scheduled Workbench jobs on AWS Lambda — no packaging, no container.

The Workbench Lambda layer is a dependency-minimal slice of Workbench, published per region and Python version. It bundles all of Workbench's source plus only networkx and pandas; boto3/botocore come from the Lambda runtime. The heavy dependencies (torch, awswrangler, sagemaker, ...) are intentionally absent, so the layer stays small and imports fast.

Scope

Allows importing anything from workbench.lambda_layer The full API (Meta/Model) isn't available.

Published ARNs

Attach the ARN matching your region and Python version (3.12):

us-east-1

arn:aws:lambda:us-east-1:507740646243:layer:workbench-lambda-layer-us-east-1-python312-wip:5

us-west-2

arn:aws:lambda:us-west-2:507740646243:layer:workbench-lambda-layer-us-west-2-python312-wip:5

The published versions are made public (lambda:GetLayerVersion to *), so you attach them by ARN with no per-account permission grants. Need a different region or Python version? Let us know and we'll publish it.

The -wip suffix

The layer name carries -wip while its contents are still settling, and the version bumps on each re-publish. Pin the exact version your function uses.

Each published version bundles a pinned Workbench release (:5 carries 0.8.438), so the layer's contents are reproducible from the ARN alone.

Using it from a Lambda

In the Lambda console, Add a layer → Specify an ARN and paste the ARN above. Then import and use PipelineManager directly:

from workbench.lambda_layer.pipeline_manager import PipelineManager


def lambda_handler(event, context):
    pm = PipelineManager(f"s3://{event['bucket']}/ml_pipelines/")
    for item in pm.plan():            # (job, run, reason) per job, real AWS mtimes
        if item.run:
            ...  # submit item.job

PipelineManager uses the default boto3 session (the Lambda's role and region), so no Workbench config is required.

add lambda layer

IAM

The execution role needs read access to whatever artifacts the manager resolves modification times against (plus the bucket it discovers pipelines.json from):

glue:GetTable — DataSource (ds:) update times
sagemaker:DescribeFeatureGroup, sagemaker:ListModelPackages, sagemaker:DescribeEndpoint — FeatureSet/Model/Endpoint times
s3:GetObject, s3:ListBucket — discovering pipelines.json files

public: refs resolve against the public data bucket with an unsigned client, so they need no permissions.

See Workbench Access Controls.

Runnable example

A read-only example handler plus an offline smoke test (loads the built layer and runs without AWS) lives in lambda_layers/example_lambda/. Use it to validate the layer in your account before wiring up a real job.

Exception log forwarding

When a Lambda crashes, the AWS console shows only the last line of the exception. Wrap your handler to forward the full stack to CloudWatch:

from workbench.utils.workbench_logging import exception_log_forward

with exception_log_forward():
    ...  # your lambda code; any exception/stack is forwarded to CloudWatch

Building and publishing

The layer is built and published from lambda_layers/:

./lambda_layers/build_deploy.sh                                    # build the zip locally
AWS_PROFILE=<profile> ./lambda_layers/build_deploy.sh --deploy     # publish to us-east-1, us-west-2

The dependency budget (source + networkx + pandas) is enforced by tests/lambda_layer/test_layer_dependencies.py, which fails if any lambda_layer module imports outside it.

Additional Resources

Setting up Workbench on your AWS Account: AWS Setup
Using Workbench for ML Pipelines: ML Pipelines
Workbench Access Management: Access Management

Consulting Available: SuperCowPowers LLC