Skip to content

AWS Lambda Layer

SageWorks Lambda Layers

AWS Lambda Jobs are a great way to spin up data processing jobs. Follow this guide and empower AWS Lambda with SageWorks!

SageWorks makes creating, testing, and debugging of AWS Lambda Functions easy. The exact same SageWorks API Classes are used in your AWS Lambda Functions. Also since SageWorks manages the access policies you'll be able to test new Lambda Jobs locally and minimizes surprises when deploying.

Work In Progress

The SageWorks Lambda Layers are a great way to use SageWorks but they are still in 'beta' mode so please let us know if you have any issues.

Lambda Job Setup

Setting up a AWS Lambda Job that uses SageWorks is straight forward. SageWorks can be 'installed' using a Lambda Layer and then you can use the Sageworks API just like normal.

Here are the ARNs for the current SageWorks Lambda Layers, please note they are specified with region and Python version in the name, so if your lambda is us-east-1, python 3.12, pick this ARN with those values in it.

Python 3.12 (if you need another version/region let us know)

us-east-1

  • arn:aws:lambda:us-east-1:507740646243:layer:sageworks_lambda_layer-us-east-1-python312:7

us-west-2

  • arn:aws:lambda:us-west-2:507740646243:layer:sageworks_lambda_layer-us-west-2-python312:8

Note: If you're using lambdas on a different region or with a different Python version, just let us know and we'll publish some additional layers.

lambda_layer

At the bottom of the Lambda page there's an 'Add Layer' button. You can click that button and specify the layer using the ARN above. Also in the 'General Configuration' set these parameters:

  • Timeout: 5 Minutes
  • Memory: 4096
  • Ephemeral storage: 2048

Set the SAGEWORKS_BUCKET ENV SageWorks will need to know what bucket to work out of, so go into the Configuration...Environment Variables... and add one for the SageWorks bucket that your are using for AWS Account (dev, prod, etc). lambda_layer

Lambda Role Details

If your Lambda Function already use an existing IAM Role then you can add the SageWorks policies to that Role to enable the Lambda Job to perform SageWorks API Tasks. See SageWorks Access Controls

SageWorks Lambda Example

Here's a simple example of using SageWorks in your Lambda Function.

SageWorks Layer is Compressed

The SageWorks Lambda Layer is compressed (to fit all the awesome). This means that the load_lambda_layer() method must be called before using any other SageWorks imports, see the example below. If you do not do this you'll probably get a No module named 'numpy' error or something like that.

examples/lambda_hello_world.py
import json
from pprint import pprint
from sageworks.utils.lambda_utils import load_lambda_layer

# Load/Decompress the SageWorks Lambda Layer
load_lambda_layer()

# After 'load_lambda_layer()' we can use other SageWorks imports
from sageworks.api import Meta
from sageworks.api import Model 

def lambda_handler(event, context):

    # Create our Meta Class and get a list of our Models
    meta = Meta()
    models = meta.models()

    print(f"Number of Models: {len(models)}")
    print(models)

    # Onboard a model
    model = Model("abalone-regression")
    pprint(model.details())

    # Return success
    return {
        'statusCode': 200,
        'body': { "incoming_event": event}
    }

Exception Log Forwarding

When a Lambda Job crashes (has an exception), the AWS console will show you the last line of the exception, this is mostly useless. If you use SageWorks log forwarding the exception/stack will be forwarded to CloudWatch.

from sageworks.utils.sageworks_logging import exception_log_forward

with exception_log_forward():
   <my lambda code>
   ...
   <exception happens>
   <more of my code>
The exception_log_forward sets up a context manager that will trap exceptions and forward the exception/stack to CloudWatch for diagnosis.

Lambda Function Local Testing

Lambda Power without the Pain. SageWorks manages the AWS Execution Role/Policies, so local API and Lambda Functions will have the same permissions/access. Also using the same Code as your notebooks or scripts makes creating and testing Lambda Functions a breeze.

python my_lambda_function.py --sageworks-bucket <your bucket>

Additional Resources