Skip to content

Data Algorithms

Data Algorithms

WIP: These classes are currently actively being developed and are subject to change in both API and functionality over time. They provide a set of data algorithms for various types of data storage. We currently have subdirectorys for:

  • DataFrames: A set of algorithms that consume/return a Panda dataFrame.
  • Graphs: Algorithms for node/edge graphs, specifically focused on NetworkX graphs.
  • Spark: A set of algorithms that consume/return a Spark dataFrame.
  • SQL: SQL queries that provide a wide range of functionality:

    • Outliers
    • Descriptive Stats
    • Correlations
    • and More

Welcome to the SageWorks Data Algorithms

Docs TBD

Questions?

The SuperCowPowers team is happy to answer any questions you may have about AWS and SageWorks. Please contact us at sageworks@supercowpowers.com or on chat us up on Discord