Top Related Projects
- TensorFlow: An Open Source Machine Learning Framework for Everyone
- PyTorch: Tensors and Dynamic neural networks in Python with strong GPU acceleration
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
- Seldon Core: An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
- Kubeflow: Machine Learning Toolkit for Kubernetes
- MLflow: Open source platform for the machine learning lifecycle
Quick Overview
Cortex is an open-source platform for deploying, managing, and scaling machine learning models in production. It provides a cloud-native solution that simplifies the process of deploying ML models as scalable, fault-tolerant APIs. Cortex automates infrastructure management and allows data scientists and engineers to focus on model development and deployment.
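At a high level, a deployment pairs a Python predictor with a YAML API specification. A minimal sketch of a predictor, assuming the class-based PythonPredictor interface shown later on this page (load_model and model_path are placeholders, not Cortex APIs):

# predictor.py -- hypothetical minimal predictor
class PythonPredictor:
    def __init__(self, config):
        # runs once at API startup; load_model is a placeholder for your own loading code
        self.model = load_model(config["model_path"])

    def predict(self, payload):
        # runs on every request; payload is the parsed request body
        return self.model.predict(payload)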
Pros
- Simplifies ML model deployment with automated infrastructure management
- Supports various ML frameworks and languages (e.g., TensorFlow, PyTorch, scikit-learn)
- Offers automatic scaling and load balancing for deployed models
- Provides monitoring and logging capabilities for deployed APIs
Cons
- Primarily designed for AWS, limiting cloud provider options
- May have a learning curve for users new to cloud-native technologies
- Requires some understanding of containerization and Kubernetes concepts
- Limited community support compared to some other ML deployment platforms
Getting Started
To get started with Cortex, follow these steps:
- Install the Cortex CLI:
  bash -c "$(curl -sS https://raw.githubusercontent.com/cortexlabs/cortex/master/get-cli.sh)"
- Configure your AWS credentials:
  export AWS_ACCESS_KEY_ID=<your-access-key>
  export AWS_SECRET_ACCESS_KEY=<your-secret-key>
- Create a Cortex cluster (a minimal cluster.yaml sketch follows below):
  cortex cluster up cluster.yaml
- Deploy a sample model:
  cortex deploy
- Get the API endpoint:
  cortex get --watch
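The cluster.yaml file referenced above describes the cluster to provision. A minimal sketch, assuming field names from typical Cortex cluster configurations (these vary between versions, so check the documentation for your release):

# cluster.yaml -- minimal sketch; field names may differ by Cortex version
cluster_name: cortex
region: us-east-1
node_groups:
  - name: workers
    instance_type: m5.large
    min_instances: 1
    max_instances: 5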
For more detailed instructions and examples, refer to the official Cortex documentation.
Competitor Comparisons
TensorFlow: An Open Source Machine Learning Framework for Everyone
Pros of TensorFlow
- Extensive ecosystem with a wide range of tools and libraries
- Strong community support and extensive documentation
- Flexible architecture supporting various platforms and devices
Cons of TensorFlow
- Steeper learning curve for beginners
- Can be resource-intensive for smaller projects
- More complex setup and configuration process
Code Comparison
Cortex (Python):
@cortex.predictor
def predict(payload):
    # 'model' is assumed to be loaded elsewhere at startup
    return model.predict(payload)
TensorFlow (Python):
import tensorflow as tf

# load a saved Keras model and run inference on prepared input_data
model = tf.keras.models.load_model('model.h5')
predictions = model.predict(input_data)
Cortex focuses on simplifying ML model deployment, while TensorFlow provides a comprehensive framework for building and training machine learning models. Cortex offers a more streamlined approach for deploying models in production, whereas TensorFlow gives developers greater control over the entire machine learning pipeline.
TensorFlow's extensive features and flexibility make it suitable for a wide range of ML tasks, from research to large-scale production. However, this comes with increased complexity. Cortex, on the other hand, prioritizes ease of use and rapid deployment, making it more accessible for developers looking to quickly operationalize their models.
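The two are complementary rather than mutually exclusive: a model built and trained with TensorFlow can be served through Cortex. A hedged sketch, reusing the class-based PythonPredictor interface shown later on this page (the "instances" payload field is an assumption for illustration):

import numpy as np
import tensorflow as tf

class PythonPredictor:
    def __init__(self, config):
        # load the trained Keras model once at API startup
        self.model = tf.keras.models.load_model("model.h5")

    def predict(self, payload):
        # "instances" is an assumed request field holding feature rows
        x = np.array(payload["instances"])
        return self.model.predict(x).tolist()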
PyTorch: Tensors and Dynamic neural networks in Python with strong GPU acceleration
Pros of PyTorch
- Widely adopted and supported by a large community
- Extensive documentation and tutorials available
- Flexible and dynamic computational graph
Cons of PyTorch
- Steeper learning curve for beginners
- Slower execution compared to static graph frameworks
- Larger memory footprint
Code Comparison
PyTorch:
import torch
x = torch.tensor([1, 2, 3])
y = torch.tensor([4, 5, 6])
z = torch.add(x, y)
Cortex:
import cortex

@cortex.predictor
def add(payload):
    x = payload["x"]
    y = payload["y"]
    return [a + b for a, b in zip(x, y)]
Key Differences
- PyTorch is a deep learning framework, while Cortex is a machine learning deployment platform
- PyTorch focuses on model development, while Cortex emphasizes model deployment and serving
- PyTorch provides low-level tensor operations, while Cortex offers high-level deployment abstractions
Use Cases
- PyTorch: Research, prototyping, and developing complex neural network architectures
- Cortex: Deploying and scaling machine learning models in production environments
Community and Support
- PyTorch: Large, active community with extensive third-party libraries and resources
- Cortex: Smaller but growing community, focused on deployment and MLOps
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Pros of ONNX Runtime
- Broader ecosystem support and compatibility with various ML frameworks
- Extensive optimization capabilities for improved inference performance
- Larger community and more frequent updates
Cons of ONNX Runtime
- Steeper learning curve for beginners
- Requires more manual configuration for deployment
Code Comparison
ONNX Runtime:
import onnxruntime as ort
session = ort.InferenceSession("model.onnx")
input_name = session.get_inputs()[0].name
output = session.run(None, {input_name: input_data})
Cortex:
import cortex

@cortex.predictor
def predict(payload):
    # 'model' is assumed to be loaded elsewhere at startup
    return model.predict(payload)

cortex.deploy("my-api", predictor=predict)
ONNX Runtime focuses on optimizing inference for ONNX models across various platforms, while Cortex provides a more streamlined approach for deploying machine learning models as APIs. ONNX Runtime offers greater flexibility and performance optimization but requires more setup. Cortex simplifies the deployment process but may have limitations in terms of customization and optimization options.
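A common bridge between the two ecosystems is exporting a trained model to ONNX and serving it with ONNX Runtime. A minimal sketch using PyTorch's built-in exporter (the single linear layer is a stand-in for a real model):

import torch
import onnxruntime as ort

# a trivial stand-in model, used only for illustration
model = torch.nn.Linear(3, 1)
model.eval()

# export to ONNX; the dummy input fixes the expected input shape
dummy = torch.randn(1, 3)
torch.onnx.export(model, dummy, "model.onnx")

# run the exported model with ONNX Runtime on CPU
session = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])
input_name = session.get_inputs()[0].name
output = session.run(None, {input_name: dummy.numpy()})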
Seldon Core: An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
Pros of Seldon Core
- More extensive support for different ML frameworks and languages
- Advanced features like A/B testing, canary deployments, and multi-armed bandits
- Larger community and ecosystem with more integrations
Cons of Seldon Core
- Steeper learning curve and more complex setup
- Requires more Kubernetes expertise to manage effectively
- Can be resource-intensive for smaller deployments
Code Comparison
Seldon Core (Python):
from seldon_core.seldon_client import SeldonClient

# model wrapper class that Seldon Core packages into a microservice
class MyModel(object):
    def predict(self, X, features_names=None):
        return X * 2

# client for calling the deployed model
sc = SeldonClient(deployment_name="mymodel", namespace="default")
Cortex:
import cortex

class PythonPredictor:
    def predict(self, payload):
        return payload * 2

cortex.deploy("mymodel.yaml")
Both frameworks offer straightforward ways to define and deploy models, but Seldon Core typically requires more configuration and setup in Kubernetes environments. Cortex aims for a simpler deployment process, especially for smaller-scale projects or those new to ML deployment.
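For context, a Seldon Core model is deployed by applying a Kubernetes custom resource rather than by running a CLI command. A minimal sketch of a SeldonDeployment (the model location is a placeholder):

apiVersion: machinelearning.seldon.io/v1
kind: SeldonDeployment
metadata:
  name: mymodel
spec:
  predictors:
    - name: default
      replicas: 1
      graph:
        name: classifier
        implementation: SKLEARN_SERVER
        modelUri: gs://my-bucket/model  # placeholder model location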
Kubeflow: Machine Learning Toolkit for Kubernetes
Pros of Kubeflow
- More comprehensive ML platform with a wider range of tools and components
- Larger community and ecosystem, with extensive documentation and support
- Better suited for large-scale, enterprise-level ML workflows
Cons of Kubeflow
- Steeper learning curve and more complex setup process
- Requires more resources and can be overkill for smaller projects
- Less focus on simplicity and ease of use compared to Cortex
Code Comparison
Kubeflow pipeline example:
from kfp import dsl

@dsl.pipeline(
    name='My ML Pipeline',
    description='A sample ML pipeline'
)
def my_pipeline():
    # each *_op is a pipeline component defined elsewhere
    preprocess = preprocess_op()
    train = train_model_op(preprocess.output)
    evaluate = evaluate_model_op(train.output)
Cortex deployment example:
- name: my-model
  kind: RealtimeAPI
  predictor:
    type: python
    path: predictor.py
  compute:
    cpu: 1
Both Cortex and Kubeflow aim to simplify machine learning workflows on Kubernetes, but they have different focuses. Cortex emphasizes simplicity and ease of use for deploying models, while Kubeflow offers a more comprehensive platform for the entire ML lifecycle. The choice between them depends on the scale and complexity of your ML projects.
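To make the Kubeflow snippet above runnable end to end, the pipeline is compiled and submitted with the KFP SDK; a brief sketch (the endpoint URL is a placeholder, and the component ops are assumed to be defined elsewhere):

import kfp

# compile the pipeline function defined above into a portable spec
kfp.compiler.Compiler().compile(my_pipeline, "pipeline.yaml")

# submit it to a Kubeflow Pipelines endpoint
client = kfp.Client(host="http://localhost:8080")  # placeholder endpoint
client.create_run_from_pipeline_func(my_pipeline, arguments={})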
MLflow: Open source platform for the machine learning lifecycle
Pros of MLflow
- More comprehensive tracking and management of the entire ML lifecycle
- Broader language support (Python, R, Java, and more)
- Larger community and ecosystem with extensive integrations
Cons of MLflow
- Can be more complex to set up and use for simple projects
- Less focus on deployment and serving compared to Cortex
- May require additional tools for production-grade model serving
Code Comparison
MLflow experiment tracking:
import mlflow
import mlflow.sklearn

with mlflow.start_run():
    mlflow.log_param("param1", 5)
    mlflow.log_metric("accuracy", 0.85)
    mlflow.sklearn.log_model(model, "model")
Cortex deployment:
- name: iris-classifier
  kind: RealtimeAPI
  predictor:
    type: python
    path: predictor.py
  compute:
    cpu: 1
MLflow is more focused on experiment tracking and model management, while Cortex emphasizes deployment and serving of machine learning models. MLflow provides a more comprehensive solution for the entire ML lifecycle, but may require additional setup for production deployments. Cortex, on the other hand, offers a more streamlined approach to deploying models as APIs but with less emphasis on experiment tracking and model versioning.
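On the MLflow side, a model logged as above can later be reloaded through the framework-agnostic pyfunc interface; a brief sketch (the run ID and input_data are placeholders):

import mlflow.pyfunc

# load a previously logged model by its run URI
model = mlflow.pyfunc.load_model("runs:/<run_id>/model")  # run ID is a placeholder
predictions = model.predict(input_data)

MLflow can also serve the same model as a local REST endpoint via the mlflow models serve CLI command.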
README
Note: This project is no longer actively maintained by its original authors.
Production infrastructure for machine learning at scale
Deploy, manage, and scale machine learning models in production.
Serverless workloads
Realtime - respond to requests in real-time and autoscale based on in-flight request volumes.
Async - process requests asynchronously and autoscale based on request queue length.
Batch - run distributed and fault-tolerant batch processing jobs on-demand (a configuration sketch of these workload kinds follows below).
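A sketch of how these workload kinds are declared, modeled on the RealtimeAPI examples elsewhere on this page (the BatchAPI fields are assumptions and may differ by Cortex version):

- name: realtime-example
  kind: RealtimeAPI
  predictor:
    type: python
    path: predictor.py

- name: batch-example
  kind: BatchAPI
  predictor:
    type: python
    path: predictor.py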
Automated cluster management
Autoscaling - elastically scale clusters with CPU and GPU instances.
Spot instances - run workloads on spot instances with automated on-demand backups (see the configuration sketch after this list).
Environments - create multiple clusters with different configurations.
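A sketch of enabling spot instances in the cluster configuration, assuming field names from typical Cortex cluster configs (verify against the documentation for your version):

# cluster.yaml fragment -- field names may vary by Cortex version
node_groups:
  - name: spot-workers
    instance_type: m5.large
    spot: true  # request spot capacity; on-demand backup behavior is configured separately
    min_instances: 0
    max_instances: 10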
CI/CD and observability integrations
Provisioning - provision clusters with declarative configuration or a Terraform provider.
Metrics - send metrics to any monitoring tool or use pre-built Grafana dashboards.
Logs - stream logs to any log management tool or use the pre-built CloudWatch integration.
Built for AWS
EKS - Cortex runs on top of EKS to scale workloads reliably and cost-effectively.
VPC - deploy clusters into a VPC on your AWS account to keep your data private.
IAM - integrate with IAM for authentication and authorization workflows.