Top Related Projects
Tensors and Dynamic neural networks in Python with strong GPU acceleration
An Open Source Machine Learning Framework for Everyone
Open standard for machine learning interoperability
Open deep learning compiler stack for cpu, gpu and specialized accelerators
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Quick Overview
DLPack is an open, in-memory tensor structure for sharing tensors among deep learning frameworks. It provides a common interface for tensor data exchange, enabling interoperability between machine learning libraries and frameworks without copying data.
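As a minimal sketch of this zero-copy exchange (assuming NumPy 1.22+ and PyTorch 1.10+, both of which implement the DLPack protocol):
import numpy as np
import torch
t = torch.arange(4, dtype=torch.float32)
a = np.from_dlpack(t)  # zero-copy view of t's buffer via the DLPack protocol
t[0] = 42.0
print(a[0])  # 42.0 -- both objects share the same memory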
Pros
- Facilitates seamless tensor sharing between different deep learning frameworks
- Reduces memory overhead by eliminating the need for data copying
- Improves performance in multi-framework environments
- Promotes standardization and interoperability in the machine learning ecosystem
Cons
- Limited to tensor data structures, not covering other aspects of machine learning workflows
- Requires adoption by multiple frameworks to be truly effective
- May introduce additional complexity in framework-specific implementations
- Potential for version compatibility issues as the specification evolves
Code Examples
- Creating a DLPack tensor:
import numpy as np
import torch
from torch.utils.dlpack import to_dlpack, from_dlpack
# Create a NumPy array
np_array = np.array([[1, 2, 3], [4, 5, 6]], dtype=np.float32)
# Convert NumPy array to PyTorch tensor
torch_tensor = torch.from_numpy(np_array)
# Convert the PyTorch tensor to a DLPack capsule
dlpack_tensor = to_dlpack(torch_tensor)
- Converting DLPack tensor to a framework-specific tensor:
# Convert DLPack tensor back to PyTorch tensor
new_torch_tensor = from_dlpack(dlpack_tensor)
print(new_torch_tensor)
- Using DLPack with TensorFlow:
import tensorflow as tf
# A DLPack capsule can only be consumed once, so export a fresh one
dlpack_tensor = to_dlpack(torch_tensor)
# Convert the DLPack capsule to a TensorFlow tensor
tf_tensor = tf.experimental.dlpack.from_dlpack(dlpack_tensor)
print(tf_tensor)
Getting Started
To use DLPack, you typically don't need to install it directly. Instead, you'll use it through supported frameworks. Here's a quick example using PyTorch and NumPy:
import numpy as np
import torch
from torch.utils.dlpack import to_dlpack, from_dlpack
# Create a NumPy array
np_array = np.random.rand(3, 4).astype(np.float32)
# Convert to PyTorch tensor
torch_tensor = torch.from_numpy(np_array)
# Convert to DLPack
dlpack_tensor = to_dlpack(torch_tensor)
# Convert back to PyTorch tensor
new_torch_tensor = from_dlpack(dlpack_tensor)
print(torch.allclose(torch_tensor, new_torch_tensor)) # Should print True
This example demonstrates creating a tensor, converting it to DLPack format, and then back to a framework-specific tensor without data copying.
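Since the round trip shares storage rather than copying it, one quick way to verify this is to compare the underlying data pointers (a small sketch using PyTorch's data_ptr() accessor for the raw buffer address):
# The round trip preserves the underlying storage, not just the values
print(torch_tensor.data_ptr() == new_torch_tensor.data_ptr())  # True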
Competitor Comparisons
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Pros of PyTorch
- Comprehensive deep learning framework with a wide range of tools and functionalities
- Large and active community, extensive documentation, and tutorials
- Seamless integration with Python ecosystem and scientific computing libraries
Cons of PyTorch
- Larger footprint and potentially slower startup time compared to DLPack
- Steeper learning curve for beginners due to its extensive feature set
- May be overkill for simple tensor operations or lightweight projects
Code Comparison
PyTorch:
import torch
x = torch.tensor([1, 2, 3])
y = torch.tensor([4, 5, 6])
z = x + y
print(z)
DLPack:
#include <dlpack/dlpack.h>
// DLPack is a specification, not a full framework
// Actual implementation depends on the library using DLPack
Summary
PyTorch is a full-featured deep learning framework, while DLPack is a lightweight tensor metadata specification. PyTorch offers a complete ecosystem for machine learning development, whereas DLPack focuses on providing a common interface for tensor data exchange between different frameworks. Choose PyTorch for comprehensive ML projects, and consider DLPack for interoperability between different ML libraries.
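Note that recent PyTorch versions (1.10+) also expose a one-step torch.from_dlpack that accepts any object implementing the __dlpack__ protocol, not just capsules. A minimal sketch, assuming NumPy 1.22+ on the producer side:
import numpy as np
import torch
a = np.arange(6, dtype=np.float32)
t = torch.from_dlpack(a)  # zero-copy: t and a share one buffer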
An Open Source Machine Learning Framework for Everyone
Pros of TensorFlow
- Comprehensive ecosystem with tools for all stages of ML development
- Extensive documentation and community support
- Flexible deployment options across various platforms
Cons of TensorFlow
- Steeper learning curve for beginners
- Can be resource-intensive for smaller projects
- More complex setup compared to lightweight alternatives
Code Comparison
DLPack:
DLManagedTensor* dlpack_tensor = ...;
DLTensor* dl_tensor = &dlpack_tensor->dl_tensor;
TensorFlow:
import tensorflow as tf
tensor = tf.constant([[1, 2], [3, 4]])
result = tf.matmul(tensor, tensor)
Key Differences
- DLPack is a lightweight tensor metadata specification, while TensorFlow is a full-fledged ML framework
- DLPack focuses on interoperability between deep learning frameworks, whereas TensorFlow provides end-to-end ML capabilities
- TensorFlow offers high-level APIs and abstractions, while DLPack operates at a lower level for tensor representation
Use Cases
- DLPack: Ideal for projects requiring seamless data exchange between different ML frameworks (see the sketch below)
- TensorFlow: Suitable for large-scale ML projects, production deployments, and research applications
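To make the interoperability use case concrete, here is a sketch of handing a TensorFlow tensor to PyTorch through a DLPack capsule (assuming TF 2.x's tf.experimental.dlpack module and PyTorch 1.10+; CPU tensors for simplicity):
import tensorflow as tf
import torch
tf_tensor = tf.constant([[1.0, 2.0], [3.0, 4.0]])
capsule = tf.experimental.dlpack.to_dlpack(tf_tensor)  # export as a DLPack capsule
torch_tensor = torch.from_dlpack(capsule)              # import without copying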
Open standard for machine learning interoperability
Pros of ONNX
- Broader ecosystem support and wider adoption across various frameworks and tools
- More comprehensive representation of complex neural network architectures
- Includes a runtime (ONNX Runtime) for model execution and optimization
Cons of ONNX
- Larger and more complex specification, potentially harder to implement
- May have more overhead for simpler use cases or tensor operations
Code Comparison
DLPack:
typedef struct {
  void* data;
  DLDevice device;
  int32_t ndim;
  DLDataType dtype;
  int64_t* shape;
  int64_t* strides;
  uint64_t byte_offset;
} DLTensor;
ONNX:
message TensorProto {
  repeated int64 dims = 1;
  int32 data_type = 2;
  repeated float float_data = 4 [packed = true];
  repeated int32 int32_data = 5 [packed = true];
  repeated bytes string_data = 6;
  repeated int64 int64_data = 7 [packed = true];
  string name = 8;
  bytes raw_data = 9;
  // ... (additional fields omitted for brevity)
}
DLPack focuses on a lightweight tensor representation, while ONNX provides a more comprehensive model format with additional metadata and operations. DLPack is simpler and easier to integrate for basic tensor interoperability, whereas ONNX offers a complete solution for model exchange and execution across different frameworks and platforms.
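The metadata in DLTensor is what the Python-level DLPack protocol surfaces. For example, __dlpack_device__ reports the (device_type, device_id) pair; a small sketch, assuming PyTorch 1.10+ (1 is kDLCPU in the DLPack device enum):
import torch
t = torch.zeros(2, 3)
print(t.__dlpack_device__())  # (1, 0): kDLCPU, device id 0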
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Pros of TVM
- Comprehensive end-to-end compiler framework for machine learning
- Supports a wide range of hardware targets and optimization techniques
- Active development with frequent updates and a large community
Cons of TVM
- Steeper learning curve due to its complexity and extensive features
- Larger codebase and resource requirements for compilation and deployment
- May be overkill for simple projects or specific use cases
Code Comparison
DLPack (header-only library):
typedef struct {
  void* data;
  DLDevice device;
  int32_t ndim;
  DLDataType dtype;
  int64_t* shape;
  int64_t* strides;
  uint64_t byte_offset;
} DLTensor;
TVM (example of defining a computation):
import tvm
from tvm import te
n = te.var("n")
A = te.placeholder((n,), name="A")
B = te.compute(A.shape, lambda i: A[i] * 2, name="B")
s = te.create_schedule(B.op)
DLPack is a lightweight, header-only library focused on tensor data structure interoperability. TVM is a more comprehensive compiler framework for machine learning, offering advanced optimizations and targeting various hardware platforms. While DLPack is simpler and easier to integrate, TVM provides a complete solution for optimizing and deploying machine learning models across different devices.
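TVM's runtime can also consume DLPack capsules directly. A brief sketch, assuming tvm.nd.from_dlpack as provided in recent TVM releases:
import torch
import tvm
from torch.utils.dlpack import to_dlpack
x = torch.rand(3, 4)
tvm_array = tvm.nd.from_dlpack(to_dlpack(x))  # hand the buffer to TVM without copying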
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Pros of ONNX Runtime
- Comprehensive inference engine with broad hardware support
- Optimized performance for ONNX models
- Extensive API support for multiple programming languages
Cons of ONNX Runtime
- Larger codebase and more complex setup compared to DLPack
- Focused on ONNX format, less flexible for other tensor representations
Code Comparison
DLPack:
typedef struct {
  void* data;
  DLDevice device;
  int32_t ndim;
  DLDataType dtype;
  int64_t* shape;
  int64_t* strides;
  uint64_t byte_offset;
} DLTensor;
ONNX Runtime:
// Assumes env, model_path, session_options, memory_info, and node metadata are already set up
Ort::Session session(env, model_path, session_options);
std::vector<float> input_tensor_values{1.0f, 2.0f, 3.0f};
Ort::Value input_tensor = Ort::Value::CreateTensor<float>(
    memory_info, input_tensor_values.data(), input_tensor_values.size(),
    input_node_dims.data(), input_node_dims.size());
auto output_tensors = session.Run(Ort::RunOptions{nullptr}, input_node_names.data(),
                                  &input_tensor, 1, output_node_names.data(), 1);
DLPack is a lightweight tensor structure specification, while ONNX Runtime is a full-featured inference engine. DLPack focuses on providing a common tensor representation, whereas ONNX Runtime offers a complete solution for running ONNX models with optimized performance across various hardware platforms.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Pros of JAX
- Comprehensive machine learning framework with automatic differentiation
- Supports GPU/TPU acceleration and parallelism
- Integrates well with NumPy and offers familiar API
Cons of JAX
- Steeper learning curve for beginners
- More complex setup and dependencies
- Limited support for dynamic neural networks
Code Comparison
DLPack (C):
DLManagedTensor* dlpack_tensor = ...;
DLTensor* dl_tensor = &dlpack_tensor->dl_tensor;
JAX (Python):
import jax.numpy as jnp
x = jnp.array([1, 2, 3])
y = jnp.sin(x)
Key Differences
- DLPack: Lightweight tensor structure specification
- JAX: Full-featured ML framework with NumPy-like API
- DLPack: Language-agnostic, focuses on interoperability
- JAX: Python-specific, emphasizes performance and ease of use
Use Cases
- DLPack: Ideal for cross-framework tensor exchange
- JAX: Suitable for end-to-end ML pipelines, especially in research (see the interop sketch below)
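As a sketch of that cross-framework exchange, assuming jax.dlpack.from_dlpack, which accepts a DLPack capsule (newer JAX versions also accept tensor objects directly):
import jax.dlpack
import torch
from torch.utils.dlpack import to_dlpack
x = torch.arange(4, dtype=torch.float32)
jax_array = jax.dlpack.from_dlpack(to_dlpack(x))  # shares x's buffer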
Community and Ecosystem
- DLPack: Smaller community, used as a standard in various frameworks
- JAX: Growing ecosystem with active development and research applications
README
DLPack: Open In Memory Tensor Structure
Documentation: https://dmlc.github.io/dlpack/latest
DLPack is an open in-memory tensor structure for sharing tensors among frameworks. DLPack enables
- Easier sharing of operators between deep learning frameworks.
- Easier wrapping of vendor-level operator implementations, allowing collaboration when introducing new devices/ops.
- Quick swapping of backend implementations, like different versions of BLAS.
- For end users, this could bring more operators and the possibility of mixing usage between frameworks.
We do not intend to implement tensors and ops, but instead use DLPack as a common bridge to reuse tensors and ops across frameworks.
Proposal Procedure
RFC proposals are opened as issues. Each major release happens as a vote issue to make sure participants agree on the changes.
Project Structure
There are two major components so far:
- include: stabilized headers
- contrib: in-progress, unstable libraries
People
Here is a list of people who have been involved in DLPack RFC design proposals:
@soumith @piiswrong @Yangqing @naibaf7 @bhack @edgarriba @tqchen @prigoyal @zdevito