
torch_npu's Introduction

PyTorch Backend

  • Bridging and Integration: Construct a device-agnostic layer that exposes a unified interface to the upper layers while accommodating diverse hardware below, shielding PyTorch from direct awareness of individual backends.
  • Low-cost Integration: Provide a device abstraction layer that accelerates new backend integration by requiring only a few interfaces to be implemented; offer comprehensive integration documentation, reference integrations (CUDA/CPU/NPU), general test cases, and contract tests.
  • Quality Assurance: Maintain quality through CI/CD for the integration mechanism of third-party devices based on PrivateUse1.
  • Mainstream Approach: Promote the integration mechanism of third-party devices based on PrivateUse1 as the mainstream approach for integrating new backends into PyTorch in the future.
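
To make the PrivateUse1 mechanism above concrete, here is a minimal sketch of how an out-of-tree backend registers a kernel under the PrivateUse1 dispatch key. It is illustrative only, not this project's code: the choice of aten::abs and the CPU round-trip are placeholders.

    // Minimal sketch: register an out-of-tree kernel under PrivateUse1, the
    // dispatch key PyTorch reserves for third-party backends.
    #include <ATen/ATen.h>
    #include <torch/library.h>

    at::Tensor privateuse1_abs(const at::Tensor& self) {
      // Placeholder: a real backend would launch a device kernel here instead
      // of copying to the CPU, computing there, and copying back.
      return at::abs(self.to(at::kCPU)).to(self.device());
    }

    TORCH_LIBRARY_IMPL(aten, PrivateUse1, m) {
      m.impl("abs", &privateuse1_abs);
    }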

Current Progress:

  • Runtime: Completed components include Device, Stream, Event, Generator, Guard, and Allocator.
  • AMP: Registration and API have been completed.
  • Operators: Migrated NPU operator list and codegen. The next steps will involve operator simplification and codegen refactoring.
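
As a rough illustration of what the AMP item involves, the sketch below registers an autocast fallthrough for a PrivateUse1 backend. This assumes the AutocastPrivateUse1 dispatch key and is not this project's actual registration code; ops that should run in lower precision would additionally get wrappers registered under the same key.

    // Sketch only: the general shape of autocast (AMP) registration for a
    // PrivateUse1 backend. Ops outside the cast lists fall through to their
    // regular kernels.
    #include <torch/library.h>

    TORCH_LIBRARY_IMPL(_, AutocastPrivateUse1, m) {
      m.fallback(torch::CppFunction::makeFallthrough());
    }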

Next Steps:

  • Device-agnostic: Complete the device-agnostic layer and organize device-specific logic by device type (e.g., backends/cuda, backends/cpu, backends/...); it may be split out as a submodule in the future.
  • CodeGen: Enhance and refactor the codegen module to provide general, reusable code generation that covers official operators, custom operators, routing code, forward and backward binding, etc.
  • Operators: Simplify the operators; implement all factory operators (as a reference for operator implementations) as well as functional operators (for testing the third-party device integration mechanism).
  • Tests & Docs: Complete the general test case suites, the full module integration, and the API documentation.
  • Live Demo: Integrate CPU into PyTorch based on this project and provide an end-to-end integration tutorial.

Getting Started

To start using the PyTorch Backend Project, users can refer to the comprehensive documentation provided. This includes detailed guides on setting up the environment, integrating new devices, and best practices for optimizing performance.

Project Structure

    .
    ├── backends
    │   ├── fake               // Dummy backend: provides weak definitions of every symbol csrc needs, so the demo can run without a real backend implementing them all (see the weak-symbol sketch after this tree)
    │   ├── npu                // A real backend: provides the APIs and structures that are strongly tied to this specific backend
    │   ├── cuda               // A real backend: to be implemented later
    │   └── ...
    ├── cmake
    ├── codegen                // Code generation: registration for forward and backward, backward implementations, backward bindings, custom operator routing, rerouting, etc.
    │   ├── autograd
    │   │   └── templates      // General template
    │   └── templates
    ├── csrc                   // C++ implementations tied to PyTorch itself, independent of any specific backend; in principle it contains only calls into backend interfaces
    │   ├── api                // libtorch functionalities
    │   ├── aten               // Code generation: contains only wrappers and PyTorch operator registration; Tensor, Storage, and Serialization may move here later, as all three relate to Tensor logic
    │   ├── backend            // General Implementation of PyTorch API
    │   ├── core               // Common Utils
    │   │   ├── allocator
    │   │   ├── generator
    │   │   └── guard
    │   └── distributed        // Distributed
    ├── docs                   // All docs: C++ API, Python API and E2E tutorials
    │   ├── cpp
    │   │   └── source
    │   └── source
    ├── test                   // General test case suites: C++ and Python
    │   └── cpp
    │       ├── backend
    │       ├── common
    │       └── core
    ├── third_party
    │   └── googletest
    └── torch_backend          // Python interface implementation for PyTorch
        ├── backend
        ├── csrc               // Python & C++ binding
        │   ├── backend        // Python bindings for all low-level capabilities needed to be exposed to Python
        │   └── core           // General capabilities, only provided for Python
        └── meta               // Meta operator registration
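
The weak-symbol idea behind backends/fake is sketched below under stated assumptions: the symbol name backend_device_count is hypothetical, and the weak attribute shown is GCC/Clang syntax. csrc links against such weak defaults, and a real backend overrides them with strong definitions.

    // Sketch of the weak-symbol mechanism behind backends/fake.
    #include <cstdint>

    // Fake backend: a weak default so csrc-level code can link and run even
    // when no real backend provides this symbol. A real backend defines the
    // same symbol without the weak attribute and wins at link time.
    extern "C" __attribute__((weak)) int32_t backend_device_count() {
      return 1;  // pretend a single virtual device exists
    }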

Documents

API Documents

C++ API

License

PyTorch Backend has a BSD-style license, as found in the LICENSE file.

torch_npu's People

Contributors

fffrog, hippocookie, shink, zhenbin-8, hipudding, dependabot[bot]

Watchers

ChenRui, Yikun Jiang, JQ, wangxiyuan

torch_npu's Issues

Task Assignment

0. Autoload torch.npu & torch_npu.npu
1. NPUEvent.h -> 元昊
2. StreamGuard.h -> 逢春
3. NPUGuardImpl.cpp -> 元昊
4. HasCompatibleShallowCopyType.cpp -> warning, 逢春
5. VariableFallbackKernel.cpp -> 李佳伟 (done)
6. BinaryOps.cpp -> 李佳伟 (done)
7. Remove torch_npu/optim -> 泽升
8. Remove testing and build a new test framework -> 泽升
9. torch_npu/utils -> 李佳伟
10. Split torch_npu/csrc/Module.cpp into separate files (torch_npu/csrc/.../xxx, torch_npu/npu/xxx) -> ALL
11. Remove NPUPluggableAllocator.h -> 贞斌
12. Create a new repository for npu + op-plugin -> 李佳伟
13. Documentation -> ALL
14. Unify namespaces
15. Move DeviceProp down into csrc/npu (the XPU implementation can serve as a reference) -> 元昊
16. NPU logging
17. Refactor AutocastMode.cpp (upstream provides a default op list; compare the differences, and if they match, the macro can be called directly) (too many upstream entries; not feasible for now) -> 逢春

Roadmap

  • [元昊] Autoload torch.npu & torch_npu.npu
  • [元昊] Split the internal registration in NPUGuardImpl.cpp #47
  • [元昊] Move DeviceProp down into csrc/npu (the XPU implementation can serve as a reference)
  • [元昊] Remove AclInterface.h #46
  • [元昊] Add a .pyi stub file for torch_backend._C (code completion) #79
  • [逢春] Refactor StreamGuard.h as needed
  • [逢春] Locate the HasCompatibleShallowCopyType.cpp warning (this can be made device-agnostic and called without going through the dispatcher)
  • [逢春] Refactor AutocastMode.cpp (upstream provides a default op list; compare the differences, and if they match, the macro can be called directly)
  • [逢春] Refactor NPU init & finalize
  • [泽升] Remove torch_npu/optim
  • [泽升] Remove testing and build a new test framework
  • [泽升] Namespace design
  • [贞斌] Remove NPUPluggableAllocator.h
  • [贞斌] Split the memory-related parts out of torch_npu/csrc/Module.cpp
  • [贞斌] Redesign NPU logging; print the error stack on ERROR
  • [佳伟] Refactor aten
  • [佳伟] Refactor torch_npu/utils
  • [佳伟] Create a new repository for npu + op-plugin
  • [佳伟] Move NPUGuard.h to csrc/npu
  • [贞斌] Device-specific autocast APIs, compatible with the latest upstream APIs
  • [TBD] The DeviceGuard API coverage is incomplete; consider aligning with CUDA later (torch.Stream("npu:0"))
  • [泽升] README.md
  • [泽升] LICENSE update
  • [泽升] Remove dead code from torch_npu/utils
  • [ALL] Documentation
  • [贞斌] The contents of torch_npu/csrc/Module.cpp are not fully split out yet
  • [TBD] The contents of torch_npu/npu/utils.py are not split out yet
  • [逢春] Remove maybe_initialize_npu and replace it with device_lazy_init
  • [TBD] Remove the npu prefix from torch_npu
  • [TBD] Add _npu_is_bf16_supported
  • [TBD] Complete the macro PR for CachingAllocator / NPUCachingAllocator / NPUCachingHostAllocator / NPUCachingAllocatorHelper
  • [TBD] Add an API that provides the device name

CI Failure

| Type | Description | Occurrence Count | Example PR |
| --- | --- | --- | --- |
| Refactoring | Turn Allocator::allocate into non-const; the derived class's override is not updated. | 1 | #120969 |
| Refactoring | Use DeviceIndex instead of int in CUDA wrappers; the derived class's override is not updated. | 1 | #119142 |
| Refactoring | Move new trace utils from source to header, so some symbols can no longer be found. | 1 | #114367 |
| Refactoring | Migrate to getCvar* functions for env variable checking, so a function name can no longer be found. | 1 | #113797 |
| New Features | Add support for new data types; a data type assert fails. | 3 | #107586, #116594 |
| New Features | Add a function to materialize COW storages, which adds the pure virtual function Allocator::copy_data; the derived class does not implement it. | 2 | #117053, #113396 |
| Refactoring | Make the macro used with AMP more generic. | 1 | #124050 |
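
To make the Allocator::copy_data row concrete, here is a sketch (not the project's allocator) of the kind of override an out-of-tree allocator had to add once upstream introduced the pure virtual c10::Allocator::copy_data in #117053; exact signatures shift across PyTorch versions, which is precisely what this table tracks.

    // Sketch only: an out-of-tree allocator that compiles again after upstream
    // added the pure virtual c10::Allocator::copy_data.
    #include <c10/core/Allocator.h>
    #include <cstring>

    struct ExampleBackendAllocator : public c10::Allocator {
      c10::DataPtr allocate(size_t nbytes) override {
        (void)nbytes;  // a real backend would call its runtime's malloc here
        return c10::DataPtr(nullptr, c10::Device(c10::DeviceType::PrivateUse1, 0));
      }
      // The override whose absence caused the CI failures listed above.
      void copy_data(void* dest, const void* src, std::size_t count) const override {
        std::memcpy(dest, src, count);  // a real backend would use a device memcpy
      }
    };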

[Device][DeviceGuard] Solutions for refactoring

1. NPUFunctions.h

  1. Refer to the CUDA implementation at https://github.com/pytorch/pytorch/blob/main/c10/cuda/CUDAFunctions.cpp and keep the API and implementation logic as consistent with it as possible.

  2. Extract the differences between the ACL and CUDA APIs into a separate layer, so that NPUFunctions and CUDAFunctions stay as closely aligned as possible.

    For example, aclrtGetDevice() and cudaGetDevice() differ: the latter directly returns the current device, whereas the former may return the error code ACL_ERROR_RT_CONTEXT_NULL, which is also a normal case and simply means that SetDevice has not been called and no Context has been created yet (see the sketch after this list).
    CANN documentation: https://www.hiascend.com/document/detail/zh/CANNCommunityEdition/80RC2alpha003/apiref/appdevgapi/aclcppdevg_03_0040.html
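
A minimal sketch of the wrapper this implies is shown below. The helper name GetDevice is ours, and ACL_SUCCESS is assumed to be ACL's success code; only aclrtGetDevice() and ACL_ERROR_RT_CONTEXT_NULL are taken from the CANN documentation cited above.

    // Sketch only: give aclrtGetDevice() the same "always succeeds" feel as
    // cudaGetDevice(). ACL_ERROR_RT_CONTEXT_NULL just means SetDevice has not
    // been called yet, so report device 0 instead of propagating an error.
    #include <acl/acl.h>

    aclError GetDevice(int32_t* device) {
      aclError err = aclrtGetDevice(device);
      if (err == ACL_ERROR_RT_CONTEXT_NULL) {
        *device = 0;          // no context yet: behave like CUDA's default device
        return ACL_SUCCESS;
      }
      return err;             // genuine failures are still surfaced to the caller
    }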

Bug List

  • [TBD] TORCH_SHOW_CPP_STACKTRACES=1: NPU tensor pin_memory core dump
  • [TBD] import torch_npu warning
  • [元昊] CI runner crashed
  • [TBD] torch.npu.FloatStorage creates CPU storage rather than NPU storage (reason: __module__ name)

Roadmap


  • Stream, Event -> zhenbin-8

  • PyTyte -> zhenbin-8

  • Allocator -> zhenbin-8

  • Device, DeviceGuard -> shink

  • CI -> shink

  • doc preview -> shink

  • MACROS, Exception -> zong

  • Generator, lazy_init -> zong

  • generate_code.sh -> fffrog

  • codegen -> fffrog

  • cmake refactor

  • NPUCachingHostAllocator.h

  • Move headers and libs into correct directory when installed.

import torch_backend warning

  1. Locally built torch + locally built torch_backend: the warning appears.
  2. torch installed from pip + locally built torch_backend: no warning.
  3. Regardless of which torch is used + our torch_backend: the warning appears.

Resources not released on exit

Steps to reproduce:

import torch
import torch_npu

s = torch.npu.Stream()
print(s)
x = torch.randn(2, 2).npu()
y = torch.randn(2, 2).npu()

with torch.npu.stream(s):   
    z = x.mm(y)

print(z)

After pressing Ctrl+C, the following error is reported:

/home/hua/miniconda3/envs/pt/lib/python3.10/multiprocessing/resource_tracker.py:224: UserWarning: resource_tracker: There appear to be 41 leaked semaphore objects to clean up at shutdown
warnings.warn('resource_tracker: There appear to be %d '

Running the file directly with python does not produce this error.

Keep up to date

| Type | Description | PR |
| --- | --- | --- |
| Refactoring | Re-implement pin_memory to be device-agnostic by leveraging the Accelerator concept. | #126376 |
| Refactoring | Generalize custom_fwd & custom_bwd to be device-agnostic. | #126531 |
| Refactoring | Refactor autocast C++ APIs to be device-agnostic. | #124359 |
| Refactoring | Refactor autocast Python APIs. | #124479 |
| Refactoring | Refactor lazy init to be device-agnostic. | #118846 |
| Refactoring | Generalize the host allocator to be device-agnostic. | #123079 |
