coreylammie/MemTorch: 1.1.2 Release
Description
- C++ and CUDA bindings for
memtorch.bh.crossbar.Tile.tile_matmul.
Using an NVIDIA GeForce GTX 1080, a tile shape of (25, 25), and two tensors of size (500, 500), the runtime of tile_matmul without quantization support is reduced by 2.45x and 5.48x, for CPU-bound and GPU-bound operation, respectively. With an ADC resolution of 4 bits and an overflow rate of 0.0, the runtime of tile_matmul with quantization support is reduced by 2.30x and 105.27x, for CPU-bound and GPU-bound operation, respectively.
Eigenintegration with C++ and CUDA bindings.- Additional unit tests.
- Modularized C++ and CUDA
quantizebindings. - Enhanced functionality of
naive_progamand added additional input arguments to dictate logic for stuck devices.
- Removed debugging code from
naive_progam.
Files
coreylammie/MemTorch-v1.1.2.zip
Files
(6.7 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:5bcd8dfc8bea4771470e3ac89e407596
|
6.7 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/coreylammie/MemTorch/tree/v1.1.2 (URL)