The goal of torchvisionlib is to provide access to C++ operations implemented in torchvision. It provides plain R access to some of those C++ operations but, most importantly, it provides full support for JIT operators defined in torchvision, allowing us to load ‘scripted’ object detection and image segmentation models.
torchvisionlib can be installed from CRAN with:
install.packages("torchvisionlib")
You can also install the development version of torchvisionlib from GitHub with:
# install.packages("devtools")
devtools::install_github("mlverse/torchvisionlib")
Suppose that we want to load an object detection model implemented in torchvision. First, in Python, we JIT script the model and then save it:
import torch
import torchvision
model = torchvision.models.detection.fasterrcnn_mobilenet_v3_large_320_fpn(pretrained=True)
model.eval()

jit_model = torch.jit.script(model)
torch.jit.save(jit_model, "fasterrcnn_mobilenet_v3_large_320_fpn.pt")
We can then load this model in R. Simply loading torchvisionlib will register all JIT operators, and we can use torch::jit_load().
library(torchvisionlib)
model <- torch::jit_load("fasterrcnn_mobilenet_v3_large_320_fpn.pt")
model
#> An `nn_module` containing 19,386,354 parameters.
#>
#> ── Modules ─────────────────────────────────────────────────────────────────────
#> • transform: <script_module> #0 parameters
#> • backbone: <script_module> #4,414,944 parameters
#> • rpn: <script_module> #609,355 parameters
#> • roi_heads: <script_module> #14,362,055 parameters
You can then use this model to make predictions or even for fine-tuning.