On-Device-Learning for MicroControllers

Applications and Software for Efficient ODL on MCUs

Ultra-low power devices are now powered with on-board processing resources to extract meaninful information from sensor data using Deep Learning models. These models are trained using powerfull high-performance machines and then frozen to be deployed on scale on low-power Micro-Controller platforms.

This line of research tackles the common train-once-deploy-everywhere scheme by investigating if deep learning models can be updated directly on-device after deployment, in short On-Device Leaning (ODL).

PULPTrain-lib: Bringing ODL to MicroControllers

The library collects deep learning primitives for layer-wise forward and backward passes. the target platform is the PULP platform: a MCU-class architecture featuring multiple RISC-V cores with a customized ISA. Our efficient software solution exploits parallelization, loop unrolling and vectorized MAC instrunctions, with full- and half-precision floating point formats. The library is open-source on GitHub.

Continual Learning on MCUs

We formulated a cost model to bring continual learning with latent replays on multi-core MCUs. To reduce the memory requirement, we proposed to a low-bitwidth quantization for the replay tensors up to 7 bits for a lossless solution (missing reference). Thanks to our system architecture and software stack, we could scale the final accuracy after ODL without forgetting depending on the memory size to store the replay buffer.

References

2023

  1. Reduced precision floating-point optimization for Deep Neural Network On-Device Learning on microcontrollers
    Davide Nadalini, Manuele Rusci, Luca Benini, and 1 more author
    Future Generation Computer Systems, 2023

2022

  1. PULP-TrainLib: Enabling on-device training for RISC-V multi-core MCUs through performance-driven Autotuning
    Davide Nadalini, Manuele Rusci, Giuseppe Tagliavini, and 3 more authors
    In International Conference on Embedded Computer Systems, 2022
  2. Towards on-device domain adaptation for noise-robust keyword spotting
    Cristian Cioflan, Lukas Cavigelli, Manuele Rusci, and 2 more authors
    In 2022 IEEE 4th International Conference on Artificial Intelligence Circuits and Systems (AICAS), 2022

2021

  1. A tinyml platform for on-device continual learning with quantized latent replays
    Leonardo Ravaglia, Manuele Rusci, Davide Nadalini, and 3 more authors
    IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 2021

2020

  1. Memory-latency-accuracy trade-offs for continual learning on a risc-v extreme-edge node
    Leonardo Ravaglia, Manuele Rusci, Alessandro Capotondi, and 5 more authors
    In 2020 IEEE Workshop on Signal Processing Systems (SiPS), 2020