Akademska digitalna zbirka SLovenije - logo
E-viri
Celotno besedilo
Recenzirano
  • AutoML for Architecting Eff...
    Cai, Han; Lin, Ji; Lin, Yujun; Liu, Zhijian; Wang, Kuan; Wang, Tianzhe; Zhu, Ligeng; Han, Song

    IEEE MICRO, 2020-Jan.-Feb.-1, 2020-1-1, 20200101, Letnik: 40, Številka: 1
    Journal Article

    Efficient deep learning inference requires algorithm and hardware codesign to enable specialization: we usually need to change the algorithm to reduce memory footprint and improve energy efficiency. However, the extra degree of freedom from the neural architecture design makes the design space much larger: it is not only about designing the hardware architecture but also codesigning the neural architecture to fit the hardware architecture. It is difficult for human engineers to exhaust the design space by heuristics. We propose design automation techniques for architecting efficient neural networks given a target hardware platform. We investigate automatically designing specialized and fast models, auto channel pruning, and auto mixed-precision quantization. We demonstrate that such learning-based, automated design achieves superior performance and efficiency than the rule-based human design. Moreover, we shorten the design cycle by 200× than previous work, so that we can afford to design specialized neural network models for different hardware platforms.