Akademska digitalna zbirka SLovenije - logo

Search results

Basic search    Expert search   

Currently you are NOT authorised to access e-resources SI consortium. For full access, REGISTER.

1 2 3 4
hits: 38
1.
  • Hardware Acceleration of Sp... Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
    Dave, Shail; Baghdadi, Riyadh; Nowatzki, Tony ... Proceedings of the IEEE, 10/2021, Volume: 109, Issue: 10
    Journal Article
    Peer reviewed
    Open access

    Machine learning (ML) models are widely used in many important domains. For efficiently processing these computational- and memory-intensive applications, tensors of these overparameterized models ...
Full text
Available for: IJS, NUK, UL

PDF
2.
  • Learning to optimize halide... Learning to optimize halide with tree search and random programs
    Adams, Andrew; Ma, Karima; Anderson, Luke ... ACM transactions on graphics, 07/2019, Volume: 38, Issue: 4
    Journal Article
    Peer reviewed
    Open access

    We present a new algorithm to automatically schedule Halide programs for high-performance image processing and deep learning. We significantly improve upon the performance of previous methods, which ...
Full text
Available for: NUK, UL

PDF
3.
  • A Hybrid Machine Learning M... A Hybrid Machine Learning Model for Code Optimization
    Hakimi, Yacine; Baghdadi, Riyadh; Challal, Yacine International journal of parallel programming, 12/2023, Volume: 51, Issue: 6
    Journal Article
    Peer reviewed
    Open access

    The complexity of programming modern heterogeneous systems raises huge challenges. Over the past two decades, researchers have aimed to alleviate these difficulties by employing classical Machine ...
Full text
Available for: EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
4.
  • GraphIt: a high-performance... GraphIt: a high-performance graph DSL
    Zhang, Yunming; Yang, Mengjiao; Baghdadi, Riyadh ... Proceedings of ACM on programming languages, 11/2018, Volume: 2, Issue: OOPSLA
    Journal Article
    Peer reviewed
    Open access

    The performance bottlenecks of graph applications depend not only on the algorithm and the underlying hardware, but also on the size and structure of the input graph. As a result, programmers must ...
Full text
Available for: NUK, UL, UM, UPUK

PDF
5.
  • Variational study of two-nu... Variational study of two-nucleon systems with lattice QCD
    Amarasinghe, Saman; Baghdadi, Riyadh; Davoudi, Zohreh ... Physical review. D, 05/2023, Volume: 107, Issue: 9
    Journal Article
    Peer reviewed
    Open access

    The low-energy spectrum and scattering of two-nucleon systems are studied with lattice quantum chromodynamics using a variational approach. A wide range of interpolating operators are used: dibaryon ...
Full text
Available for: CMK, CTK, FMFMET, IJS, NUK, PNG, UM
6.
  • Permutation Flowshop Schedu... Permutation Flowshop Scheduling Problem Considering Learning, Deteriorating Effects and Flexible Maintenance
    Touafek, Nesrine; Ladj, Asma; Tayeb, Fatima Benbouzid-Si ... Procedia computer science, 2022, 2022-00-00, Volume: 207
    Journal Article
    Peer reviewed
    Open access

    Availability constraints, machine condition as well as human behavior phenomena were recently introduced in the study of scheduling problems in order to get closer to the industrial reality. In this ...
Full text
Available for: GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
7.
  • A Common Backend for Hardwa... A Common Backend for Hardware Acceleration on FPGA
    Del Sozzo, Emanuele; Baghdadi, Riyadh; Amarasinghe, Saman ... 2017 IEEE International Conference on Computer Design (ICCD), 2017-Nov.
    Conference Proceeding
    Open access

    Field Programmable Gate Arrays (FPGAs) are configurable integrated circuits able to provide a good trade-off in terms of performance, power consumption, and flexibility with respect to other ...
Full text
Available for: IJS, NUK, UL, UM

PDF
8.
  • PENCIL: A Platform-Neutral ... PENCIL: A Platform-Neutral Compute Intermediate Language for Accelerator Programming
    Baghdadi, Riyadh; Beaugnon, Ulysse; Cohen, Albert ... 2015 International Conference on Parallel Architecture and Compilation (PACT), 10/2015
    Conference Proceeding
    Open access

    Programming accelerators such as GPUs with low-level APIs and languages such as OpenCL and CUDA is difficult, error-prone, and not performance-portable. Automatic parallelization and domain specific ...
Full text
Available for: IJS, NUK, UL, UM

PDF
9.
  • Q-gym Q-gym
    Fu, Cheng; Huang, Hanxian; Wasti, Bram ... Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 10/2022
    Conference Proceeding
    Open access

    The high computation cost is one of the key bottlenecks for adopting deep neural networks (DNNs) in different hardware. When client data are sensitive, privacy-preserving DNN evaluation method, such ...
Full text
Available for: NUK, UL
10.
  • Seq: a high-performance lan... Seq: a high-performance language for bioinformatics
    Shajii, Ariya; Numanagić, Ibrahim; Baghdadi, Riyadh ... Proceedings of ACM on programming languages, 10/2019, Volume: 3, Issue: OOPSLA
    Journal Article
    Peer reviewed
    Open access

    The scope and scale of biological data are increasing at an exponential rate, as technologies like next-generation sequencing are becoming radically cheaper and more prevalent. Over the last two ...
Full text
Available for: NUK, UL, UM, UPUK

PDF
1 2 3 4
hits: 38

Load filters