Academic Digital Collection of Slovenia

Search results

Results: 23
1.
  • Bit-Plane Compression: Transforming Data for Better Compression in Many-Core Architectures
    Kim, Jungrae; Sullivan, Michael; Choukse, Esha ... 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA), 06/2016
    Conference Proceeding

    As key applications become more data-intensive and the computational throughput of processors increases, the amount of data to be transferred in modern memory subsystems grows. Increasing physical ...
Full text available to: IJS, NUK, UL, UM
2.
  • Buddy compression
    Choukse, Esha; Sullivan, Michael B.; O'Connor, Mike ... 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA), 05/2020
    Conference Proceeding
    Open access

    GPUs accelerate high-throughput applications, which require orders-of-magnitude higher memory bandwidth than traditional CPU-only systems. However, the capacity of such high-bandwidth memory tends to ...
Full text available to: IJS, NUK, UL, UM

3.
  • CompressPoints: An Evaluation Methodology for Compressed Memory Systems
    Choukse, Esha; Erez, Mattan; Alameldeen, Alaa. IEEE Computer Architecture Letters, 07/2018, Volume 17, Issue 2
    Journal Article
    Peer reviewed

    Current memory technology has hit a wall trying to scale to meet the increasing demands of modern client and datacenter systems. Data compression is a promising solution to this problem. Several ...
Full text available to: IJS, NUK, UL
4.
  • Towards Improved Power Management in Cloud GPUs
    Patel, Pratyush; Gong, Zibo; Rizvi, Syeda ... IEEE Computer Architecture Letters, 07/2023, Volume 22, Issue 2
    Journal Article
    Peer reviewed

    As modern server GPUs are increasingly power intensive, better power management mechanisms can significantly reduce the power consumption, capital costs, and carbon emissions in large cloud ...
Full text available to: IJS, NUK, UL
5.
  • Compresso
    Choukse, Esha; Erez, Mattan; Alameldeen, Alaa R. 2018 51st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 10/2018
    Conference Proceeding

    Today, larger memory capacity and higher memory bandwidth are required for better performance and energy efficiency for many important client and datacenter applications. Hardware memory compression ...
Full text available to: IJS, NUK, UL, UM
6.
  • Overclocking in Immersion-Cooled Datacenters
    Misra, Pulkit A.; Manousakis, Ioannis; Choukse, Esha ... IEEE Micro, 07/2022, Volume 42, Issue 4
    Journal Article
    Peer reviewed

    Large cloud providers are starting to leverage liquid cooling for an increasing number of workloads. Liquid cooling enables providers to overclock server components, but they must tradeoff the ...
Full text available to: IJS, NUK, UL
7.
  • Memory Compression for Higher Effective Capacity and Bandwidth
    Choukse, Esha. 01/2019
    Dissertation

    Many important client and data-center applications need large memory capacity and high memory bandwidth to achieve their performance and energy efficiency goals. With the increase in data-centered ...
Full text
8.
  • Translation-Optimized Memory Compression for Capacity
    Panwar, Gagandeep; Laghari, Muhammad; Bears, David ... 2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO), 10/2022
    Conference Proceeding

    The demand for memory is ever increasing. Many prior works have explored hardware memory compression to increase effective memory capacity. However, prior works compress and pack/migrate data at a ...
Full text available to: IJS, NUK, UL, UM
9.
  • PruneTrain
    Lym, Sangkug; Choukse, Esha; Zangeneh, Siavash ... Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 11/2019
    Conference Proceeding

    State-of-the-art convolutional neural networks (CNNs) used in vision applications have large models with numerous weights. Training these models is very compute- and memory-resource intensive. Much ...
Full text available to: NUK, UL

10.
  • DynamoLLM: Designing LLM Inference Clusters for Performance and Energy Efficiency
    Stojkovic, Jovan; Zhang, Chaojie; Goiri, Íñigo ... arXiv.org, 08/2024
    Paper, Journal Article
    Open access

    The rapid evolution and widespread adoption of generative large language models (LLMs) have made them a pivotal workload in various applications. Today, LLM inference clusters receive a large number ...
Full text available to: NUK, UL, UM, UPUK