Firedrake: Automating the Finite Element Method by Composing Abstractions Rathgeber, Florian; Ham, David A.; Mitchell, Lawrence ...
ACM Transactions on Mathematical Software,
01/2017, Volume 43, Issue 3
Journal Article
Peer reviewed
Open access
Firedrake is a new tool for automating the numerical solution of partial differential equations. Firedrake adopts the domain-specific language for the finite element method of the FEniCS project, but with a pure Python runtime-only implementation centered on the composition of several existing and new abstractions for particular aspects of scientific computing. The result is a more complete separation of concerns that eases the incorporation of separate contributions from computer scientists, numerical analysts, and application specialists. These contributions may add functionality or improve performance.
Firedrake benefits from automatically applying new optimizations. This includes factorizing mixed function spaces, transforming and vectorizing inner loops, and intrinsically supporting block matrix operations. Importantly, Firedrake presents a simple public API for escaping the UFL abstraction. This allows users to implement common operations that fall outside of pure variational formulations, such as flux limiters.
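As a concrete illustration of the kind of non-variational, pointwise operation mentioned above, the following is a minimal C sketch of a minmod slope limiter applied to cell averages. It is illustrative only: it is not Firedrake's Python API, and the 1-D array of cell averages is a stand-in for Firedrake's own data structures.

/* Illustrative only: a 1-D minmod slope limiter over cell averages,
 * the kind of non-variational, pointwise/stencil operation referred to
 * in the abstract above. Plain C, not Firedrake's escape-hatch API. */
#include <stdio.h>
#include <math.h>

static double minmod(double a, double b) {
    /* Return the argument of smaller magnitude if signs agree, else 0. */
    if (a * b <= 0.0) return 0.0;
    return (fabs(a) < fabs(b)) ? a : b;
}

int main(void) {
    enum { N = 8 };
    double u[N] = {0, 0, 1, 3, 2, 2, 0, 0};  /* cell averages */
    double slope[N] = {0};

    /* Limited slope per interior cell: minmod of the one-sided differences. */
    for (int i = 1; i < N - 1; ++i)
        slope[i] = minmod(u[i] - u[i - 1], u[i + 1] - u[i]);

    for (int i = 0; i < N; ++i)
        printf("cell %d: average %.2f, limited slope %.2f\n", i, u[i], slope[i]);
    return 0;
}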
Reliable Actors with Retry Orchestration Tardieu, Olivier; Grove, David; Bercea, Gheorghe-Teodor ...
Proceedings of the ACM on Programming Languages,
06/2023, Volume 7, Issue PLDI
Journal Article
Peer reviewed
Open access
Cloud developers have to build applications that are resilient to failures and interruptions. We advocate for a fault-tolerant programming model for the cloud based on actors, retry orchestration, ...and tail calls. This model builds upon persistent data stores and message queues readily available on the cloud. Retry orchestration not only guarantees that (1) failed actor invocations will be retried but also that (2) completed invocations are never repeated and (3) it preserves a strict happen-before relationship across failures within call stacks. Tail calls can break complex tasks into simple steps to minimize re-execution during recovery. We review key application patterns and failure scenarios. We formalize a process calculus to precisely capture the mechanisms of fault tolerance in this model. We briefly describe our implementation. Using an application inspired by a typical enterprise scenario, we validate the functional correctness of our implementation and assess the impact of fault preparedness and recovery on performance.
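The following toy C sketch illustrates only guarantee (2) above, that completed invocations are never repeated: a completion record is consulted before a retried invocation re-executes. It is not the paper's system; the in-memory table stands in for the persistent data store, and all names are hypothetical.

/* Toy sketch: at-most-once completion via a completion record.
 * An in-memory table stands in for the durable store the model assumes. */
#include <stdio.h>
#include <stdbool.h>

#define MAX_INVOCATIONS 16

typedef struct {
    bool completed;
    int  result;
} CompletionRecord;

static CompletionRecord store[MAX_INVOCATIONS];  /* stand-in for a persistent store */

/* The actor method being invoked; idempotency comes from the record, not from here. */
static int charge_account(int amount) { return amount * 2; }

/* Invoke-with-retry: if a completion record exists, replay its result instead of
 * re-executing; otherwise execute, persist the result, then return it. */
static int invoke(int invocation_id, int amount) {
    if (store[invocation_id].completed)
        return store[invocation_id].result;     /* never repeat completed work */
    int r = charge_account(amount);
    store[invocation_id].result = r;            /* record before acknowledging */
    store[invocation_id].completed = true;
    return r;
}

int main(void) {
    printf("first attempt:   %d\n", invoke(3, 21));  /* executes the method      */
    printf("retried attempt: %d\n", invoke(3, 21));  /* replays the stored result */
    return 0;
}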
We present a generic algorithm for numbering and then efficiently iterating over the data values attached to an extruded mesh. An extruded mesh is formed by replicating an existing mesh, assumed to be unstructured, to form layers of prismatic cells. Applications of extruded meshes include, but are not limited to, the representation of three-dimensional high-aspect-ratio domains employed by geophysical finite element simulations. These meshes are structured in the extruded direction. The algorithm presented here exploits this structure to avoid the performance penalty traditionally associated with unstructured meshes. We evaluate the implementation of this algorithm in the Firedrake finite element system on a range of low-compute-intensity operations, which constitute worst cases for data layout performance exploration. The experiments show that having structure along the extruded direction enables the cost of the indirect data accesses to be amortized after 10–20 layers as long as the underlying mesh is well ordered. We characterize the resulting spatial and temporal reuse in a representative set of both continuous-Galerkin and discontinuous-Galerkin discretizations. On meshes with realistic numbers of layers the performance achieved is between 70% and 90% of a theoretical hardware-specific limit.
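A minimal C sketch of the layout idea described above, not Firedrake's actual numbering code: per base-mesh cell, the indirect lookups are done once, and the column of extruded layers is then walked with a constant per-layer offset, amortizing the indirection over the layers. The map, strides, and sizes are made up for illustration.

/* Sketch: gather base-cell dof indices once (unstructured, indirect),
 * then iterate the extruded layers directly with a constant stride. */
#include <stdio.h>

#define BASE_CELLS    4
#define DOFS_PER_CELL 3
#define LAYERS        10

int main(void) {
    /* Indirection map: dof indices of each base cell at layer 0. */
    int cell_to_dof[BASE_CELLS][DOFS_PER_CELL] = {
        {0, 1, 2}, {1, 2, 3}, {2, 3, 4}, {3, 4, 5}
    };
    int dofs_per_layer = 6;                    /* constant stride between layers */
    double data[6 * LAYERS];
    for (int i = 0; i < 6 * LAYERS; ++i) data[i] = 1.0;

    double total = 0.0;
    for (int c = 0; c < BASE_CELLS; ++c) {
        int base[DOFS_PER_CELL];
        for (int d = 0; d < DOFS_PER_CELL; ++d)
            base[d] = cell_to_dof[c][d];       /* indirect access, done once per column */
        for (int l = 0; l < LAYERS; ++l)       /* structured, direct walk up the column */
            for (int d = 0; d < DOFS_PER_CELL; ++d)
                total += data[base[d] + l * dofs_per_layer];
    }
    printf("sum over extruded columns: %.1f\n", total);
    return 0;
}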
We study and systematically evaluate a class of composable code transformations that improve arithmetic intensity in local assembly operations, which represent a significant fraction of the execution time in finite element methods. Optimizing their performance is a challenging issue: even though affine loop nests are generally present, the short trip counts and the complexity of the mathematical expressions, which vary among different problems, make it hard to determine an optimal sequence of successful transformations. Our investigation has resulted in the implementation of a compiler (called COFFEE) for local assembly kernels, fully integrated with a framework for developing finite element methods. The compiler manipulates abstract syntax trees generated from a domain-specific language, introducing domain-aware optimizations for instruction-level parallelism and register locality, and finally produces C code including vector SIMD intrinsics. Experiments using a range of real-world finite element problems of increasing complexity show that significant performance improvement is achieved. The generality of the approach and the applicability of the proposed code transformations to other domains are also discussed.
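The C sketch below illustrates one transformation of the kind described above, hoisting subexpressions that are invariant in the inner loops of an element-matrix assembly nest. It is hand-written for illustration, not COFFEE's generated code, and the quadrature weights, Jacobian factor, and tabulated basis values are placeholders.

/* Illustrative local assembly kernel with loop-invariant code motion:
 * w[q]*detJ is hoisted out of the i- and j-loops, and scale*phi[q][i]
 * out of the j-loop, leaving a single multiply-add innermost. */
#include <stdio.h>

#define NQ 4   /* quadrature points */
#define NB 3   /* basis functions   */

int main(void) {
    double w[NQ]       = {0.25, 0.25, 0.25, 0.25};             /* quadrature weights      */
    double detJ        = 2.0;                                  /* element Jacobian factor */
    double phi[NQ][NB] = {{1,2,3},{2,3,4},{3,4,5},{4,5,6}};    /* tabulated basis values  */
    double A[NB][NB]   = {{0}};

    for (int q = 0; q < NQ; ++q) {
        double scale = w[q] * detJ;               /* hoisted: invariant in i and j */
        for (int i = 0; i < NB; ++i) {
            double phi_i = scale * phi[q][i];     /* hoisted: invariant in j */
            for (int j = 0; j < NB; ++j)
                A[i][j] += phi_i * phi[q][j];     /* innermost loop: one multiply-add */
        }
    }
    for (int i = 0; i < NB; ++i)
        printf("A[%d][0] = %.2f\n", i, A[i][0]);
    return 0;
}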
Efficient Fork-Join on GPUs Through Warp Specialization Jacob, Arpith Chacko; Eichenberger, Alexandre E; Sung, Hyojin ...
2017 IEEE 24th International Conference on High Performance Computing (HiPC)
Conference Proceeding
Graphics Processing Units (GPUs) are increasingly used to accelerate portions of general-purpose applications. Higher-level language extensions have been proposed to help non-experts bridge the gap between the host and GPU threading models. Recent updates to the OpenMP standard allow a user to parallelize code on a GPU using the well-known fork-join programming model for CPUs. Mapping this model to the architecturally visible threading model of typical GPUs has been challenging. In this work we propose a novel approach using the technique of Warp Specialization. We show how to specialize one warp (a unit of 32 GPU threads) to handle sequential code on a GPU. When this master warp reaches a user-specified parallel region, it awakens unused GPU warps to collectively execute the parallel code. Based on this method, we have implemented a Clang-based, OpenMP 4.5 compliant, open-source compiler for GPUs. Our work achieves a 3.6x (and up to 32x) performance improvement over a baseline that does not exploit fork-join parallelism on an NVIDIA K40m GPU across a set of 25 kernels. Compared to state-of-the-art compilers (Clang-ykt, GCC-OpenMP, GCC-OpenACC) our work is 2.1x to 7.6x faster. Our proposed technique is simpler to implement, robust, and performant.
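The following is standard OpenMP 4.5 user code of the shape the paper targets: sequential code inside a target region followed by a fork into a parallel loop. It shows only the user-visible fork-join pattern that such a compiler must map to GPU warps; the warp-specialization machinery itself is internal to the compiler and not shown.

/* Fork-join inside a target region: the sequential part runs on a single
 * (master) thread on the device, then the parallel loop forks workers. */
#include <stdio.h>

#define N 1024

int main(void) {
    double a[N], b[N], scale = 0.0;
    for (int i = 0; i < N; ++i) { a[i] = i; b[i] = 0.0; }

    #pragma omp target map(to: a) map(from: b) map(tofrom: scale)
    {
        scale = 2.0;                     /* sequential part: master-only code */
        #pragma omp parallel for         /* fork: workers execute the loop    */
        for (int i = 0; i < N; ++i)
            b[i] = scale * a[i];
    }                                    /* join at the end of the region     */

    printf("b[10] = %.1f\n", b[10]);
    return 0;
}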
Considering a 2D matrix of positive and negative numbers, how might one draw a rectangle within it whose contents sum higher than those of all other rectangles? This fundamental problem, commonly known as the maximum rectangle problem or subwindow search, spans many computational domains. Yet the problem has not been solved without demanding computational resources at least linearly proportional to the size of the matrix. In this work, we present a new approach to the problem which achieves sublinear time and memory complexities by interpolating between a small number of equidistant sections of the matrix. Applied to natural images, our solution outperforms the state of the art by achieving an 11x increase in speed and memory efficiency at 99% comparative accuracy. In general, our solution outperforms existing solutions when matrices are sufficiently large and a marginal decrease in accuracy is acceptable, such as in many problems involving natural images. As such, it is well suited for real-time application and for a variety of computationally hard instances of the maximum rectangle problem.
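For reference, the classic exact baseline against which sublinear approaches are compared is the 2D extension of Kadane's maximum-subarray scan: fix a pair of rows, collapse the columns between them into a 1-D array, and run the linear scan on it, for O(rows^2 * cols) time. The C sketch below implements that baseline on a small example; it is not the interpolation-based method of the paper.

/* Exact maximum-sum rectangle via row-pair enumeration plus Kadane's scan. */
#include <stdio.h>

#define ROWS 4
#define COLS 5

int main(void) {
    int m[ROWS][COLS] = {
        { 1, -2,  3,  4, -1},
        {-3,  4, -1,  2,  0},
        { 2, -1,  5, -3,  2},
        {-4,  1, -2,  3, -1}
    };
    int best = m[0][0];

    for (int top = 0; top < ROWS; ++top) {
        int col_sum[COLS] = {0};
        for (int bottom = top; bottom < ROWS; ++bottom) {
            /* col_sum[c] holds the sum of column c between rows top..bottom. */
            for (int c = 0; c < COLS; ++c) col_sum[c] += m[bottom][c];
            /* Kadane's scan over col_sum: best contiguous run of columns.    */
            int run = col_sum[0];
            if (run > best) best = run;
            for (int c = 1; c < COLS; ++c) {
                run = (run > 0 ? run : 0) + col_sum[c];
                if (run > best) best = run;
            }
        }
    }
    printf("maximum rectangle sum: %d\n", best);
    return 0;
}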
Specialized Kernels for Optimizing GPU Offload in OpenMP Chakrabarti, Dhruva; Rodgers, Gregory; Bertolli, Carlo ...
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis,
11/2023
Conference Proceeding
Programming models for general-purpose GPU (GPGPU) computing include grid and non-grid languages. Grid languages like CUDA and HIP map directly to the GPU hardware and can extract high performance from applications. However, this low-level programming approach makes them more difficult to program than non-grid languages such as C, C++, and Fortran with OpenMP target offload. Furthermore, grid languages often have more portability issues than non-grid languages. On the other hand, code generated from non-grid languages using automatic compiler and runtime techniques often incurs higher overhead in the GPU kernels produced for target regions.
This paper discusses compiler and runtime techniques to generate specialized, high-performance kernels for OpenMP target regions in certain common situations. We outline the conditions under which specialized kernels are generated for OpenMP target regions, both with and without reduction clauses. Experimental results on AMD GPUs indicate that a large percentage of OpenMP target regions are amenable to specialization and consequent improvement in performance.
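The snippet below is a representative OpenMP target region with a reduction clause, written in standard OpenMP; regions of this common shape are the kind the paper says can be recognized and lowered to a specialized kernel. The specialization itself happens inside the compiler and runtime and is not shown here.

/* A combined target construct with a reduction, one of the "common
 * situations" amenable to specialized kernel generation. */
#include <stdio.h>

#define N 4096

int main(void) {
    double x[N], sum = 0.0;
    for (int i = 0; i < N; ++i) x[i] = 0.5;

    #pragma omp target teams distribute parallel for reduction(+:sum) map(to: x) map(tofrom: sum)
    for (int i = 0; i < N; ++i)
        sum += x[i];

    printf("sum = %.1f\n", sum);   /* expect 2048.0 */
    return 0;
}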
Offloading Support for OpenMP in Clang and LLVM Antao, Samuel F.; Bataev, Alexey; Jacob, Arpith C. ...
2016 Third Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC),
11/2016
Conference Proceeding
OpenMP 4.5 allows performance portability by enabling users to write a single application code and run it on multiple types of accelerators. Our goal is to deliver a high-performance implementation of OpenMP in the Clang/LLVM project. This paper describes our initial work to fully support code generation for OpenMP device offloading constructs. We describe a new driver implementation to handle compilation for multiple host and device types, which generalizes the current Clang CUDA implementation and supports OpenMP. It can also be extended to any offloading-based language, including OpenCL and OpenACC. We describe an implementation of the OpenMP offloading constructs in the runtime library, giving details on two critical aspects: first, how data mapping is implemented; second, how different device code sections in the binaries are handled to enable application execution on different devices without recompilation. We report initial performance on a prototype that extends the current LLVM trunk repositories with all our proposed patches plus future ones, showing near-CUDA performance of our solution.
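For context on the data-mapping aspect mentioned above, the following is standard OpenMP 4.5 user code whose map-clause semantics such a runtime has to implement: copy `in` to the device before the region, copy `out` back afterwards, and keep `tmp` device-only via alloc. The arrays and values are made up for illustration.

/* Standard OpenMP 4.5 map clauses exercising the data-mapping runtime. */
#include <stdio.h>

#define N 256

int main(void) {
    double in[N], out[N], tmp[N];
    for (int i = 0; i < N; ++i) in[i] = i;

    #pragma omp target map(to: in) map(from: out) map(alloc: tmp)
    {
        for (int i = 0; i < N; ++i) tmp[i] = 2.0 * in[i];   /* device-resident scratch */
        for (int i = 0; i < N; ++i) out[i] = tmp[i] + 1.0;  /* copied back on exit     */
    }

    printf("out[3] = %.1f\n", out[3]);   /* 2*3 + 1 = 7.0 */
    return 0;
}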
Deep neural network models are becoming increasingly popular and have been used in various tasks such as computer vision, speech recognition, and natural language processing. Machine learning models are commonly trained in a resource-rich environment and then deployed in a distinct environment such as high-availability machines or edge devices. To assist the portability of models, the open-source community has proposed the Open Neural Network Exchange (ONNX) standard. In this paper, we present a high-level, preliminary report on our onnx-mlir compiler, which generates code for the inference of deep neural network models described in the ONNX format. Onnx-mlir is an open-source compiler implemented using the Multi-Level Intermediate Representation (MLIR) infrastructure recently integrated in the LLVM project. Onnx-mlir relies on the MLIR concept of dialects to implement its functionality. We propose two new dialects: (1) an ONNX-specific dialect that encodes the ONNX standard semantics, and (2) a loop-based dialect that provides a common lowering point for all ONNX dialect operations. Each intermediate representation facilitates its own characteristic set of optimizations, graph-level and loop-based respectively. We illustrate our approach by following several models through the proposed representations, and we include some early optimization work and performance results.