We present ADIOS 2, the latest version of the Adaptable Input Output (I/O) System. ADIOS 2 addresses scientific data management needs ranging from scalable I/O in supercomputers, to data analysis in ...personal computer and cloud systems. Version 2 introduces a unified application programming interface (API) that enables seamless data movement through files, wide-area-networks, and direct memory access, as well as high-level APIs for data analysis. The internal architecture provides a set of reusable and extendable components for managing data presentation and transport mechanisms for new applications. ADIOS 2 bindings are available in C++11, C, Fortran, Python, and Matlab and are currently used across different scientific communities. ADIOS 2 provides a communal framework to tackle data management challenges as we approach the exascale era of supercomputing.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
The Wushan Salamander (
), Jinfo Salamander (
), and Wenxian Knobby Salamander (
) are rare national Class II protected wild animals in China. We performed MaxEnt modeling to predict and analyze the ...potential distribution and trends of these species in Chongqing under current and future climate conditions. Species distribution data were primarily obtained from field surveys, supplemented by museum collections and the existing literature. These efforts yielded 636 records, including 43 for
, 23 for
, and 570 for
. Duplicate records within the same 100 m × 100 m grid cell were removed using ENMTools, resulting in 10, 12, and 58 valid distribution points for
,
, and
, respectively. The optimization of feature class parameters (FC) and the regularization multiplier (RM) were applied using R package "ENMeval 2.0" to establish the optimal model with MaxEnt. The refined models were applied to simulate the suitable distribution areas for the three species. The results indicate that the current suitable habitat area for
accounted for 9.72% of the whole region of the Chongqing municipality. It is projected that, by 2050, the proportion of suitable habitat will increase to 12.54% but will decrease to 11.98% by 2070 and further decline to 8.80% by 2090. The current suitable habitat area for
accounted for 1.08% of the whole region of the Chongqing municipality, which is expected to decrease to 0.31%% by 2050, 0.20% by 2070, and 0.07% by 2090. The current suitable habitat area for
accounted for 0.81% of the whole region of the Chongqing municipality, which is anticipated to decrease to 0.37% by 2050, 0.21% by 2070, and 0.06% by 2090. Human disturbance, climate variables, and habitat characteristics are the primary factors influencing the distribution of three salamander species in Chongqing. The proximity to roads significantly impacts
, while climate conditions mainly affect
, and the distance to water sources is crucial for
. The following suggestions were made based on key variables identified for each species: (1) For
, it is imperative to minimize human disturbances and preserve areas without roads and the existing vegetation within nature reserves to ensure their continued existence. (2) For
, the conservation of high-altitude habitats is of utmost importance, along with the reduction in disturbances caused by roads to maintain the species' ecological niche. (3) For
, the protection of aquatic habitats is crucial. Additionally, efforts to mitigate the impacts of road construction and enhance public awareness are essential for the preservation of this species and the connectivity of its habitats.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, UILJ, UKNU, UL, UM, UPUK
Inhomogeneous and random scattering in a static-scattering medium diffuses an incident light field into a complex speckle pattern. This phenomenon leads to uncontrolled and unpredictable changes in ...the spectrum as incident polychromatic light is scattered by the static-scattering medium. In this paper, we propose and demonstrate a novel technique for shaping the randomly scattered spectrum into a controlled mode. The proposed technique is demonstrated experimentally by examples of bandpass spectrum, stairs, right triangle, and Gaussian spectra. A theoretical model for simulating polychromatic light field passing through a scattering medium is also developed. Theoretical and experimental results are consistent. The findings of this paper may present exciting applications, such as compensation for fiber dispersion.
LaTiO2N photocatalysts were prepared by thermal ammonolysis of flux‐synthesized La2Ti2O7 and La2TiO5, and were investigated for water oxidation. Though LaTiO2N derived from La2TiO5 appears ...defect‐free by UV/Vis/near‐IR and electron paramagnetic resonance (EPR) spectroscopy, its performance is much lower than that of conventional La2Ti2O7‐derived LaTiO2N with defects. It is shown by Mott–Schottky analysis that La2TiO5‐derived LaTiO2N has significantly lower donor density; this can result in insufficient built‐in electric field for the separation of photogenerated electrons and holes. The lower donor density is also consistent with the smaller difference between the Fermi level and the valence‐band maximum, which accounts for a lower oxidative power of the holes. In light of this discovery, the donor density was increased substantially by introducing anion vacancies through annealing in Ar. This resulted in improved performance. The CoOx‐assisted La2TiO5‐derived LaTiO2N annealed at 713 °C has a higher quantum efficiency (25 %) at 450 nm than high‐performance conventional CoOx/LaTiO2N (21 %).
Improving photocatalysts: Sufficient donor density in LaTiO2N photocatalysts is essential to generate an efficient internal electric field for charge separation and for high oxidizing power of the holes. On the basis of this understanding, LaTiO2N photocatalysts were modified to give a quantum efficiency for water oxidation of 25 % at 450 nm, which is higher than that of conventional LaTiO2N.
Full text
Available for:
BFBNIB, FZAB, GIS, IJS, KILJ, NLZOH, NUK, OILJ, SAZU, SBCE, SBMB, UL, UM, UPUK
Optical twist has always been a hot spot in optics since it was discovered in 1993. Twisted beams can be generated by introducing the twist phase into partially coherent beams, or by introducing the ...twisting phase into anisotropic beams, whose spectral density and degree of coherence will spontaneously rotate during propagation. Unlike conventional beams, twisted beams have unique properties and can be used in many applications, such as optical communications, laser material processing, and particle manipulation. In this paper, we present a review of recent developments on phase studies of beams carrying twist.
We present the Exascale Framework for High Fidelity coupled Simulations (EFFIS), a workflow and code coupling framework developed as part of the Whole Device Modeling Application (WDMApp) in the ...Exascale Computing Project. EFFIS consists of a library, command line utilities, and a collection of run-time daemons. Together, these software products enable users to easily compose and execute workflows that include: strong or weak coupling, in situ (or offline) analysis/visualization/monitoring, command-and-control actions, remote dashboard integration, and more. We describe WDMApp physics coupling cases and computer science requirements that motivate the design of the EFFIS framework. Furthermore, we explain the essential enabling technology that EFFIS leverages: ADIOS for performant data movement, PerfStubs/TAU for performance monitoring, and an advanced COUPLER for transforming coupling data from its native format to the representation needed by another application. Finally, we demonstrate EFFIS using coupled multi-simulation WDMApp workflows and exemplify how the framework supports the project’s needs. We show that EFFIS and its associated services for data movement, visualization, and performance collection does not introduce appreciable overhead to the WDMApp workflow and that the resource-dominant application’s idle time while waiting for data is minimal.
Full text
Available for:
NUK, OILJ, SAZU, UKNU, UL, UM, UPUK
A novel class of partially coherent light sources termed optical coherence grids (OCGs) are introduced that can yield stable optical grids in the far field. The optical grids, of which the light ...distributes in a network of straight lines crossing each other to form a series of hollow cages, can be seen as a better controlled optical lattice. Propagation properties of OCG beams in free space, including spectral density, transverse coherence, and M
factor, are investigated in detail. It is interesting that a periodic grid pattern is produced at a distance and remains stable on further propagation, and we stress that the structure of far-field optical grids can be flexibly tuned by modulating the correlation parameters of the source. In addition, by performing convolution of degree of coherence, we also propose perfect optical coherence grids (POCG). The far-field grid pattern of POCG is in a fully controllable fashion. This work is expected to find applications in cooling atoms, trapping microscopic particles, or assembling cells, etc.
The remarkable performance capabilities of AI accelerators offer promising opportunities for accelerating cryptographic algorithms, particularly in the context of lattice-based cryptography. However, ...current approaches to leveraging AI accelerators often remain at a rudimentary level of implementation, overlooking the intricate internal mechanisms of these devices. Consequently, a significant number of computational resources is underutilized.In this paper, we present a comprehensive exploration of NVIDIA Tensor Cores and introduce a novel framework tailored specifically for Kyber. Firstly, we propose two innovative approaches that efficiently break down Kyber’s NTT into iterative matrix multiplications, resulting in approximately a 75% reduction in costs compared to the state-of-the-art scanning-based methods. Secondly, by reversing the internal mechanisms, we precisely manipulate the internal resources of Tensor Cores using assembly-level code instead of inefficient standard interfaces, eliminating memory accesses and redundant function calls. Finally, building upon our highly optimized NTT, we provide a complete implementation for all parameter sets of Kyber. Our implementation surpasses the state-of-the-art Tensor Core based work, achieving remarkable speed-ups of 1.93x, 1.65x, 1.22x and 3.55x for polyvec_ntt, KeyGen, Enc and Dec in Kyber-1024, respectively. Even when considering execution latency, our throughput-oriented full Kyber implementation maintains an acceptable execution latency. For instance, the execution latency ranges from 1.02 to 5.68 milliseconds for Kyber-1024 on R3080 when achieving the peak throughput.
With the increase of the scale and intensity of the parallel I/O workloads generated by those scientific applications running on high performance computing facilities, understanding the I/O dynamics, ...especially the root cause of the I/O performance variability and degradation in HPC environment, have become extremely critical to the HPC community. In this paper, we run extensive I/O measuring tests on a production leadership-class storage system to capture the performance variabilities of large-scale parallel I/O. Analyzing these results and its statistic correlation revealed some valuable insights into the characteristics of the storage system and the root cause of I/O performance variability. Further, we leverage these findings and propose an I/O middleware design refactoring which can improve the performance of the parallel I/O by optimizing the data striping and placement. Our preliminary evaluation results demonstrate the proposed approach can reduce the average per-process write latency by at least 80% and the maximum per-process write latency by at least 20%.
A growing disparity between supercomputer computation speeds and I/O rates means that it is rapidly becoming infeasible to analyze supercomputer application output only after that output has been ...written to a file system. Instead, data-generating applications must run concurrently with data reduction and/or analysis operations, with which they exchange information via high-speed methods such as interprocess communications. The resulting parallel computing motif, online data analysis and reduction (ODAR), has important implications for both application and HPC systems design. Here we introduce the ODAR motif and its co-design concerns, describe a co-design process for identifying and addressing those concerns, present tools that assist in the co-design process, and present case studies to illustrate the use of the process and tools in practical settings.
Full text
Available for:
NUK, OILJ, SAZU, UKNU, UL, UM, UPUK