Local Relation Networks for Image Recognition Hu, Han; Zhang, Zheng; Xie, Zhenda ...
2019 IEEE/CVF International Conference on Computer Vision (ICCV),
10/2019
Conference Proceeding
Odprti dostop
The convolution layer has been the dominant feature extractor in computer vision for years. However, the spatial aggregation in convolution is basically a pattern matching process that applies fixed ...filters which are inefficient at modeling visual elements with varying spatial distributions. This paper presents a new image feature extractor, called the local relation layer, that adaptively determines aggregation weights based on the compositional relationship of local pixel pairs. With this relational approach, it can composite visual elements into higher-level entities in a more efficient manner that benefits semantic inference. A network built with local relation layers, called the Local Relation Network (LR-Net), is found to provide greater modeling capacity than its counterpart built with regular convolution on large-scale recognition tasks such as ImageNet classification.
We present techniques for scaling Swin Transformer 35 up to 3 billion parameters and making it capable of training with images of up to 1,536x1,536 resolution. By scaling up capacity and resolution, ...Swin Transformer sets new records on four representative vision benchmarks: 84.0% top-1 accuracy on ImageNet- V2 image classification, 63.1 / 54.4 box / mask mAP on COCO object detection, 59.9 mIoU on ADE20K semantic segmentation, and 86.8% top-1 accuracy on Kinetics-400 video action classification. We tackle issues of training instability, and study how to effectively transfer models pre-trained at low resolutions to higher resolution ones. To this aim, several novel technologies are proposed: 1) a residual post normalization technique and a scaled cosine attention approach to improve the stability of large vision models; 2) a log-spaced continuous position bias technique to effectively transfer models pre-trained at low-resolution images and windows to their higher-resolution counterparts. In addition, we share our crucial implementation details that lead to significant savings of GPU memory consumption and thus make it feasi-ble to train large vision models with regular GPUs. Using these techniques and self-supervised pre-training, we suc-cessfully train a strong 3 billion Swin Transformer model and effectively transfer it to various vision tasks involving high-resolution images or windows, achieving the state-of-the-art accuracy on a variety of benchmarks. Code is avail-able at https://github.com/microsoft/Swin-Transformer.
Abstract
As a quantum material, Weyl semimetal has a series of electronic-band-structure features, including Weyl points with left and right chirality and corresponding Berry curvature, which have ...been observed in experiments. These band-structure features also lead to some unique nonlinear properties, especially high-order harmonic generation (HHG) due to the dynamic process of electrons under strong laser excitation, which has remained unexplored previously. Herein, we obtain effective HHG in type-II Weyl semimetal β-WP
2
crystals, where both odd and even orders are observed, with spectra extending into the vacuum ultraviolet region (190 nm, 10th order), even under fairly low femtosecond laser intensity. In-depth studies have interpreted that odd-order harmonics come from the Bloch electron oscillation, while even orders are attributed to Bloch oscillations under the “spike-like” Berry curvature at Weyl points. With crystallographic orientation-dependent HHG spectra, we further quantitatively retrieved the electronic band structure and Berry curvature of β-WP
2
. These findings may open the door for exploiting metallic/semimetallic states as solid platforms for deep ultraviolet radiation and offer an all-optical and pragmatic solution to characterize the complicated multiband electronic structure and Berry curvature of quantum topological materials.
Abstract
Dissipative Kerr soliton (DKS) frequency combs—also known as microcombs—have arguably created a new field in cavity nonlinear photonics, with a strong cross-fertilization between ...theoretical, experimental, and technological research. Spatiotemporal mode-locking (STML) not only adds new degrees of freedom to ultrafast laser technology, but also provides new insights for implementing analogue computers and heuristic optimizers with photonics. Here, we combine the principles of DKS and STML to demonstrate the STML DKS by developing an unexplored ultrahigh-quality-factor Fabry–Pérot (FP) mesoresonator based on graded index multimode fiber (GRIN-MMF). Complementing the two-step pumping scheme with a cavity stress tuning method, we can selectively excite either the eigenmode DKS or the STML DKS. Furthermore, we demonstrate an ultralow noise microcomb that enhances the photonic flywheel performance in both the fundamental comb linewidth and DKS timing jitter. The demonstrated fundamental comb linewidth of 400 mHz and DKS timing jitter of 500 attosecond (averaging times up to 25 μs) represent improvements of 25× and 2.5×, respectively, from the state-of-the-art. Our results show the potential of GRIN-MMF FP mesoresonators as an ideal testbed for high-dimensional nonlinear cavity dynamics and photonic flywheel with ultrahigh coherence and ultralow timing jitter.
Graphene, a unique two-dimensional material comprising carbon in a honeycomb lattice1, has brought breakthroughs across electronics, mechanics and thermal transport, driven by the quasiparticle Dirac ...fermions obeying a linear dispersion2,3. Here, we demonstrate a counter-pumped all-optical difference frequency process to coherently generate and control terahertz plasmons in atomic-layer graphene with octave-level tunability and high efficiency. We leverage the inherent surface asymmetry of graphene for strong second-order nonlinear polarizability4,5, which, together with tight plasmon field confinement, enables a robust difference frequency signal at terahertz frequencies. The counter-pumped resonant process on graphene uniquely achieves both energy and momentum conservation. Consequently, we demonstrate a dual-layer graphene heterostructure with terahertz charge- and gate-tunability over an octave, from 4.7 THz to 9.4 THz, bounded only by the pump amplifier optical bandwidth. Theoretical modelling supports our single-volt-level gate tuning and optical-bandwidth-bounded 4.7 THz phase-matching measurements through the random phase approximation, with phonon coupling, saturable absorption and below the Landau damping, to predict and understand graphene plasmon physics.
Quantum entanglement is a fundamental resource for secure information processing and communications, and hyperentanglement or high-dimensional entanglement has been separately proposed for its high ...data capacity and error resilience. The continuous-variable nature of the energy-time entanglement makes it an ideal candidate for efficient high-dimensional coding with minimal limitations. Here, we demonstrate the first simultaneous high-dimensional hyperentanglement using a biphoton frequency comb to harness the full potential in both the energy and time domain. Long-postulated Hong-Ou-Mandel quantum revival is exhibited, with up to 19 time-bins and 96.5% visibilities. We further witness the high-dimensional energy-time entanglement through Franson revivals, observed periodically at integer time-bins, with 97.8% visibility. This qudit state is observed to simultaneously violate the generalized Bell inequality by up to 10.95 standard deviations while observing recurrent Clauser-Horne-Shimony-Holt S-parameters up to 2.76. Our biphoton frequency comb provides a platform for photon-efficient quantum communications towards the ultimate channel capacity through energy-time-polarization high-dimensional encoding.
We demonstrate a compact, high efficiency and widely tunable intracavity singly resonant optical parametric oscillator (IC-OPO) based on multichannel periodically poled lithium niobate (PPLN). The ...IC-OPO is composed of 808 nm pump laser diode (LD), Nd:YVO
laser and linear OPO. The continuous-wave (CW) mid-infrared (MIR) output laser is tunable from 2.25 to 4.79 μm. The maximum output power exceeds 1.08 W at 3.189 μm at 9.1 W LD pump power and the conversion efficiency is 11.88%. We also build up a prototype with volume of 145 × 85 × 42.5 mm
and its total weight is less than 2 kg. The measured power stability is 1.3% Root Meat Square (RMS) for a 3 h duration under simulated high temperature conditions of 40 °C. RMS is 2.6% for a 4 h duration when simulated temperature is - 40 °C.
Dissipative Kerr soliton (DKS) microcomb has emerged as an enabling technology that revolutionizes a wide range of applications in both basic science and technological innovation. Reliable turnkey ...operation with sub-optical-cycle and sub-femtosecond timing jitter is key to the success of many intriguing microcomb applications at the intersection of ultrafast optics and microwave electronics. Here we propose an approach and demonstrate the first turnkey Brillouin-DKS frequency comb to the best of our knowledge. Our microresonator-filtered laser design offers essential benefits, including phase insensitivity, self-healing capability, deterministic selection of the DKS state, and access to the ultralow noise comb state. The demonstrated turnkey Brillouin-DKS frequency comb achieves a fundamental comb linewidth of 100 mHz and DKS timing jitter of 1 femtosecond for averaging times up to 56 μs. The approach is universal and generalizable to various device platforms for user-friendly and field-deployable comb devices.
Abstract
Optical frequency combs in microresonators (microcombs) have a wide range of applications in science and technology, due to its compact size and access to considerably larger comb spacing. ...Despite recent successes, the problems of self-starting, high mode efficiency as well as high output power have not been fully addressed for conventional soliton microcombs. Recent demonstration of laser cavity soliton microcombs by nesting a microresonator into a fiber cavity, shows great potential to solve the problems. Here we study the dissipative soliton generation and interaction dynamics in a microresonator-filtered fiber laser in both theory and experiment. We bring theoretical insight into the mode-locking principle, discuss the parameters effect on soliton properties, and provide experimental guidelines for broadband soliton generation. We predict chirped bright dissipative soliton with flat-top spectral envelope in microresonators with normal dispersion, which is fundamentally forbidden for the externally driven case. Furthermore, we experimentally achieve soliton microcombs with large bandwidth of ~10 nm and high mode efficiency of 90.7%. Finally, by taking advantage of an ultrahigh-speed time magnifier, we study the real-time soliton formation and interaction dynamics and experimentally observe soliton Newton’s cradle. Our study will benefit the design of the novel, high-efficiency and self-starting microcombs for real-world applications.