ترغب بنشر مسار تعليمي؟ اضغط هنا

Crystal-field excitations, for example in transition-metal oxides where a rare-earth element is used as a spacer between the transition-metal-oxide tetrahedra and octahedra, are assumed to be extremely robust with respect to external perturbations su ch as temperature. Using inelastic neutron scattering experiments, a giant shift of the energy of the lowest crystal-field excitation of Er3+ (4I15/2) in ErFeO3 from 0.30(2) meV to 0.75(2) meV was measured below the magnetic-ordering temperature of erbium at 4.1 K. Quantum-mechanical point-charge calculations of the crystal-field levels indicate that the shift is caused by the internal magnetic field created by the erbium spins themselves, which causes a Zeeman splitting of the erbium 4f electronic levels, and therefore a change in the energies of crystal-field transitions. To verify this explanation, the effect of an external magnetic field on the crystal-field excitations was measured by inelastic neutron scattering and compared to the field-dependent point-charge calculations. The existence of an internal magnetic exchange interaction will have implications for a deeper understanding of a broader group of phenomena such as multiferroic properties or spin frustration, which are a consequence of various competing electronic and magnetic exchange interactions.
We propose a method for emotion recognition through emotiondependent speech recognition using Wav2vec 2.0. Our method achieved a significant improvement over most previously reported results on IEMOCAP, a benchmark emotion dataset. Different types of phonetic units are employed and compared in terms of accuracy and robustness of emotion recognition within and across datasets and languages. Models of phonemes, broad phonetic classes, and syllables all significantly outperform the utterance model, demonstrating that phonetic units are helpful and should be incorporated in speech emotion recognition. The best performance is from using broad phonetic classes. Further research is needed to investigate the optimal set of broad phonetic classes for the task of emotion recognition. Finally, we found that Wav2vec 2.0 can be fine-tuned to recognize coarser-grained or larger phonetic units than phonemes, such as broad phonetic classes and syllables.
Much of the recent literature on automatic speech recognition (ASR) is taking an end-to-end approach. Unlike English where the writing system is closely related to sound, Chinese characters (Hanzi) represent meaning, not sound. We propose factoring a udio -> Hanzi into two sub-tasks: (1) audio -> Pinyin and (2) Pinyin -> Hanzi, where Pinyin is a system of phonetic transcription of standard Chinese. Factoring the audio -> Hanzi task in this way achieves 3.9% CER (character error rate) on the Aishell-1 corpus, the best result reported on this dataset so far.
161 - Hong Yuan , Yu-Han Ma , 2021
We study the non-equilibrium thermodynamics of a heat engine operating between two finite-sized reservoirs with well-defined temperatures. Within the linear response regime, it is discovered that there exists a power-efficiency trade-off depending on the ratio of heat capacities ($gamma$) of the reservoirs for the engine; the uniform temperature of the two reservoirs at final time $tau$ is bounded from below by the entropy production $sigma_{mathrm{min}}propto1/tau$. We further obtain a universal efficiency at maximum power of the engine for arbitrary $gamma$. Our findings can be used to develop an optimization scenario for thermodynamic cycles with finite-sized reservoirs in practice.
Driving materials out of equilibrium by ultra-short laser pulses offers unprecedented access to new chemistry and physics. Layered tin selenide (SnSe) has recently emerged as a promising candidate for high-performance thermoelectrics (TE) with the cu rrent record figure of merit (ZT) observed in high temperature Cmcm phase. However, traditional investigations on Cmcm phase are carried out in thermal equilibrium condition, therefore restricted in a compromised version due to electron-phonon coupling. In this study, we demonstrate that, through femtosecond-resolved coherent phonon spectroscopy, SnSe displays a transient photoinduced switch of point-group symmetry from Pnma to Cmcm at room temperature. This non-equilibrium Cmcm phase exists in SnSe with the status of cold lattice with hot electrons, which allows us to noninvasively characterize physical properties in a selective manner. By independently control the electronic and lattice degree of freedom in SnSe, a giant anharmonicity and the non-thermal softening nature are demonstrated in the transient Cmcm phase. Our results, which provide a nonequilibrium platform to investigate thermal properties with a cold lattice, may reform the current thermal equilibrium-based thinking in TE research community.
332 - Zonghai Yao , Hong Yu 2021
Models pre-trained on large-scale regular text corpora often do not work well for user-generated data where the language styles differ significantly from the mainstream text. Here we present Context-Aware Rule Injection (CARI), an innovative method f or formality style transfer (FST). CARI injects multiple rules into an end-to-end BERT-based encoder and decoder model. It learns to select optimal rules based on context. The intrinsic evaluation showed that CARI achieved the new highest performance on the FST benchmark dataset. Our extrinsic evaluation showed that CARI can greatly improve the regular pre-trained models performance on several tweet sentiment analysis tasks.
A complexity-adaptive tree search algorithm is proposed for $boldsymbol{G}_N$-coset codes that implements maximum-likelihood (ML) decoding by using a successive decoding schedule. The average complexity is close to that of the successive cancellation (SC) decoding for practical error rates when applied to polar codes and short Reed-Muller (RM) codes, e.g., block lengths up to $N=128$. By modifying the algorithm to limit the worst-case complexity, one obtains a near-ML decoder for longer RM codes and their subcodes. Unlike other bit-flip decoders, no outer code is needed to terminate decoding. The algorithm can thus be applied to modified $boldsymbol{G}_N$-coset code constructions with dynamic frozen bits. One advantage over sequential decoders is that there is no need to optimize a separate parameter.
We introduce generalized spatially coupled parallel concatenated codes (GSC-PCCs), a class of spatially coupled turbo-like codes obtained by coupling parallel concatenated codes (PCCs) with a fraction of information bits repeated before the PCC encod ing. GSC-PCCs can be seen as a generalization of the original spatially coupled parallel concatenated convolutional codes (SC-PCCs) proposed by Moloudi et al. [1]. To characterize the asymptotic performance of GSC-PCCs, we derive the corresponding density evolution equations and compute their decoding thresholds. We show that the proposed codes have some nice properties such as threshold saturation and that their decoding thresholds improve with the repetition factor $q$. Most notably, our analysis suggests that the proposed codes asymptotically approach the capacity as $q$ tends to infinity with any given constituent convolutional code.
88 - Qixuan Sun , Yaqi Yin , Hong Yu 2021
Emotion-cause pair extraction (ECPE), an emerging task in sentiment analysis, aims at extracting pairs of emotions and their corresponding causes in documents. This is a more challenging problem than emotion cause extraction (ECE), since it requires no emotion signals which are demonstrated as an important role in the ECE task. Existing work follows a two-stage pipeline which identifies emotions and causes at the first step and pairs them at the second step. However, error propagation across steps and pair combining without contextual information limits the effectiveness. Therefore, we propose a Dual-Questioning Attention Network to alleviate these limitations. Specifically, we question candidate emotions and causes to the context independently through attention networks for a contextual and semantical answer. Also, we explore how weighted loss functions in controlling error propagation between steps. Empirical results show that our method performs better than baselines in terms of multiple evaluation metrics. The source code can be obtained at https://github.com/QixuanSun/DQAN.
A polar-coded transmission (PCT) scheme with joint channel estimation and decoding is proposed for channels with unknown channel state information (CSI). The CSI is estimated via successive cancellation (SC) decoding and the constraints imposed by th e frozen bits. SC list decoding with an outer code improves performance, including resolving a phase ambiguity when using quadrature phase-shift keying (QPSK) and Gray labeling. Simulations with 5G polar codes and QPSK show gains of up to $2$~dB at a frame error rate (FER) of $10^{-4}$ over pilot-assisted transmission for various non-coherent models. Moreover, PCT performs within a few tenths of a dB to a coherent receiver with perfect CSI. For Rayleigh block-fading channels, PCT outperforms an FER upper bound based on random coding and within one dB of a lower bound.
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا