ترغب بنشر مسار تعليمي؟ اضغط هنا

3D convolutional neural networks have achieved promising results for video tasks in computer vision, including video saliency prediction that is explored in this paper. However, 3D convolution encodes visual representation merely on fixed local space time according to its kernel size, while human attention is always attracted by relational visual features at different time of a video. To overcome this limitation, we propose a novel Spatio-Temporal Self-Attention 3D Network (STSANet) for video saliency prediction, in which multiple Spatio-Temporal Self-Attention (STSA) modules are employed at different levels of 3D convolutional backbone to directly capture long-range relations between spatio-temporal features of different time steps. Besides, we propose an Attentional Multi-Scale Fusion (AMSF) module to integrate multi-level features with the perception of context in semantic and spatio-temporal subspaces. Extensive experiments demonstrate the contributions of key components of our method, and the results on DHF1K, Hollywood-2, UCF, and DIEM benchmark datasets clearly prove the superiority of the proposed model compared with all state-of-the-art models.
98 - Lingzhi Zhao , Ying Cui , Zhi Liu 2021
This paper investigates adaptive streaming of one or multiple tiled 360 videos from a multi-antenna base station (BS) to one or multiple single-antenna users, respectively, in a multi-carrier wireless system. We aim to maximize the video quality whil e keeping rebuffering time small via encoding rate adaptation at each group of pictures (GOP) and transmission adaptation at each (transmission) slot. To capture the impact of field-of-view (FoV) prediction, we consider three cases of FoV viewing probability distributions, i.e., perfect, imperfect, and unknown FoV viewing probability distributions, and use the average total utility, worst average total utility, and worst total utility as the respective performance metrics. In the single-user scenario, we optimize the encoding rates of the tiles, encoding rates of the FoVs, and transmission beamforming vectors for all subcarriers to maximize the total utility in each case. In the multi-user scenario, we adopt rate splitting with successive decoding and optimize the encoding rates of the tiles, encoding rates of the FoVs, rates of the common and private messages, and transmission beamforming vectors for all subcarriers to maximize the total utility in each case. Then, we separate the challenging optimization problem into multiple tractable problems in each scenario. In the single-user scenario, we obtain a globally optimal solution of each problem using transformation techniques and the Karush-Kuhn-Tucker (KKT) conditions. In the multi-user scenario, we obtain a KKT point of each problem using the concave-convex procedure (CCCP). Finally, numerical results demonstrate that the proposed solutions achieve notable gains over existing schemes in all three cases. To the best of our knowledge, this is the first work revealing the impact of FoV prediction on the performance of adaptive streaming of tiled 360 videos.
65 - Zhi Liu , Nikhil Garg 2021
Due to the Covid-19 pandemic, more than 500 US-based colleges and universities went test-optional for admissions and promised that they would not penalize applicants for not submitting test scores, part of a longer trend to rethink the role of testin g in college admissions. However, it remains unclear how (and whether) a college can simultaneously use test scores for those who submit them, while not penalizing those who do not--and what that promise even means. We formalize these questions, and study how a college can overcome two challenges with optional testing: $textit{strategic applicants}$ (when those with low test scores can pretend to not have taken the test), and $textit{informational gaps}$ (it has more information on those who submit a test score than those who do not). We find that colleges can indeed do so, if and only if they are able to use information on who has test access and are willing to randomize admissions.
The feasible region $Omega_{{rm ind}}(F)$ of a graph $F$ is the collection of points $(x,y)$ in the unit square such that there exists a sequence of graphs whose edge densities approach $x$ and whose induced $F$-densities approach $y$. A complete des cription of $Omega_{{rm ind}}(F)$ is not known for any $F$ with at least four vertices that is not a clique or an independent set. The feasible region provides a lot of combinatorial information about $F$. For example, the supremum of $y$ over all $(x,y)in Omega_{{rm ind}}(F)$ is the inducibility of $F$ and $Omega_{{rm ind}}(K_r)$ yields the Kruskal-Katona and clique density theorems. We begin a systematic study of $Omega_{{rm ind}}(F)$ by proving some general statements about the shape of $Omega_{{rm ind}}(F)$ and giving results for some specific graphs $F$. Many of our theorems apply to the more general setting of quantum graphs. For example, we prove a bound for quantum graphs that generalizes an old result of Bollobas for the number of cliques in a graph with given edge density. We also consider the problems of determining $Omega_{{rm ind}}(F)$ when $F=K_r^-$, $F$ is a star, or $F$ is a complete bipartite graph. In the case of $K_r^-$ our results sharpen those predicted by the edge-statistics conjecture of Alon et. al. while also extending a theorem of Hirst for $K_4^-$ that was proved using computer aided techniques and flag algebras. The case of the 4-cycle seems particularly interesting and we conjecture that $Omega_{{rm ind}}(C_4)$ is determined by the solution to the triangle density problem, which has been solved by Razborov.
We consider the effect of the coupling between 2D quantum rotors near an XY ferromagnetic quantum critical point and spins of itinerant fermions. We analyze how this coupling affects the dynamics of rotors and the self-energy of fermions.A common bel ief is that near a $mathbf{q}=0$ ferromagnetic transition, fermions induce an $Omega/q$ Landau damping of rotors (i.e., the dynamical critical exponent is $z=3$) and Landau overdamped rotors give rise to non-Fermi liquid fermionic self-energy $Sigmapropto omega^{2/3}$. This behavior has been confirmed in previous quantum Monte Carlo studies. Here we show that for the XY case the behavior is different. We report the results of large scale quantum Monte Carlo simulations, which clearly show that at small frequencies $z=2$ and $Sigmapropto omega^{1/2}$. We argue that the new behavior is associated with the fact that a fermionic spin is by itself not a conserved quantity due to spin-spin coupling to rotors, and a combination of self-energy and vertex corrections replaces $1/q$ in the Landau damping by a constant. We discuss the implication of these results to experiment
105 - Xizhi Liu , Dhruv Mubayi 2021
An $(n,r,s)$-system is an $r$-uniform hypergraph on $n$ vertices such that every pair of edges has an intersection of size less than $s$. Using probabilistic arguments, R{o}dl and v{S}iv{n}ajov{a} showed that for all fixed integers $r> s ge 2$, there exists an $(n,r,s)$-system with independence number $Oleft(n^{1-delta+o(1)}right)$ for some optimal constant $delta >0$ only related to $r$ and $s$. We show that for certain pairs $(r,s)$ with $sle r/2$ there exists an explicit construction of an $(n,r,s)$-system with independence number $Oleft(n^{1-epsilon}right)$, where $epsilon > 0$ is an absolute constant only related to $r$ and $s$. Previously this was known only for $s>r/2$ by results of Chattopadhyay and Goodman
The origin of the pseudogap behavior, found in many high-$T_c$ superconductors, remains one of the greatest puzzles in condensed matter physics. One possible mechanism is fermionic incoherence, which near a quantum critical point allows pair formatio n but suppresses superconductivity. Employing quantum Monte Carlo simulations of a model of itinerant fermions coupled to ferromagnetic spin fluctuations, represented by a quantum rotor, we report numerical evidence of pseudogap behavior, emerging from pairing fluctuations in a quantum-critical non-Fermi liquid. Specifically, we observe enhanced pairing fluctuations and a partial gap opening in the fermionic spectrum. However, the system remains non-superconducting until reaching a much lower temperature. In the pseudogap regime the system displays a gap-filling rather than gap-closing behavior, consistent with experimental observations. Our results provide the first unambiguous lattice model realization of a pseudogap state in a strongly correlated system, driven by superconducting fluctuations.
124 - Chengjun Guo , Ying Cui , Zhi Liu 2021
In this paper, we study the optimal transmission of a multi-quality tiled 360 virtual reality (VR) video from a multi-antenna server (e.g., access point or base station) to multiple single-antenna users in a multiple-input multiple-output (MIMO)-orth ogonal frequency division multiple access (OFDMA) system. We minimize the total transmission power with respect to the subcarrier allocation constraints, rate allocation constraints, and successful transmission constraints, by optimizing the beamforming vector and subcarrier, transmission power and rate allocation. The formulated resource allocation problem is a challenging mixed discrete-continuous optimization problem. We obtain an asymptotically optimal solution in the case of a large antenna array, and a suboptimal solution in the general case. As far as we know, this is the first work providing optimization-based design for 360 VR video transmission in MIMO-OFDMA systems. Finally, by numerical results, we show that the proposed solutions achieve significant improvement in performance compared to the existing solutions.
We present a method which provides a unified framework for most stability theorems that have been proved in graph and hypergraph theory. Our main result reduces stability for a large class of hypergraph problems to the simpler question of checking th at a hypergraph $mathcal H$ with large minimum degree that omits the forbidden structures is vertex-extendable. This means that if $v$ is a vertex of $mathcal H$ and ${mathcal H} -v$ is a subgraph of the extremal configuration(s), then $mathcal H$ is also a subgraph of the extremal configuration(s). In many cases vertex-extendability is quite easy to verify. We illustrate our approach by giving new short proofs of hypergraph stability results of Pikhurko, Hefetz-Keevash, Brandt-Irwin-Jiang, Bene Watts-Norin-Yepremyan and others. Since our method always yields minimum degree stability, which is the strongest form of stability, in some of these cases our stability results are stronger than what was known earlier. Along the way, we clarify the different notions of stability that have been previously studied.
Due to the complicated procedure and costly hardware, Simultaneous Localization and Mapping (SLAM) has been heavily dependent on public datasets for drill and evaluation, leading to many impressive demos and good benchmark scores. However, with a hug e contrast, SLAM is still struggling on the way towards mature deployment, which sounds a warning: some of the datasets are overexposed, causing biased usage and evaluation. This raises the problem on how to comprehensively access the existing datasets and correctly select them. Moreover, limitations do exist in current datasets, then how to build new ones and which directions to go? Nevertheless, a comprehensive survey which can tackle the above issues does not exist yet, while urgently demanded by the community. To fill the gap, this paper strives to cover a range of cohesive topics about SLAM related datasets, including general collection methodology and fundamental characteristic dimensions, SLAM related tasks taxonomy and datasets categorization, introduction of state-of-the-arts, overview and comparison of existing datasets, review of evaluation criteria, and analyses and discussions about current limitations and future directions, looking forward to not only guiding the dataset selection, but also promoting the dataset research.
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا