ترغب بنشر مسار تعليمي؟ اضغط هنا

We present a conditional text generation framework that posits sentential expressions of possible causes and effects. This framework depends on two novel resources we develop in the course of this work: a very large-scale collection of English senten ces expressing causal patterns CausalBank; and a refinement over previous work on constructing large lexical causal knowledge graphs Cause Effect Graph. Further, we extend prior work in lexically-constrained decoding to support disjunctive positive constraints. Human assessment confirms that our approach gives high-quality and diverse outputs. Finally, we use CausalBank to perform continued training of an encoder supporting a recent state-of-the-art model for causal reasoning, leading to a 3-point improvement on the COPA challenge set, with no change in model architecture.
In the past few years, Chinese astronomical community is actively testing astronomical sites for several new optical/infrared ground-based telescopes. These site testing campaigns conducted were mainly focused on fundamental performances of the site, such as cloud coverage, seeing, temperature, etc. With increasing interests in sodium laser guide star adaptive optics for these new telescopes in the Chinese astronomical community, it is interesting to investigate the performance of the laser guide star at these sites, especially considering that the sodium laser guide stars on-sky performance is significantly influenced by sites local performances, such as geomagnetic field, sodium layer dynamics, density of air molecule, etc. In this paper, we studied sodium laser guide stars performance of a 20W class Quasi-CW pulsed laser developed by TIPC with numerical simulation for five selected sites in China.
Most differentiable neural architecture search methods construct a super-net for search and derive a target-net as its sub-graph for evaluation. There exists a significant gap between the architectures in search and evaluation. As a result, current m ethods suffer from an inconsistent, inefficient, and inflexible search process. In this paper, we introduce EnTranNAS that is composed of Engine-cells and Transit-cells. The Engine-cell is differentiable for architecture search, while the Transit-cell only transits a sub-graph by architecture derivation. Consequently, the gap between the architectures in search and evaluation is significantly reduced. Our method also spares much memory and computation cost, which speeds up the search process. A feature sharing strategy is introduced for more balanced optimization and more efficient search. Furthermore, we develop an architecture derivation method to replace the traditional one that is based on a hand-crafted rule. Our method enables differentiable sparsification, and keeps the derived architecture equivalent to that of Engine-cell, which further improves the consistency between search and evaluation. Besides, it supports the search for topology where a node can be connected to prior nodes with any number of connections, so that the searched architectures could be more flexible. For experiments on CIFAR-10, our search on the standard space requires only 0.06 GPU-day. We further have an error rate of 2.22% with 0.07 GPU-day for the search on an extended space. We can also directly perform the search on ImageNet with topology learnable and achieve a top-1 error rate of 23.8% in 2.1 GPU-day.
38 - Zhongyang Li , Fei Lu 2020
In the learning of systems of interacting particles or agents, coercivity condition ensures identifiability of the interaction functions, providing the foundation of learning by nonparametric regression. The coercivity condition is equivalent to the strictly positive definiteness of an integral kernel arising in the learning. We show that for a class of interaction functions such that the system is ergodic, the integral kernel is strictly positive definite, and hence the coercivity condition holds true.
52 - Zhongyang Li 2020
We consider random perfect matchings on a general class of contracting bipartite graphs by letting certain edge weights be 0 on the contracting square-hexagon lattice in a periodic way. We obtain a deterministic limit shape in the scaling limit. The results can also be applied to prove the existence of multiple disconnected liquid regions for all the contracting square-hexagon lattices with certain edge weights, extending the results proved in [13] for contracting square-hexagon lattices where the number of square rows in each period is either 0 or 1.
We present a new learning-based framework to recover vehicle pose in SO(3) from a single RGB image. In contrast to previous works that map from local appearance to observation angles, we explore a progressive approach by extracting meaningful Interme diate Geometrical Representations (IGRs) to estimate egocentric vehicle orientation. This approach features a deep model that transforms perceived intensities to IGRs, which are mapped to a 3D representation encoding object orientation in the camera coordinate system. Core problems are what IGRs to use and how to learn them more effectively. We answer the former question by designing IGRs based on an interpolated cuboid that derives from primitive 3D annotation readily. The latter question motivates us to incorporate geometry knowledge with a new loss function based on a projective invariant. This loss function allows unlabeled data to be used in the training stage to improve representation learning. Without additional labels, our system outperforms previous monocular RGB-based methods for joint vehicle detection and pose estimation on the KITTI benchmark, achieving performance even comparable to stereo methods. Code and pre-trained models are available at this https URL.
A stochastic model of susceptible/infected/removed (SIR) type, inspired by COVID-19, is introduced for the spread of infection through a spatially-distributed population. Individuals are initially distributed at random in space, and they move continu ously according to independent random processes. The disease may pass from an infected individual to an uninfected individual when they are sufficiently close. Infected individuals are permanently removed at some given rate $alpha$. Two models are studied here, termed the delayed diffusion and the diffusion models. In the first, individuals are stationary until they are infected, at which time they begin to move; in the second, all individuals start to move at the initial time $0$. Using a perturbative argument, conditions are established under which the disease infects a.s. only finitely many individuals. It is proved for the delayed diffusion model that there exists a critical value $alpha_cin(0,infty)$ for the existence of a pandemic.
72 - Zhongyang Li 2020
We study the community detection problem on a Gaussian mixture model, in which vertices are divided into $kgeq 2$ distinct communities. The major difference in our model is that the intensities for Gaussian perturbations are different for different e ntries in the observation matrix, and we do not assume that every community has the same number of vertices. We explicitly find the threshold for the exact recovery of the maximum likelihood estimation. Applications include the community detection on hypergraphs.
62 - Zhongyang Li 2020
We study the asymptotics of Schur polynomials with partitions $lambda$ which are almost staircase; more precisely, partitions that differ from $((m-1)(N-1),(m-1)(N-2),ldots,(m-1),0)$ by at most one component at the beginning as $Nrightarrow infty$, f or a positive integer $mge 1$ independent of $N$. By applying either determinant formulas or integral representations for Schur functions, we show that $frac{1}{N}log frac{s_{lambda}(u_1,ldots,u_k, x_{k+1},ldots,x_N)}{s_{lambda}(x_1,ldots,x_N)}$ converges to a sum of $k$ single-variable holomorphic functions, each of which depends on the variable $u_i$ for $1leq ileq k$, when there are only finitely many distinct $x_i$s and each $u_i$ is in a neighborhood of $x_i$, as $Nrightarrowinfty$. The results are related to the law of large numbers and central limit theorem for the dimer configurations on contracting square-hexagon lattices with certain boundary conditions.
A recent approach for object detection and human pose estimation is to regress bounding boxes or human keypoints from a central point on the object or person. While this center-point regression is simple and efficient, we argue that the image feature s extracted at a central point contain limited information for predicting distant keypoints or bounding box boundaries, due to object deformation and scale/orientation variation. To facilitate inference, we propose to instead perform regression from a set of points placed at more advantageous positions. This point set is arranged to reflect a good initialization for the given task, such as modes in the training data for pose estimation, which lie closer to the ground truth than the central point and provide more informative features for regression. As the utility of a point set depends on how well its scale, aspect ratio and rotation matches the target, we adopt the anchor box technique of sampling these transformations to generate additional point-set candidates. We apply this proposed framework, called Point-Set Anchors, to object detection, instance segmentation, and human pose estimation. Our results show that this general-purpose approach can achieve performance competitive with state-of-the-art methods for each of these tasks. Code is available at url{https://github.com/FangyunWei/PointSetAnchor}
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا