ترغب بنشر مسار تعليمي؟ اضغط هنا

Research in machine learning (ML) has primarily argued that models trained on incomplete or biased datasets can lead to discriminatory outputs. In this commentary, we propose moving the research focus beyond bias-oriented framings by adopting a power -aware perspective to study up ML datasets. This means accounting for historical inequities, labor conditions, and epistemological standpoints inscribed in data. We draw on HCI and CSCW work to support our argument, critically analyze previous research, and point at two co-existing lines of work within our community -- one bias-oriented, the other power-aware. This way, we highlight the need for dialogue and cooperation in three areas: data quality, data work, and data documentation. In the first area, we argue that reducing societal problems to bias misses the context-based nature of data. In the second one, we highlight the corporate forces and market imperatives involved in the labor of data workers that subsequently shape ML datasets. Finally, we propose expanding current transparency-oriented efforts in dataset documentation to reflect the social contexts of data design and production.
161 - Maxime Egea 2021
We propose and study a new multilevel method for the numerical approximation of a Gibbs distribution $pi$ on R d , based on (over-damped) Langevin diffusions. This method both inspired by [PP18] and [GMS + 20] relies on a multilevel occupation measur e, i.e. on an appropriate combination of R occupation measures of (constant-step) discretized schemes of the Langevin diffusion with respective steps $gamma$r = $gamma$02 --r , r = 0,. .. , R. For a given diffusion, we first state a result under general assumptions which guarantees an $epsilon$-approximation (in a L 2-sense) with a cost proportional to $epsilon$ --2 (i.e. proportional to a Monte-Carlo method without bias) or $epsilon$ --2 | log $epsilon$| 3 under less contractive assumptions. This general result is then applied to over-damped Langevin diffusions in a strongly convex setting, with a study of the dependence in the dimension d and in the spectrum of the Hessian matrix D 2 U of the potential U : R d $rightarrow$ R involved in the Gibbs distribution. This leads to strategies with cost in O(d$epsilon$ --2 log 3 (d$epsilon$ --2)) and in O(d$epsilon$ --2) under an additional condition on the third derivatives of U. In particular, in our last main result, we show that, up to universal constants, an appropriate choice of the diffusion coefficient and of the parameters of the procedure leads to a cost controlled by ($lambda$ U $lor$1) 2 $lambda$ 3 U d$epsilon$ --2 (where$lambda$U and $lambda$ U respectively denote the supremum and the infimum of the largest and lowest eigenvalue of D 2 U). In our numerical illustrations, we show that our theoretical bounds are confirmed in practice and finally propose an opening to some theoretical or numerical strategies in order to increase the robustness of the procedure when the largest and smallest eigenvalues of D 2 U are respectively too large or too small.
HD 141569 is a Herbig Ae/Be star that straddles the boundary between the transition disks and debris disks. It is a low dust mass disk that reveals numerous structural elements (e.g. gaps and rings) that may point to young planets. It also exhibits a reservoir of CO gas observed at both millimeter and IR wavelengths. Previous observations (Goto et al. 2006) reported a possible asymmetry in the CO gas emission. Herein the IR ro-vibrational emission lines are analyzed and modeled both spectroscopically and spectroastrometrically. We find emission features from both 12CO and 13CO isotopologues heated to a temperature of approximately 200 K in the radial extent of 13 to 60 au. We do not see evidenceof the previously reported asymmetry in CO emission, our results being consistent with a Keplerian, axisymmetric emitting region. This raises the question of whether the emission profile may be evolving in time, possibly as a result of an orbiting feature in the inner disk such as a planet.
Since the early months of 2020, non-pharmaceutical interventions (NPIs) -- implemented at varying levels of severity and based on widely-divergent perspectives of risk tolerance -- have been the primary means to control SARS-CoV-2 transmission. We se ek to identify how risk tolerance and vaccination rates impact the rate at which a population can return to pre-pandemic contact behavior. To this end, we develop a novel feedback control method for data-driven decision-making to identify optimal levels of NPIs across geographical regions in order to guarantee that hospitalizations will not exceed a given risk tolerance. Results are shown for the state of Colorado, and they suggest that: coordination in decision-making across regions is essential to maintain the daily number of hospitalizations below the desired limits; increasing risk tolerance can decrease the number of days required to discontinue NPIs, at the cost of an increased number of deaths; and if vaccination uptake is less than 70%, at most levels of risk tolerance, return to pre-pandemic contact behaviors before the early months of 2022 may newly jeopardize the healthcare system.
152 - Gianluca Francica 2021
Fluctuation theorems are fundamental results in non-equilibrium thermodynamics. Considering the fluctuation theorem with respect to the entropy production and an observable, we derive a new thermodynamic uncertainty relation which also applies to non-cyclic and time-reversal non-symmetric protocols.
One of the most important branches of nonlinear control theory is the so-called sliding-mode. Its aim is the design of a (nonlinear) feedback law that brings and maintains the state trajectory of a dynamic system on a given sliding surface. Here, dyn amics becomes completely independent of the model parameters and can be tuned accordingly to the desired target. In this paper we study this problem when the feedback law is subject to strong structural constraints. In particular, we assume that the control input may take values only over two bounded and disjoint sets. Such sets could be also non perfectly known a priori. An example is a control input allowed to switch only between two values. Under these peculiarities, we derive the necessary and sufficient conditions that guarantee sliding-mode control effectiveness for a class of time-varying continuous-time linear systems that includes all the stationary state-space linear models. Our analysis covers several scientific fields. It is only apparently confined to the linear setting and allows also to study an important set of nonlinear models. We describe fundamental examples related to epidemiology where the control input is the level of contact rate among people and the sliding surface permits to control the number of infected. For popular epidemiological models we prove the global convergence of control schemes based on the introduction of severe restrictions, like lockdowns, to contain epidemic. This greatly generalizes previous results obtained in the literature by casting them within a general sliding-mode theory.
Dual-view snapshot compressive imaging (SCI) aims to capture videos from two field-of-views (FoVs) using a 2D sensor (detector) in a single snapshot, achieving joint FoV and temporal compressive sensing, and thus enjoying the advantages of low-bandwi dth, low-power, and low-cost. However, it is challenging for existing model-based decoding algorithms to reconstruct each individual scene, which usually require exhaustive parameter tuning with extremely long running time for large scale data. In this paper, we propose an optical flow-aided recurrent neural network for dual video SCI systems, which provides high-quality decoding in seconds. Firstly, we develop a diversity amplification method to enlarge the differences between scenes of two FoVs, and design a deep convolutional neural network with dual branches to separate different scenes from the single measurement. Secondly, we integrate the bidirectional optical flow extracted from adjacent frames with the recurrent neural network to jointly reconstruct each video in a sequential manner. Extensive results on both simulation and real data demonstrate the superior performance of our proposed model in a short inference time. The code and data are available at https://github.com/RuiyingLu/OFaNet-for-Dual-view-SCI.
Implicit copulas are the most common copula choice for modeling dependence in high dimensions. This broad class of copulas is introduced and surveyed, including elliptical copulas, skew $t$ copulas, factor copulas, time series copulas and regression copulas. The common auxiliary representation of implicit copulas is outlined, and how this makes them both scalable and tractable for statistical modeling. Issues such as parameter identification, extended likelihoods for discrete or mixed data, parsimony in high dimensions, and simulation from the copula model are considered. Bayesian approaches to estimate the copula parameters, and predict from an implicit copula model, are outlined. Particular attention is given to implicit copula processes constructed from time series and regression models, which is at the forefront of current research. Two econometric applications -- one from macroeconomic time series and the other from financial asset pricing -- illustrate the advantages of implicit copula models.
This paper considers the unsupervised domain adaptation problem for neural machine translation (NMT), where we assume the access to only monolingual text in either the source or target language in the new domain. We propose a cross-lingual data selec tion method to extract in-domain sentences in the missing language side from a large generic monolingual corpus. Our proposed method trains an adaptive layer on top of multilingual BERT by contrastive learning to align the representation between the source and target language. This then enables the transferability of the domain classifier between the languages in a zero-shot manner. Once the in-domain data is detected by the classifier, the NMT model is then adapted to the new domain by jointly learning translation and domain discrimination tasks. We evaluate our cross-lingual data selection method on NMT across five diverse domains in three language pairs, as well as a real-world scenario of translation for COVID-19. The results show that our proposed method outperforms other selection baselines up to +1.5 BLEU score.
Geopositioning and tracking a moving boat at sea is a very challenging problem, requiring boat detection, matching and estimating its GPS location from imagery with no common features. The problem can be stated as follows: given imagery from a camera mounted on a moving platform with known GPS location as the only valid sensor, we predict the geoposition of a target boat visible in images. Our solution uses recent ML algorithms, the camera-scene geometry and Bayesian filtering. The proposed pipeline first detects and tracks the target boats location in the image with the strategy of tracking by detection. This image location is then converted to geoposition to the local sea coordinates referenced to the camera GPS location using plane projective geometry. Finally, target boat local coordinates are transformed to global GPS coordinates to estimate the geoposition. To achieve a smooth geotrajectory, we apply unscented Kalman filter (UKF) which implicitly overcomes small detection errors in the early stages of the pipeline. We tested the performance of our approach using GPS ground truth and show the accuracy and speed of the estimated geopositions. Our code is publicly available at https://github.com/JianliWei1995/AI-Track-at-Sea.
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا