No Arabic abstract
For modern flash-based SSDs, the performance overhead of internal data migrations is dominated by the data transfer time, not by the flash program time as in old SSDs. In order to mitigate the performance impact of data migrations, we propose rCopyback, a restricted version of copyback. Rcopyback works like the original copyback except that only n consecutive copybacks are allowed. By limiting the number of successive copybacks, it guarantees that no data reliability problem occurs when data is internally migrated using rCopyback. In order to take a full advantage of rCopyback, we developed a rCopyback-aware FTL, rcFTL, which intelligently decides whether rCopyback should be used or not by exploiting varying host workloads. Our evaluation results show that rcFTL can improve the overall I/O throughput by 54% on average over an existing FTL which does not use copybacks.
Emerging storage systems with new flash exhibit ultra-low latency (ULL) that can address performance disparities between DRAM and conventional solid state drives (SSDs) in the memory hierarchy. Considering the advanced low-latency characteristics, different types of I/O completion methods (polling/hybrid) and storage stack architecture (SPDK) are proposed. While these new techniques are expected to take costly software interventions off the critical path in ULL-applied systems, unfortunately no study exists to quantitatively analyze system-level characteristics and challenges of combining such newly-introduced techniques with real ULL SSDs. In this work, we comprehensively perform empirical evaluations with 800GB ULL SSD prototypes and characterize ULL behaviors by considering a wide range of I/O path parameters, such as different queues and access patterns. We then analyze the efficiencies and challenges of the polled-mode and hybrid polling I/O completion methods (added into Linux kernels 4.4 and 4.10, respectively) and compare them with the efficiencies of a conventional interrupt-based I/O path. In addition, we revisit the common expectations of SPDK by examining all the system resources and parameters. Finally, we demonstrate the challenges of ULL SSDs in a real SPDK-enabled server-client system. Based on the performance behaviors that this study uncovers, we also discuss several system implications, which are required to take a full advantage of ULL SSD in the future.
Accuracy in the absolute position in the sky is one of the limiting factors for pulsar timing, and timing parameters have a direct impact on the understanding of the physics of Isolated Neutron Stars (INS). We report here on a high-accuracy measurement of the optical position of Geminga (V=25.5), the only known radio-quiet INS. The procedure combines the Hipparcos and Tycho catalogues, ground-based astrometric data,and Hubble Space Telescope (HST) Wide Field Planetary Camera (WFPC2) images, to yield Gemingas absolute position to within ~40 mas (per coordinate). Such a positional accuracy, unprecedented for the optical position of a pulsar or an object this faint, is needed to combine in phase gamma-ray photons collected over more than 20 years, i.e. over 2.5 billions of star revolutions. Although quite a difficult task, this is the only way to improve our knowledge of the timing parameters of this radio silent INS.
Edge computing is the natural progression from Cloud computing, where, instead of collecting all data and processing it centrally, like in a cloud computing environment, we distribute the computing power and try to do as much processing as possible, close to the source of the data. There are various reasons this model is being adopted quickly, including privacy, and reduced power and bandwidth requirements on the Edge nodes. While it is common to see inference being done on Edge nodes today, it is much less common to do training on the Edge. The reasons for this range from computational limitations, to it not being advantageous in reducing communications between the Edge nodes. In this paper, we explore some scenarios where it is advantageous to do training on the Edge, as well as the use of checkpointing strategies to save memory.
Over the last decades quaternions have become a crucial and very successful tool for attitude representation in robotics and aerospace. However, there is a major problem that is continuously causing trouble in practice when it comes to exchanging formulas or implementations: there are two quaternion multiplications in common use, Hamiltons original multiplication and its flipped version, which is often associated with NASAs Jet Propulsion Laboratory. We believe that this particular issue is completely avoidable and only exists today due to a lack of understanding. This paper explains the underlying problem for the popular passive world to body usage of rotation quaternions, and derives an alternative solution compatible with Hamiltons multiplication. Furthermore, it argues for entirely discontinuing the flipped multiplication. Additionally, it provides recipes for efficiently detecting relevant conventions and migrating formulas or algorithms between them.
Precoding has stood out as a promising multi-user transmission technique to meet the emerging throughput demand of satellite communication systems while awaiting the technological maturity for exploiting higher bands. Precoding enables the reduction of interference among co-channel beams through spatial processing while promoting aggressive frequency reuse and improving spectral efficiency. Satellite systems offer multitude of system and service configurations, resulting in different precoder design methodologies. This article explores the motivation for the introduction of precoding, offers an insight to their theoretical development in a diverse scenarios and presents some avenues for future development.