Simultaneous Iris and Periocular Region Detection Using Coarse Annotations

62 0 0.0 ( 0 )

Download Cite

Added by Rayson Laroca

Publication date 2019

fields Informatics Engineering

and research's language is English

Authors Diego R. Lucio - Rayson Laroca - Luiz A. Zanlorensi

Computer Vision and Pattern Recognition

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

In this work, we propose to detect the iris and periocular regions simultaneously using coarse annotations and two well-known object detectors: YOLOv2 and Faster R-CNN. We believe coarse annotations can be used in recognition systems based on the iris and periocular regions, given the much smaller engineering effort required to manually annotate the training images. We manually made coarse annotations of the iris and periocular regions (122K images from the visible (VIS) spectrum and 38K images from the near-infrared (NIR) spectrum). The iris annotations in the NIR databases were generated semi-automatically by first applying an iris segmentation CNN and then performing a manual inspection. These annotations were made for 11 well-known public databases (3 NIR and 8 VIS) designed for the iris-based recognition problem and are publicly available to the research community. Experimenting our proposal on these databases, we highlight two results. First, the Faster R-CNN + Feature Pyramid Network (FPN) model reported an Intersection over Union (IoU) higher than YOLOv2 (91.86% vs 85.30%). Second, the detection of the iris and periocular regions being performed simultaneously is as accurate as performed separately, but with a lower computational cost, i.e., two tasks were carried out at the cost of one.

rate research

SIP-SegNet: A Deep Convolutional Encoder-Decoder Network for Joint Semantic Segmentation and Extraction of Sclera, Iris and Pupil based on Periocular Region Suppression

68 - Bilal Hassan , Ramsha Ahmed , Taimur Hassan 2020

The current developments in the field of machine vision have opened new vistas towards deploying multimodal biometric recognition systems in various real-world applications. These systems have the ability to deal with the limitations of unimodal biometric systems which are vulnerable to spoofing, noise, non-universality and intra-class variations. In addition, the ocular traits among various biometric traits are preferably used in these recognition systems. Such systems possess high distinctiveness, permanence, and performance while, technologies based on other biometric traits (fingerprints, voice etc.) can be easily compromised. This work presents a novel deep learning framework called SIP-SegNet, which performs the joint semantic segmentation of ocular traits (sclera, iris and pupil) in unconstrained scenarios with greater accuracy. The acquired images under these scenarios exhibit purkinje reflexes, specular reflections, eye gaze, off-angle shots, low resolution, and various occlusions particularly by eyelids and eyelashes. To address these issues, SIP-SegNet begins with denoising the pristine image using denoising convolutional neural network (DnCNN), followed by reflection removal and image enhancement based on contrast limited adaptive histogram equalization (CLAHE). Our proposed framework then extracts the periocular information using adaptive thresholding and employs the fuzzy filtering technique to suppress this information. Finally, the semantic segmentation of sclera, iris and pupil is achieved using the densely connected fully convolutional encoder-decoder network. We used five CASIA datasets to evaluate the performance of SIP-SegNet based on various evaluation metrics. The simulation results validate the optimal segmentation of the proposed SIP-SegNet, with the mean f1 scores of 93.35, 95.11 and 96.69 for the sclera, iris and pupil classes respectively.

Computer Vision and Pattern Recognition Image and Video Processing

A comparison of the active region upflow and core properties using simultaneous spectroscopic observations from IRIS and Hinode

90 - Krzysztof Barczynski , Louise Harra , Lucia Kleint 2021

The origin of the slow solar wind is still an open issue. It has been suggested that upflows at the edge of active regions (AR) can contribute to the slow solar wind. Here, we compared the upflow region and the AR core and studied how the plasma properties change from the chromosphere via the transition region to the corona. We studied limb-to-limb observations NOAA 12687 (14th - 25th Nov 2017). We analysed spectroscopic data simultaneously obtained from IRIS and Hinode/EIS in six spectral lines. We studied the mutual relationships between the plasma properties for each emission line, as well as comparing the plasma properties between the neighbouring formation temperature lines. To find the most characteristic spectra, we classified the spectra in each wavelength using the machine learning technique k-means. We found that in the upflow region the Doppler velocities of the coronal lines are strongly correlated, but the transition region and coronal lines show no correlation. However, their fluxes are strongly correlated. The upflow region has lower density and lower temperature than the AR core. In the upflow region, the Doppler and non-thermal velocity show a strong correlation in the coronal lines, but the correlation is not seen in the AR core. At the boundary between the upflow region and the AR core, the upflow region shows an increase in the coronal non-thermal velocity, the emission obtained from the DEM, and the domination of the redshifted regions in the chromosphere. The obtained results suggest that at least three parallel mechanisms generate the plasma upflow: (1) the reconnection between closed loops and open magnetic field lines in the lower corona or upper chromosphere; (2) the reconnection between the chromospheric small-scale loops and open magnetic field; (3) the expansion of the magnetic field lines that allows the chromospheric plasma to escape to the solar corona.

Solar and Stellar Astrophysics

Multispectral Pedestrian Detection via Simultaneous Detection and Segmentation

119 - Chengyang Li , Dan Song , Ruofeng Tong 2018

Multispectral pedestrian detection has attracted increasing attention from the research community due to its crucial competence for many around-the-clock applications (e.g., video surveillance and autonomous driving), especially under insufficient illumination conditions. We create a human baseline over the KAIST dataset and reveal that there is still a large gap between current top detectors and human performance. To narrow this gap, we propose a network fusion architecture, which consists of a multispectral proposal network to generate pedestrian proposals, and a subsequent multispectral classification network to distinguish pedestrian instances from hard negatives. The unified network is learned by jointly optimizing pedestrian detection and semantic segmentation tasks. The final detections are obtained by integrating the outputs from different modalities as well as the two stages. The approach significantly outperforms state-of-the-art methods on the KAIST dataset while remain fast. Additionally, we contribute a sanitized version of training annotations for the KAIST dataset, and examine the effects caused by different kinds of annotation errors. Future research of this problem will benefit from the sanitized version which eliminates the interference of annotation errors.

Computer Vision and Pattern Recognition

Micro Stripes Analyses for Iris Presentation Attack Detection

147 - Meiling Fang , Naser Damer , Florian Kirchbuchner 2020

Iris recognition systems are vulnerable to the presentation attacks, such as textured contact lenses or printed images. In this paper, we propose a lightweight framework to detect iris presentation attacks by extracting multiple micro-stripes of expanded normalized iris textures. In this procedure, a standard iris segmentation is modified. For our presentation attack detection network to better model the classification problem, the segmented area is processed to provide lower dimensional input segments and a higher number of learning samples. Our proposed Micro Stripes Analyses (MSA) solution samples the segmented areas as individual stripes. Then, the majority vote makes the final classification decision of those micro-stripes. Experiments are demonstrated on five databases, where two databases (IIITD-WVU and Notre Dame) are from the LivDet-2017 Iris competition. An in-depth experimental evaluation of this framework reveals a superior performance compared with state-of-the-art algorithms. Moreover, our solution minimizes the confusion between textured (attack) and soft (bona fide) contact lens presentations.

Computer Vision and Pattern Recognition

End-to-End Monocular Vanishing Point Detection Exploiting Lane Annotations

125 - Hiroto Honda , Motoki Kimura , Takumi Karasawa 2021

Vanishing points (VPs) play a vital role in various computer vision tasks, especially for recognizing the 3D scenes from an image. In the real-world scenario of automobile applications, it is costly to manually obtain the external camera parameters when the camera is attached to the vehicle or the attachment is accidentally perturbed. In this paper we introduce a simple but effective end-to-end vanishing point detection. By automatically calculating intersection of the extrapolated lane marker annotations, we obtain geometrically consistent VP labels and mitigate human annotation errors caused by manual VP labeling. With the calculated VP labels we train end-to-end VP Detector via heatmap estimation. The VP Detector realizes higher accuracy than the methods utilizing manual annotation or lane detection, paving the way for accurate online camera calibration.

Computer Vision and Pattern Recognition