The casual, neutral, and formal language registers are highly perceptible in discourse productions. However, they are still poorly studied in Natural Language Processing (NLP), especially outside English, and for new textual types like tweets. To stimulate research, this paper introduces a large corpus of 228,505 French tweets (6M words) annotated with language registers. Labels are provided by a multi-label CamemBERT classifier trained and checked on a manually annotated subset of the corpus, while the tweets are selected to avoid undesired biases. Based on the corpus, an initial analysis of linguistic traits, drawn either from human annotators or from automatic extraction, is provided to describe the corpus and pave the way for various NLP tasks. The corpus, annotation guide and classifier are available at http://tremolo.irisa.fr.
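To illustrate what "multi-label" means for register annotation, the following is a minimal sketch and not the authors' classifier. It assumes the model emits one logit per register and that each register is kept when its independent sigmoid score reaches a threshold; the function name, the example logits and the 0.5 threshold are all hypothetical.

```python
import math

# The three registers named in the abstract.
REGISTERS = ("casual", "neutral", "formal")


def sigmoid(x: float) -> float:
    """Map a raw logit to a score in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def predict_registers(logits, threshold=0.5):
    """Keep every register whose independent sigmoid score reaches the threshold.

    Unlike a softmax decision, several registers (or none) can be returned.
    The threshold value is an assumption made for this sketch.
    """
    scores = {name: sigmoid(z) for name, z in zip(REGISTERS, logits)}
    labels = [name for name, score in scores.items() if score >= threshold]
    return labels, scores


# Hypothetical logits for one tweet.
labels, scores = predict_registers([2.0, 0.3, -3.0])
print(labels)
```

With these example logits the tweet receives both the casual and the neutral label, which a single-label classifier could not express.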
Customers of machine learning systems demand accountability from the companies employing these algorithms for various prediction tasks. Accountability requires understanding a system's limits and the conditions under which it makes erroneous predictions, as customers are often interested in understanding the incorrect predictions, and model developers focus on finding methods that yield incremental improvements to an existing system. Therefore, we propose an accountable error characterization method, AEC, to understand when and where errors occur within existing black-box models. AEC, as constructed with human-understandable linguistic features, allows model developers to automatically identify the main sources of errors for a given classification system. It can also be used to sample the most informative input points for the next round of training. We perform error detection for a sentiment analysis task using AEC as a case study. Our results on the sample sentiment task show that AEC is able to characterize erroneous predictions into human-understandable categories and also achieves promising results on selecting erroneous samples when compared with uncertainty-based sampling.
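The uncertainty-based sampling baseline mentioned above can be sketched as follows. This is a generic least-confidence variant, not AEC itself and not necessarily the exact baseline used in the paper; the function names and the probability values are hypothetical.

```python
def least_confidence(probs):
    """Uncertainty score: 1 minus the probability of the predicted class."""
    return 1.0 - max(probs)


def select_most_uncertain(predictions, k):
    """Return the ids of the k inputs whose predictions are least confident."""
    ranked = sorted(
        predictions,
        key=lambda item: least_confidence(item[1]),
        reverse=True,
    )
    return [sample_id for sample_id, _ in ranked[:k]]


# Hypothetical (id, [P(negative), P(positive)]) pairs from a sentiment classifier.
predictions = [
    ("s1", [0.98, 0.02]),
    ("s2", [0.55, 0.45]),
    ("s3", [0.10, 0.90]),
    ("s4", [0.51, 0.49]),
]

selected = select_most_uncertain(predictions, k=2)
print(selected)
```

The two samples closest to the decision boundary are selected; such a baseline ranks inputs only by model confidence and offers no human-readable explanation of why a prediction might be wrong.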
The study was conducted in four sites belonging to the Sheikh Badr district in Tartous
province. Ten different types were identified according to international standards. During
the 2016 and 2017 growth seasons, readings were recorded for phenotypic parameters as
well as physical and chemical analysis of fruit clusters. The studied types differed in many
characteristics. The results of the cluster analysis showed the distribution of the studied
types in two groups with a variance of 93%. The mean cluster weight ranged
from medium (349 g) to large (1140.45 g), the total soluble solids (TSS)
content from low (12.75%) to high (18.82%), and the acidity from very low
(3.53 g/l) to medium (6.38 g/l).
The wide geographical spread and the exponential growth of the numbers of goats
around the world clearly demonstrate the ability of these ruminants to adapt to
harsh climates and grazing land. The aim of this study was to assess the genetic
diversity of 42 samples of Syrian goats from several domestication stations, including
Jabali, Shami and hybrid (crosses between Shami and Jabali) animals. DNA was
extracted from these samples and analyzed with SSR technology using
7 microsatellite markers. The markers yielded a total of 29 alleles, an average of
4.1 alleles per genetic locus. The number of alleles per locus ranged from 3
(markers BMS1714, INRAD07 and SRCRSP09) to 8 (marker SRCRSP01). Allele
frequencies across all studied loci ranged from 0.071429 for the 210 bp allele
at the McM527 locus to 0.97619 for the 55 bp allele at the SRCRSP09 locus.
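The summary statistics above can be checked with a short calculation. This is a sketch under stated assumptions: the abstract does not report raw allele counts, so the copy counts below (6 and 82 out of 84) are back-calculated on the assumption of diploid scoring of all 42 animals, and the function name is hypothetical.

```python
from fractions import Fraction

N_MARKERS = 7        # microsatellite markers, from the abstract
TOTAL_ALLELES = 29   # alleles detected over all markers, from the abstract
N_SAMPLES = 42       # goats sampled, from the abstract

# Mean number of alleles per locus: 29 / 7.
mean_alleles_per_locus = TOTAL_ALLELES / N_MARKERS


def allele_frequency(copies, n_individuals, ploidy=2):
    """Frequency of one allele = its copies / total allele copies at the locus."""
    return Fraction(copies, ploidy * n_individuals)


# Assumed counts (not reported in the source): with diploid scoring there are
# 2 * 42 = 84 allele copies per locus.
rare = allele_frequency(6, N_SAMPLES)     # 6/84
common = allele_frequency(82, N_SAMPLES)  # 82/84

print(round(mean_alleles_per_locus, 1))
print(round(float(rare), 6), round(float(common), 5))
```

Under these assumptions the two extreme frequencies correspond to 6 and 82 allele copies out of 84, which reproduces the reported values to the printed precision.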
This research was carried out in the laboratories of the Molecular
Biology and Biotechnology Department of the Atomic Energy
Commission in Damascus to characterize, at the molecular level,
five local and introduced grape varieties collected from
the Pome and Grapevine Research Center in Sweida, using 20
SSR primer pairs.
This research was conducted during 2014-2015 in six locations at altitudes of
0-500 m in Tartous Governorate, Syria, in order to survey and evaluate mulberry
types belonging to Morus alba and Morus nigra. Thirty-three types (22 of white
mulberry and 11 of black mulberry) were identified for morphological and
phenological characterization, for determining the degree of morphological
relationship between the types, and for establishing their taxonomic keys.
The morphological analysis was based on 14 morphological characteristics (leaves,
catkins, fruits), followed by a cluster analysis of all types belonging to the white
and black species together and of each species alone.
The morphological tree for all types showed two main groups with a contrast ratio of
48%. The first group included three types belonging to black mulberry, and the second
group included 30 types belonging to both species, with a contrast ratio of 0-38%. The
morphological tree for the black species types showed a contrast ratio of 47%, and
that for the white species types a contrast ratio of 33%.
The degree of morphological similarity varies between the studied types
regardless of fruit color, and the percentage of contrast among the black mulberry types is
higher than among the white mulberry types.
This research highlights a new approach to the study of petroleum
reservoirs that integrates a sedimentary study, including
micro slides, with the interpretation of geophysical well logs,
applied to the Juriba and Transition Zone formations
in the ELmhash structure oil field, in the Euphrates depression
west of DER Azzor, Syria.
Five wells in the ELmhash structure were selected: Mahash 17, 18, 11, 4
and 7.
Ten wild apple (Malus trilobata (Lab)) phenotypes were morphologically characterized
in the Jableh mountains during 2013-2014, in five areas (Pichraghe, Helbako,
Almonaizlah, Raas al chaara, Aldalia) at altitudes of 780-1250 m above sea level.
The morphological characterization covered the tree, stem, shoots, leaves, flowers,
fruit and seed. Cluster analysis of 12 characteristics divided the types into two
groups, with a variation ratio reaching 57%. The first group included four types
(two from Raas al chaara (R1, R2), one from Pichraghe (P1) and one from
Almonaizlah (M2)); the lowest variation ratio (29.4%) was between P1 and R1.
The second group included six types (two from Pichraghe (P2, P3), two from
Helbako (H1, H2), one from Almonaizlah (M1) and one from Aldalia (D1)) and
comprised two subgroups with a variation ratio reaching 52%: the first (P2, P3)
and the second (H1, H2, M1, D1). The lowest variation ratio (23%) was between
H1 and H2.
The aims of this research were to morphologically characterize 27 local genotypes of
tulip, to evaluate the genetic variability between them using 32 morphological traits, and to
determine their chromosome number.
The results allowed the studied genotypes to be distinguished on the basis of the
morphological characterization of all vegetative, floral and bulb traits. The results confirmed
that the main factor limiting distribution over the surveyed region was the height above
sea level. All genotypes were divided into two main groups: the first included genotypes
located at altitudes of 400-850 m, and the second those located at 950-1450 m.
The microscopic analysis showed that all the studied genotypes were diploid
(2n=2x=24 chromosomes).
Based on the results of the morphological characterization and the chromosome
number, the local genotypes studied can be considered closest to the species
Tulipa agenensis D.C.
The random amplified polymorphic DNA (RAPD) technique was used to study the genetic relationships among 10 wild accessions of Origanum syriacum grown in Syria.
The results of this study may have a great impact on the
national biodiversity program, which lacks the use of
molecular techniques. We therefore recommend using
these techniques for genotype identification in general and
for studying the relationships among genotypes, particularly
for important wild species of medicinal value that
have not yet been described at the molecular level.