Large language models have shown promising results in zero-shot settings. For example, they can perform multiple choice tasks simply by conditioning on a question and selecting the answer with the highest probability. However, ranking by string probability can be problematic due to surface form competition: different surface forms compete for probability mass even if they represent the same underlying concept in a given context, e.g. "computer" and "PC." Since probability mass is finite, this lowers the probability of the correct answer, due to competition from other strings that are valid answers (but not one of the multiple choice options). We introduce Domain Conditional Pointwise Mutual Information, an alternative scoring function that directly compensates for surface form competition by simply reweighing each option according to its a priori likelihood within the context of a specific task. It achieves consistent gains in zero-shot performance over both calibrated and uncalibrated scoring functions on all GPT-2 and GPT-3 models on a variety of multiple choice datasets.
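The scoring rule can be sketched concretely. Below is a minimal illustration, assuming the abstract's definition of domain conditional PMI as P(answer | question) divided by P(answer | domain premise), computed in log space; the GPT-2 checkpoint, the example question, the domain premise string, and the log_prob helper are all illustrative assumptions here, not the authors' exact setup.

# Minimal sketch of Domain Conditional PMI scoring for multiple choice.
# Assumes PMI_DC(x, y) = P(y | x) / P(y | x_domain), where x_domain is a
# short domain premise; in log space: log P(y | x) - log P(y | x_domain).
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

def log_prob(context: str, continuation: str) -> float:
    """Sum of token log-probabilities of `continuation` given `context`."""
    ctx_ids = tokenizer(context, return_tensors="pt").input_ids
    cont_ids = tokenizer(continuation, return_tensors="pt").input_ids
    input_ids = torch.cat([ctx_ids, cont_ids], dim=1)
    with torch.no_grad():
        logits = model(input_ids).logits
    # log_probs[p] is the distribution over the token at position p + 1.
    log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
    cont_positions = range(ctx_ids.shape[1] - 1, input_ids.shape[1] - 1)
    return sum(log_probs[p, input_ids[0, p + 1]].item() for p in cont_positions)

# Hypothetical task: the question, options, and domain premise are made up
# for illustration; the paper chooses the domain premise per task.
question = "Q: What is a common name for a personal computer? A:"
domain_premise = "A:"
options = [" PC", " banana"]

for option in options:
    raw = log_prob(question, option)                 # plain string probability
    pmi_dc = raw - log_prob(domain_premise, option)  # domain conditional PMI
    print(f"{option!r}: log P = {raw:.2f}, log PMI_DC = {pmi_dc:.2f}")

Dividing out the option's a priori likelihood under the domain premise removes the penalty that frequent surface forms would otherwise receive, which is exactly the reweighing the abstract describes.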
Word meaning is notoriously difficult to capture, both synchronically and diachronically. In this paper, we describe the creation of the largest resource of graded contextualized, diachronic word meaning annotation in four different languages, based on 100,000 human semantic proximity judgments. We describe in detail the multi-round incremental annotation process, the choice of a clustering algorithm to group usages into senses, and possible diachronic and synchronic uses for this dataset.