
What Would it Take to get Biomedical QA Systems into Practice?


Publication date: 2021
Language: English
Created by Shamra Editor





Medical question answering (QA) systems have the potential to answer clinicians' uncertainties about treatment and diagnosis on-demand, informed by the latest evidence. However, despite the significant progress in general QA made by the NLP community, medical QA systems are still not widely used in clinical environments. One likely reason for this is that clinicians may not readily trust QA system outputs, in part because transparency, trustworthiness, and provenance have not been key considerations in the design of such models. In this paper we discuss a set of criteria that, if met, we argue would likely increase the utility of biomedical QA systems, which may in turn lead to adoption of such systems in practice. We assess existing models, tasks, and datasets with respect to these criteria, highlighting shortcomings of previously proposed approaches and pointing toward what might be more usable QA systems.
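The criteria the paper discusses center on transparency, trustworthiness, and provenance. As a purely illustrative sketch (not the authors' system), the Python snippet below shows one way a QA interface could return answers bundled with traceable evidence and abstain when support is weak; all names, fields, and the confidence threshold are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional

@dataclass
class EvidenceSnippet:
    """A retrieved passage with enough metadata to trace it back to its source."""
    text: str
    source_id: str      # e.g. a PubMed ID or guideline identifier
    confidence: float   # retriever/reader score in [0, 1]

@dataclass
class Answer:
    text: Optional[str]            # None signals abstention
    evidence: List[EvidenceSnippet]

def answer_with_provenance(candidate: str,
                           evidence: List[EvidenceSnippet],
                           min_confidence: float = 0.5) -> Answer:
    """Return the candidate answer only when supporting evidence clears a
    confidence threshold; otherwise abstain but still expose what was
    retrieved, so a clinician can inspect (or reject) the underlying sources."""
    supporting = [e for e in evidence if e.confidence >= min_confidence]
    if not supporting:
        return Answer(text=None, evidence=evidence)   # abstain, keep retrievals visible
    return Answer(text=candidate, evidence=supporting)

if __name__ == "__main__":
    snippets = [EvidenceSnippet("...", "PMID:12345678", 0.82)]
    result = answer_with_provenance("Example answer", snippets)
    print(result.text, [e.source_id for e in result.evidence])
```

The design choice here is that abstention still surfaces the retrieved sources, so a clinician can audit why the system declined to answer rather than being shown nothing.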




Related research

Evaluation for many natural language understanding (NLU) tasks is broken: Unreliable and biased systems score so highly on standard benchmarks that there is little room for researchers who develop better systems to demonstrate their improvements. The recent trend to abandon IID benchmarks in favor of adversarially-constructed, out-of-distribution test sets ensures that current models will perform poorly, but ultimately only obscures the abilities that we want our benchmarks to measure. In this position paper, we lay out four criteria that we argue NLU benchmarks should meet. We argue most current benchmarks fail at these criteria, and that adversarial data collection does not meaningfully address the causes of these failures. Instead, restoring a healthy evaluation ecosystem will require significant progress in the design of benchmark datasets, the reliability with which they are annotated, their size, and the ways they handle social bias.
Infusing factual knowledge into pre-trained models is fundamental for many knowledge-intensive tasks. In this paper, we propose Mixture-of-Partitions (MoP), an infusion approach that can handle a very large knowledge graph (KG) by partitioning it into smaller sub-graphs and infusing their specific knowledge into various BERT models using lightweight adapters. To leverage the overall factual knowledge for a target task, these sub-graph adapters are further fine-tuned along with the underlying BERT through a mixture layer. We evaluate our MoP with three biomedical BERTs (SciBERT, BioBERT, PubmedBERT) on six downstream tasks (incl. NLI, QA, classification), and the results show that our MoP consistently enhances the underlying BERTs in task performance and achieves new SOTA performances on five evaluated datasets.
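To make the adapter-plus-mixture idea concrete, here is a minimal PyTorch sketch of the two components the abstract names: a lightweight bottleneck adapter per KG partition and a mixture layer that gates over them. Sizes, module names, and the gating scheme are illustrative assumptions, not the released MoP implementation.

```python
import torch
import torch.nn as nn

class Adapter(nn.Module):
    """Lightweight bottleneck adapter: down-project, non-linearity, up-project,
    plus a residual connection (one adapter per knowledge-graph partition)."""
    def __init__(self, hidden_size: int = 768, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden_size, bottleneck)
        self.up = nn.Linear(bottleneck, hidden_size)
        self.act = nn.ReLU()

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        return h + self.up(self.act(self.down(h)))

class MixtureOfAdapters(nn.Module):
    """Combines the sub-graph adapters with a learned softmax gate, so the
    target task can weight each partition's knowledge."""
    def __init__(self, num_partitions: int = 4, hidden_size: int = 768):
        super().__init__()
        self.adapters = nn.ModuleList([Adapter(hidden_size) for _ in range(num_partitions)])
        self.gate = nn.Linear(hidden_size, num_partitions)

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        # h: (batch, seq_len, hidden_size) hidden states from a BERT encoder
        weights = torch.softmax(self.gate(h), dim=-1)               # (B, T, K)
        stacked = torch.stack([a(h) for a in self.adapters], dim=-1)  # (B, T, H, K)
        return (stacked * weights.unsqueeze(2)).sum(dim=-1)           # (B, T, H)

if __name__ == "__main__":
    h = torch.randn(2, 16, 768)
    print(MixtureOfAdapters()(h).shape)  # torch.Size([2, 16, 768])
```

In MoP the adapters and gate are the main trainable additions, which is why the approach stays lightweight relative to retraining the underlying BERT on the whole KG.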
This paper describes the Tencent AI Lab submission to the WMT2021 shared task on biomedical translation in eight language directions: English-German, English-French, English-Spanish and English-Russian. We utilized different Transformer architectures, pretraining and back-translation strategies to improve translation quality. Concretely, we explore mBART (Liu et al., 2020) to demonstrate the effectiveness of the pretraining strategy. Our submissions (Tencent AI Lab Machine Translation, TMT) in German/French/Spanish⇒English are each ranked 1st according to the official evaluation results in terms of BLEU scores.
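Back-translation, one of the strategies mentioned above, augments the parallel corpus by translating monolingual target-side text back into the source language. The sketch below is generic rather than the Tencent pipeline, and it assumes some reverse model behind the hypothetical `translate_to_source` callable.

```python
from typing import Callable, Iterable, List, Tuple

def back_translate(target_sentences: Iterable[str],
                   translate_to_source: Callable[[str], str]) -> List[Tuple[str, str]]:
    """Create synthetic (source, target) pairs from monolingual target-side text.
    The synthetic source side is machine-translated while the target side is
    genuine, which is why back-translation tends to help the decoder most."""
    pairs = []
    for tgt in target_sentences:
        synthetic_src = translate_to_source(tgt)   # hypothetical target->source model
        pairs.append((synthetic_src, tgt))
    return pairs

# The synthetic pairs are typically mixed with the genuine parallel corpus
# before continued training of the source->target model.
if __name__ == "__main__":
    dummy = lambda s: "<bt> " + s   # stand-in for a real reverse translation model
    print(back_translate(["Ein medizinischer Satz."], dummy))
```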
When a model attribution technique highlights a particular part of the input, a user might understand this highlight as making a statement about counterfactuals (Miller, 2019): if that part of the input were to change, the model's prediction might change as well. This paper investigates how well different attribution techniques align with this assumption on realistic counterfactuals in the case of reading comprehension (RC). RC is a particularly challenging test case, as token-level attributions that have been extensively studied in other NLP tasks such as sentiment analysis are less suitable to represent the reasoning that RC models perform. We construct counterfactual sets for three different RC settings, and through heuristics that can connect attribution methods' outputs to high-level model behavior, we can evaluate how useful different attribution methods and even different formats are for understanding counterfactuals. We find that pairwise attributions are better suited to RC than token-level attributions across these different RC settings, with our best performance coming from a modification that we propose to an existing pairwise attribution method.
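For orientation, a token-level attribution can be as simple as gradient-times-input saliency over token embeddings, as in the toy PyTorch sketch below; pairwise attributions, which the paper finds better suited to RC, instead assign scores to pairs of tokens (e.g. a question token and a context token). The toy model and all names here are hypothetical illustrations, not the paper's method.

```python
import torch
import torch.nn as nn

# Toy classifier over token embeddings; stands in for an RC model's reader head.
vocab_size, dim = 100, 16
embed = nn.Embedding(vocab_size, dim)
classifier = nn.Linear(dim, 2)

def grad_x_input_attribution(token_ids: torch.Tensor, target_class: int) -> torch.Tensor:
    """Token-level gradient-x-input saliency: how much each token's embedding
    contributes to the score of `target_class`. Pairwise methods would instead
    assign one score per (token_i, token_j) pair."""
    embs = embed(token_ids).detach().requires_grad_(True)   # (T, dim), leaf tensor
    score = classifier(embs.mean(dim=0))[target_class]      # scalar class score
    score.backward()
    return (embs.grad * embs).sum(dim=-1)                   # (T,) per-token attribution

if __name__ == "__main__":
    ids = torch.tensor([3, 17, 42, 8])
    print(grad_x_input_attribution(ids, target_class=1))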
This paper describes the submission of Huawei Translation Service Center (HW-TSC) to the WMT21 biomedical translation task in two language pairs: Chinese↔English and German↔English (our registered team name is HuaweiTSC). Technical details are introduced in this paper, including the model framework, data pre-processing method and model enhancement strategies. In addition, using the WMT20 OK-aligned biomedical test set, we compare and analyze system performances under different strategies. On the WMT21 biomedical translation task, our systems in the English→Chinese and English→German directions achieved the highest BLEU scores among all submissions according to the official evaluation results.

