
Self-Alignment Pretraining for Biomedical Entity Representations


Publication date: 2021
Language: English





Despite the widespread success of self-supervised learning via masked language models (MLM), accurately capturing fine-grained semantic relationships in the biomedical domain remains a challenge. This is of paramount importance for entity-level tasks such as entity linking, where the ability to model entity relations (especially synonymy) is pivotal. To address this challenge, we propose SapBERT, a pretraining scheme that self-aligns the representation space of biomedical entities. We design a scalable metric learning framework that can leverage UMLS, a massive collection of biomedical ontologies with 4M+ concepts. In contrast with previous pipeline-based hybrid systems, SapBERT offers an elegant one-model-for-all solution to the problem of medical entity linking (MEL), achieving a new state-of-the-art (SOTA) on six MEL benchmarking datasets. In the scientific domain, we achieve SOTA even without task-specific supervision. With substantial improvement over various domain-specific pretrained MLMs such as BioBERT, SciBERT, and PubMedBERT, our pretraining scheme proves to be both effective and robust.
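To make the linking setup concrete, the sketch below shows how a SapBERT-style entity encoder can be queried for MEL by nearest-neighbour search over concept names. It is an illustration only: the Hugging Face model ID, the `embed` helper, and the toy concept list with UMLS-style CUIs are assumptions for the example, not part of the paper.

```python
# Minimal sketch (not the authors' code): link a surface mention to a concept
# by nearest-neighbour search in a SapBERT-style embedding space.
import torch
from transformers import AutoTokenizer, AutoModel

# Assumed public checkpoint; any compatible biomedical entity encoder would do.
MODEL_ID = "cambridgeltl/SapBERT-from-PubMedBERT-fulltext"
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModel.from_pretrained(MODEL_ID).eval()

def embed(names):
    """Encode entity names with the [CLS] vector, L2-normalised for cosine search."""
    batch = tokenizer(names, padding=True, truncation=True, max_length=25,
                      return_tensors="pt")
    with torch.no_grad():
        cls = model(**batch).last_hidden_state[:, 0]   # [CLS] token representation
    return torch.nn.functional.normalize(cls, dim=-1)

# Toy ontology: a handful of concept names with illustrative UMLS CUIs.
concepts = ["myocardial infarction", "hypertension", "type 2 diabetes mellitus"]
cuis = ["C0027051", "C0020538", "C0011860"]
concept_emb = embed(concepts)

# Link a mention by cosine similarity against all concept embeddings.
mention_emb = embed(["heart attack"])
best = (mention_emb @ concept_emb.T).argmax(dim=-1).item()
print(cuis[best], concepts[best])   # expected: C0027051 myocardial infarction
```

In a realistic setting the concept index covers millions of UMLS names, so the exhaustive dot product above would be replaced by an approximate nearest-neighbour index.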



References used
https://aclanthology.org/

Read More

Introducing biomedical informatics (BMI) students to natural language processing (NLP) requires balancing technical depth with practical know-how to address application-focused needs. We developed a set of three activities introducing introductory BMI students to information retrieval with NLP, covering document representation strategies and language models from TF-IDF to BERT. These activities provide students with hands-on experience targeted towards common use cases, and introduce fundamental components of NLP workflows for a wide variety of applications.
In this paper we present the FJWU's system submitted to the biomedical shared task at WMT21. We prepared state-of-the-art multilingual neural machine translation systems for three languages (i.e. German, Spanish and French) with English as the target language. Our NMT systems, based on the Transformer architecture, were trained on a combination of in-domain and out-of-domain parallel corpora developed using Information Retrieval (IR) and domain adaptation techniques.
Cross-lingual entity alignment (EA) aims to find the equivalent entities between cross-lingual KGs (Knowledge Graphs), which is a crucial step for integrating KGs. Recently, many GNN-based EA methods have been proposed and show decent performance improvements on several public datasets. However, existing GNN-based EA methods inevitably inherit poor interpretability and low efficiency from neural networks. Motivated by the isomorphic assumption of GNN-based methods, we successfully transform the cross-lingual EA problem into an assignment problem. Based on this re-definition, we propose a frustratingly Simple but Effective Unsupervised entity alignment method (SEU) without neural networks. Extensive experiments have been conducted to show that our proposed unsupervised approach even beats advanced supervised methods across all public datasets while having high efficiency, interpretability, and stability.
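As a rough sketch of the assignment-problem formulation mentioned above (not the SEU implementation itself), the snippet below matches the entities of two KGs one-to-one by solving a linear assignment over a similarity matrix; the embeddings and helper name are illustrative assumptions.

```python
# Hedged sketch: cast entity alignment as a linear assignment problem over a
# similarity matrix between source-KG and target-KG entity embeddings.
import numpy as np
from scipy.optimize import linear_sum_assignment

def align(src_emb, tgt_emb):
    """Return one-to-one (source, target) index pairs maximising total similarity."""
    sim = src_emb @ tgt_emb.T                 # cosine similarity if rows are unit-length
    rows, cols = linear_sum_assignment(-sim)  # negate to turn maximisation into min-cost
    return list(zip(rows.tolist(), cols.tolist()))

# Toy data: the target KG holds noisy, permuted copies of the source embeddings.
rng = np.random.default_rng(0)
src = rng.normal(size=(3, 4))
src /= np.linalg.norm(src, axis=1, keepdims=True)
tgt = src[[2, 0, 1]] + 0.05 * rng.normal(size=(3, 4))
print(align(src, tgt))   # each source entity is matched to its permuted copy
```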
Entity retrieval, which aims at disambiguating mentions to canonical entities from massive KBs, is essential for many tasks in natural language processing. Recent progress in entity retrieval shows that the dual-encoder structure is a powerful and efficient framework to nominate candidates if entities are only identified by descriptions. However, these approaches ignore the property that the meanings of entity mentions diverge in different contexts and relate to different portions of the descriptions, which previous works treat equally. In this work, we propose Multi-View Entity Representations (MuVER), a novel approach for entity retrieval that constructs multi-view representations for entity descriptions and approximates the optimal view for mentions via a heuristic searching method. Our method achieves state-of-the-art performance on ZESHEL and improves the quality of candidates on three standard Entity Linking datasets.
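The max-over-views scoring that this style of multi-view retrieval relies on can be illustrated with a small sketch; the vectors below are stand-ins for encoder outputs, not produced by the actual model.

```python
# Hedged illustration: score a mention against an entity by taking the best
# match over several "views" of its description, rather than a single vector.
import numpy as np

def score_entity(mention_vec, view_vecs):
    """Similarity of a mention to an entity = max similarity over its views."""
    return float((view_vecs @ mention_vec).max())

# Stand-in, L2-normalised vectors: two views of one entity description.
views = np.array([[1.0, 0.0],
                  [0.6, 0.8]])
mention = np.array([0.0, 1.0])
print(score_entity(mention, views))   # 0.8 -- the second view matches best
```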
This paper describes the Fujitsu DMATH systems used for WMT 2021 News Translation and Biomedical Translation tasks. We focused on low-resource pairs, using a simple system. We conducted experiments on English-Hausa, Xhosa-Zulu and English-Basque, and submitted the results for Xhosa→Zulu in the News Translation Task, and English→Basque in the Biomedical Translation Task, abstract and terminology translation subtasks. Our system combines BPE dropout, sub-subword features and back-translation with a Transformer (base) model, achieving good results on the evaluation sets.

