Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Scientific Text Mining

التنقيب في النصوص العلمية

3752 1 92 0 ( 0 )

Download Cite

Added by جامعة دمشق حلقة بحث

Publication date 2018

fields Informatics Engineering

and research's language is العربية

Authors آمال النوري( طالب ) - رغد جمال الدين( طالب ) - مرام كماشة( طالب )

Created by Amal AlNouri

Machine learning Data Mining Deep Learning automatic text summarization

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

No English abstract

Artificial intelligence review:

Upgrade your account to view the content

Research summary

يتناول هذا البحث موضوع التنقيب في النصوص العلمية، ويهدف إلى تحليل النصوص لاستخلاص معارف ذات جودة عالية من عدة مصادر نصية وربطها لتشكيل حقائق وفرضيات جديدة. يركز البحث على الأوراق البحثية كتمثيل للمعرفة البشرية، ويستفيد من حركة الوصول المفتوح والتعلم الآلي لتسهيل عملية التنقيب. تتضمن أهداف التنقيب دعم الوصول الاستكشافي للأدب العلمي، تحليل ومقارنة نتائج الأبحاث، تلخيص نتائج الأبحاث، أتمتة مراجعة الأدب العلمي، والإجابة عن الأسئلة العلمية. يناقش البحث أيضًا طرق تلخيص النصوص العلمية، سواء كانت Extractive أو Abstractive، ويستعرض مجموعة من الأبحاث في هذا المجال. كما يتناول البحث استخراج الصور التوضيحية والجداول من الأوراق البحثية، ويقدم خوارزميات لتحديد هذه العناصر وإسناد المسميات التوضيحية لها. بالإضافة إلى ذلك، يناقش البحث تحديد الاستشهادات المهمة في الأدبيات العلمية، ويقترح نماذج لتصنيف الاستشهادات بناءً على أهميتها. أخيرًا، يقدم البحث نظام Citeomatic كأداة لمراجعة الأدب العلمي، ويشرح كيفية تدريب النموذج للتنبؤ بالاستشهادات ذات المغزى.

Critical review

دراسة نقدية: يعتبر هذا البحث شاملاً ومفيداً في مجال التنقيب في النصوص العلمية، حيث يغطي مجموعة واسعة من المواضيع بدءًا من تلخيص الأوراق البحثية وصولاً إلى استخراج الصور والجداول وتحديد الاستشهادات المهمة. ومع ذلك، يمكن أن يكون هناك بعض التحديات في تطبيق الخوارزميات المقترحة على نطاق واسع، خاصةً فيما يتعلق بدقة استخراج الصور والجداول من تنسيقات مختلفة للأوراق البحثية. كما أن الاعتماد الكبير على الأدوات البرمجية مثل Poppler وPDFBox قد يحد من إمكانية التوسع في تطبيق هذه الخوارزميات على أنواع أخرى من الوثائق. بالإضافة إلى ذلك، يمكن تحسين النموذج المقترح لتحديد الاستشهادات المهمة من خلال تضمين مزيد من البيانات التدريبية المتنوعة لضمان دقة أعلى في التصنيف. بشكل عام، يقدم البحث إسهامات قيمة في مجال التنقيب في النصوص العلمية، ولكنه يحتاج إلى مزيد من التحسينات لتطبيقه بشكل فعال على نطاق واسع.

Questions related to the research

ما هي الأهداف الرئيسية لعملية التنقيب في النصوص العلمية؟

الأهداف الرئيسية تشمل دعم الوصول الاستكشافي للأدب العلمي، تحليل ومقارنة نتائج الأبحاث، تلخيص نتائج الأبحاث، أتمتة مراجعة الأدب العلمي، والإجابة عن الأسئلة العلمية.
ما الفرق بين طرق تلخيص النصوص العلمية Extractive وAbstractive؟

الطريقة Extractive تعتمد على نسخ أجزاء من الوثيقة لتشكيل الملخص، بينما الطريقة Abstractive تولد ملخصاً جديداً قد يحتوي على عبارات لا تظهر في الوثيقة الأصلية.
ما هي الخوارزميات المستخدمة لاستخراج الصور والجداول من الأوراق البحثية؟

الخوارزميات تشمل تحديد بدايات نصوص المسميات التوضيحية، تحديد مناطق الكتل النصية والرسومية، وإسناد المسمى التوضيحي إلى منطقة الرسم التوضيحي الذي يشير إليه.
كيف يتم تحديد الاستشهادات المهمة في الأدبيات العلمية؟

يتم تحديد الاستشهادات المهمة من خلال نماذج تصنيف تعتمد على سمات مثل عدد الاستشهادات المباشرة وغير المباشرة، موقع الاقتباس في الورقة، وجود عبارات تدل على أهمية العمل، وظهور الاقتباس في أشكال توضيحية أو جداول.

Keywords

التنقيب في النصوص العلمية تلخيص الأوراق البحثية استخراج الصور والجداول تحديد الاستشهادات المهمة Citeomatic التعلم الآلي الوصول المفتوح

References used

Petr Knoth, and Phil Gooch, A. (22 September 2015). An Introduction to Text Mining Research Papers [PDF]. Retrieved from https://www.uksg.org/sites/uksg.org/files/Text-Mining-Research-Papers.pptx.pdf.

(Visser, W. T., and M. B. Wieling. "Sentence-based summarization of scientific documents." The design and implementation of an online available automatic summarizer. Report, last retrieved Nov. 29th (2007).

Qazvinian, Vahed, and Dragomir R. Radev. "Scientific paper summarization using citation summary networks." Proceedings of the 22nd International Conference on Computational Linguistics-Volume 1. Association for Computational Linguistics, 2008

, Horacio, and Francesco Ronzano. "Trainable citation-enhanced summarization of scientific articles." Proceedings of the Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL). 2016

(Collins, Ed, Isabelle Augenstein, and Sebastian Riedel. "A Supervised Approach to Extractive Summarization of Scientific Papers." arXiv preprint arXiv:1706.03946 (2017)

Clark, Christopher Andreas, and Santosh Kumar Divvala. "Looking Beyond Text: Extracting Figures, Tables and Captions from Computer Science Papers." AAAI Workshop: Scholarly Big Data. 2015

Clark, Christopher, and Santosh Divvala. "PDFFigures 2.0: Mining figures from research papers." Digital Libraries (JCDL), 2016 IEEE/ACM Joint Conference on. IEEE, 2016

Valenzuela, Marco, Vu Ha, and Oren Etzioni. "Identifying Meaningful Citations." AAAI Workshop: Scholarly Big Data. 2015

“Citeomatic: Automated Literature Review”. The Allen Institute for Artificial Intelligence, 2017. Retrieved from http://allenai.org/semantic-scholar/citeomatic

rate research

trajectory data mining

1733 - Damascus University 2018 ورقة بحثية

The advances in location-acquisition and mobile computing techniques have generated massive spatial trajectory data, which represent the mobility of a diversity of moving objects, such as people, vehicles and animals. Many techniques have been propos ed for processing, managing and mining trajectory data in the past decade, fostering a broad range of applications. In this article, we conduct a systematic survey on the major research into trajectory data mining, providing a panorama of the field as well as the scope of its research topics. Following a roadmap from the derivation of trajectory data, to trajectory data preprocessing, to trajectory data management, and to a variety of mining tasks (such as trajectory pattern mining, outlier detection, and trajectory classification), the survey explores the connections, correlations and differences among these existing techniques. This survey also introduces the methods that transform trajectories into other data formats, such as graphs, matrices, and tensors, to which more data mining and machine learning techniques can be applied. Finally, some public trajectory datasets are presented. This survey can help shape the field of trajectory data mining, providing a quick understanding of this field to the community.

trajectory data mining

Role of Data Mining in Cyber Security

2548 - Damascus University 2018 حلقة بحث

Data mining is becoming a pervasive technology in activities as diverse as using historical data to predict the success of a marketing campaign looking for patterns in financial transactions to discover illegal activities. From this perspective it wa s just a matter of time for the discipline to reach the important area of computer security This research presents a collection of research efforts on the use of data mining in computer security.

Security Data Mining Cyber

Retrieval of Images & Text using Data Mining Techniques

1786 - منى عبد الله_مريم اسماعيل_مرح جربوع 2019 ورقة بحثية

في هذا البحث تٌستخدم تقنيات استكشاف الصور كالتجميع و قواعد الاستكشاف لاستكشاف المعرفة من الصورة و أيضاً يستخدم دمج الميزات متعددة الوسائط مثل البصرية و النصية.

التجميع الارتباط استرجاع الصور استكشاف المعرفة استكشاف الصور استخراج الميزات

Employing Data Mining Algorithms in Traffic Accidents Analyzing

7024 - Tishreen University 2015 ورقة بحثية

In this paper we introduce a comparison for some of data mining algorithm for traffic accidents analysis. We start by describing available data for entry by analyzing the structure of statistical reports in Lattakia traffic directorate, and proceed to data mining stage which enables us to smart study of factors that play roles in traffic accident and find its inter-relations and importance for causing traffic accident. That comes after building data warehouse upon the database we built to store the data we gathered. In this research we list a some of models was tested which is a sample of a many cases we checked to have the research results.

Classification العنقدة Clustering Data Mining التنقيب في البيانات التصنيف حوادث المرور مستودعات البيانات قواعد بيانات معرفية قواعد الاقتران Traffic Accidents Data Warehouse Knowledge Database Association Rules المزيد..

Using Data Mining Techniques to Analyze Students’ Performance

4373 - Aِl-Baath University 2016 ورقة بحثية

The objective of this paper is presenting an educational data mining case study, by applying data mining techniques.

Data Mining التنقيب في البيانات التنقيب في بيانات المؤسسات التعليمية برنامج ذكاء الأعمال الخاص بشركة مايكروسوفت خوارزمية مايكروسوفت لأشجار القرار خوارزمية مايكروسوفت للتجميع و الكشف عن الحالات الشاذة خوارزمية مايكروسوفت لقواعد الارتباط Educational Data Mining SQL Server Business Intelligence Development Studio Microsoft Decision Trees Microsoft Clustering Outlier Detection Microsoft Association Rules المزيد..

Scientific Text Mining

التنقيب في النصوص العلمية

Ask ChatGPT about the research

No English abstract

Read More

suggested questions