A Multi-View Approach Based on Naming Behavioral Modeling for Aligning Chinese User Accounts across Multiple Networks

Added by Junxing Zhu

Publication date: 2020

Field: Informatics Engineering

Language: English

Authors: Junxing Zhu, Xiang Wang, Qiang Liu

Social and Information Networks

Hundreds of millions of Chinese people have become social network users in recent years, and aligning the accounts of common Chinese users across multiple social networks is valuable to many inter-network applications, e.g., cross-network recommendation and cross-network link prediction. Many methods have explored how to use account name information when aligning the accounts of common English users. However, how to properly use account name information when aligning Chinese user accounts has yet to be studied in detail. In this paper, we first discuss the available naming behavioral models and the related features for different types of Chinese account name matching. Second, we propose the Multi-View Cross-Network User Alignment (MCUA) method, which uses a multi-view framework to integrate different models for different types of Chinese account name matching and can consider all of the studied features when aligning Chinese user accounts. Finally, we conduct experiments showing that MCUA can outperform many existing methods at aligning Chinese user accounts between Sina Weibo and Twitter. We also study the best learning models and the top-k most valuable features for each type of name matching in MCUA on our experimental data sets.
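The idea of scoring a pair of account names under several views and then combining the scores can be illustrated with a minimal sketch. This is not the MCUA method itself: the three views (`bigram_view`, `prefix_view`, `exact_view`), the `VIEWS` table, and the weighted average in `multi_view_score` are all hypothetical stand-ins chosen for illustration, and the abstract does not specify which features or learning models the authors use.

```python
from typing import Callable, Dict, Optional, Set


def char_bigrams(name: str) -> Set[str]:
    """Return the set of character bigrams of a name (works for any script)."""
    s = name.lower()
    if len(s) < 2:
        return {s} if s else set()
    return {s[i:i + 2] for i in range(len(s) - 1)}


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def bigram_view(n1: str, n2: str) -> float:
    """Hypothetical view 1: overlap of character bigrams."""
    return jaccard(char_bigrams(n1), char_bigrams(n2))


def prefix_view(n1: str, n2: str) -> float:
    """Hypothetical view 2: common-prefix length over the longer name's length."""
    a, b = n1.lower(), n2.lower()
    if not a or not b:
        return 0.0
    common = 0
    for x, y in zip(a, b):
        if x != y:
            break
        common += 1
    return common / max(len(a), len(b))


def exact_view(n1: str, n2: str) -> float:
    """Hypothetical view 3: case-insensitive exact match."""
    return 1.0 if n1.lower() == n2.lower() else 0.0


# Assumed set of views; a real system would learn a model per view.
VIEWS: Dict[str, Callable[[str, str], float]] = {
    "bigram": bigram_view,
    "prefix": prefix_view,
    "exact": exact_view,
}


def multi_view_score(n1: str, n2: str,
                     weights: Optional[Dict[str, float]] = None) -> float:
    """Combine per-view scores with a weighted average (uniform by default)."""
    w = weights or {k: 1.0 for k in VIEWS}
    total = sum(w.values())
    return sum(w[k] * VIEWS[k](n1, n2) for k in VIEWS) / total
```

With uniform weights, `multi_view_score("xiaoming", "xiaoming88")` is higher than `multi_view_score("xiaoming", "dawei")`, since the first pair shares bigrams and a prefix while the second pair shares neither.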

Understanding User Topic Preferences across Multiple Social Networks

Ziqing Zhu, Jiuxin Cao, Tao Zhou, 2021

In recent years, social networks have diversified in function and application, and people have begun to use multiple online social networks simultaneously for different needs. The ability to uncover a user's latent topic and social network preferences is critical for community detection, recommendation, and personalized service across social networks. Unfortunately, most current work focuses on a single network, so new techniques and models are needed to address this issue. This paper proposes a user preference discovery model for multiple social networks. First, global and local topic concepts are defined; then a latent semantic topic discovery method is used to obtain global and local topic word distributions, along with user topic and social network preferences. After that, the topic distribution characteristics of different social networks are examined, as well as the reasons why users choose one network over another to create a post. Next, a Gibbs sampling algorithm is adopted to obtain the model parameters. In the experiment, we collect data from Twitter, Instagram, and Tumblr to build a dataset of multiple social networks. Finally, we compare our model with previous work, and both qualitative and quantitative evaluation results demonstrate its effectiveness.

Social and Information Networks

Community Detection Across Multiple Social Networks based on Overlapping Users

Ziqing Zhu, Tao Zhou, Chenghao Jia, 2019

With the rapid development of Internet technology, online social networks (OSNs) have grown quickly and become increasingly popular. Meanwhile, research across multiple social networks is attracting more and more attention, and cross-OSN community detection is important for online security problems such as user behavior analysis and abnormal community discovery. In this paper, a community detection method across multiple social networks is proposed based on overlapping users. First, the concept of overlapping users is defined; then an algorithm, CMN NMF, is designed to discover stub communities from overlapping users based on social relevance. After that, we extend each stub community in the different social networks by adding users with strong similarity, and finally the different communities across networks are extracted. Experimental results on real data sets show that our method is more effective than other methods.
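The extension step described above, growing a stub community by adding strongly similar users, can be sketched as follows. This is only an illustration under stated assumptions, not the authors' CMN NMF algorithm: the similarity measure (the fraction of a user's neighbours already in the community), the `threshold` value, and the function name `expand_community` are all hypothetical, since the abstract does not define how similarity is computed.

```python
from typing import Dict, Set


def expand_community(graph: Dict[str, Set[str]],
                     seed: Set[str],
                     threshold: float = 0.6) -> Set[str]:
    """Grow a seed (stub) community by repeatedly adding similar users.

    Assumption: a user's similarity to the community is the fraction of
    that user's neighbours who are already members.
    """
    community = set(seed)
    changed = True
    while changed:
        changed = False
        # Sorted iteration keeps the result deterministic.
        for user in sorted(graph):
            if user in community:
                continue
            neighbours = graph[user]
            if not neighbours:
                continue  # isolated users are never added
            similarity = len(neighbours & community) / len(neighbours)
            if similarity >= threshold:
                community.add(user)
                changed = True
    return community


# Toy undirected graph: a, b, c form a triangle; d, e, f trail off as a chain.
toy_graph = {
    "a": {"b", "c"},
    "b": {"a", "c"},
    "c": {"a", "b", "d"},
    "d": {"c", "e"},
    "e": {"d", "f"},
    "f": {"e"},
}
stub = {"a", "b"}  # e.g. overlapping users found in both networks
expanded = expand_community(toy_graph, stub)
```

On this toy graph, user `c` joins the community because two of its three neighbours are members, while `d` stays out because only half of its neighbours are. Lowering the threshold makes the expansion more aggressive.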

Social and Information Networks

Detecting Automatically Managed Accounts in Online Social Networks: Graph Embedding Approach

Ilia Karpov, Ekaterina Glazkova (National Research University Higher School of Economics, Moscow), 2020

The spread of online social networks and the opportunity to commercialize popular accounts have attracted a large number of automated programs, known as artificial accounts. This paper focuses on classifying human and fake accounts on a social network by employing several graph neural networks to efficiently encode the attributes and network graph features of each account. Our work uses both network structure and attributes to distinguish human and artificial accounts, and compares attributed and traditional graph embeddings. Treating complex, human-like artificial accounts as a standalone task reveals significant limitations of profile-based algorithms for bot detection and shows the efficiency of network structure-based methods for detecting sophisticated bot accounts. Experiments show that our approach can achieve performance competitive with existing state-of-the-art bot detection systems using only network-driven features. The source code of this paper is available at: http://github.com/karpovilia/botdetection.

Social and Information Networks

Multiple Accounts Detection on Facebook Using Semi-Supervised Learning on Graphs

Xiaoyun Wang, Chun-Ming Lai, Yunfeng Hong, 2018

In social networks, a single user may create multiple accounts to spread his or her opinions and to influence others by actively commenting on different news pages. It would benefit both social networks and their communities to demote such abnormal activity, and the first step is to detect those accounts. However, detection is challenging because these accounts may have very realistic names and reasonable activity patterns. In this paper, we investigate three different approaches and propose using graph embedding together with semi-supervised learning to predict whether a pair of accounts was created by the same user. We carry out extensive experimental analyses to understand how changes in the input data and in algorithmic parameters and optimization affect prediction performance. We also find that local information is more important than global information for such prediction, and we identify the threshold that leads to the best results. We test the proposed approach on 6700 Facebook pages from the Middle East and achieve an average accuracy of 0.996 and an AUC (area under curve) of 0.952 for users with the same name; on the U.S. 2016 election dataset, we obtain a best AUC of 0.877 for users with different names.

Social and Information Networks

Characterising User Content on a Multi-lingual Social Network

Pushkal Agarwal, Kiran Garimella, Sagar Joglekar, 2020

Social media has been in the vanguard of political information diffusion in the 21st century. Most studies that look into disinformation, political influence and fake news focus on mainstream social media platforms. This has inevitably made English an important factor in our current understanding of political activity on social media. As a result, there have been only a limited number of studies of a large portion of the world, including the largest multilingual and multi-cultural democracy: India. In this paper we present our characterisation of a multilingual social network in India called ShareChat. We collect an exhaustive dataset spanning 72 weeks before and during the Indian general elections of 2019, across 14 languages. We investigate the cross-lingual dynamics by clustering visually similar images together and exploring how they move across language barriers. We find that the Telugu, Malayalam, Tamil and Kannada languages tend to be dominant in soliciting political images (often referred to as memes), and that posts from Hindi have the largest cross-lingual diffusion across ShareChat (as do images containing text in English). In the case of images containing text that cross language barriers, we see that language translation is used to widen accessibility. That said, we find cases where the same image is associated with very different text (and therefore meanings). This initial characterisation paves the way for more advanced pipelines to understand the dynamics of fake and political content in a multi-lingual and non-textual setting.

Social and Information Networks Computation and Language