Home Publications Patents In the Media Resources Courses Supervision

 

List of Publications

Check also my DBLP entry, my Google scholar entry, and my Edinburgh explorer entry.

Quick Navigator: 2025, 2024, 2023, 2022, 2021, 2020, 2019, 2018, 2017, 2016, 2015, 2014, 2013, 2012, 2011, 2010, 2009, 2008, 2007, 2006, Thesis, Reports.

 

 

Books

  • K. Darwish and W. Magdy. Arabic Information Retrieval. Download

  • W. Magdy et al. Arabic Language Processing (Arabic book). Download

 

2025

  • Fawzi M., B. Ross, and W. Magdy. "The Prophet said so!": On Exploring Hadith Presence on Arabic Social Media. CSCW 2025 to appear

 

2024

  • Fawzi M. and W. Magdy. "Pinocchio had a Nose, You have a Network!": On Characterizing Fake News Spreaders on Arabic Social Media. CSCW 2024 link

  • Al Hariri Y., S. Chausson, B. Ross, and W. Magdy. TwiXplorer: An Interactive Tool for Narrative Detection and Analysis in Historic Twitter Data. CSCW 2024 to appear

  • Keleg A., W. Magdy, and S. Goldwater. Estimating the Level of Dialectness Predicts Inter-annotator Agreement in Multi-dialect Arabic Datasets. ACL 2024 link Outstanding Paper Award

  • Abdul-Mageed M., C. Zhang, A. Keleg, A. Elmadany, I. Hamed, W. Magdy, H. Bouamor, N. Habash. NADI 2024: The Fifth Nuanced Arabic Dialect Identification Shared Task. ArabicNLP 2024 to appear

 

2023

  • Kekulluoglu D., K. Vaniea, Maria Wolters, and W. Magdy. Twitter has a Binary Privacy Setting, are Users Aware of How It Works?. CSCW 2023 link

  • Keleg A. and W. Magdy. DLAMA: A Framework for Curating Culturally Diverse Facts for Probing the Knowledge of Pretrained Language Models. ACL 2023 link

  • Keleg A., S. Goldwater, and W. Magdy. ALDi: Quantifying the Arabic Level of Dialectness of Text. EMNLP 2023 link

  • Keleg A. and W. Magdy. Arabic Dialect Identification under Scrutiny: Limitations of Single-label Classification. ArabicNLP 2023 link

  • Benkhedda Y., P. Xiao, and W. Magdy. Emoji are Effective Predictors of User's Demographics. ASONAM 2023 link

  • Zheng Y., B. Ross, and W. Magdy. What Makes Good Counterspeech? A Comparison of Generation Approaches and Evaluation Metrics. CS4OA 2023 link

 

2022

  • Waniek M, W. Magdy and T. Rahwan. Hiding opinions from machine learning. PNAS Nexus link

  • Aldayel A. and W. Magdy. Characterizing the Role of Bots in Polarized Stance on Social Media. SNAM 12(30) link

  • Magdy W. Robert Elliott Smith: Rage Inside the Machine - the prejudice of algorithms, and how to stop the internet making bigots of us all. GPEM 23(1) link

  • Kekulluoglu D., K. Vaniea, and W. Magdy. Understanding Privacy Switching Behaviour on Twitter. CHI 2022 arXiv, link

  • Kekulluoglu D., W. Magdy and K. Vaniea. From an Authentication Question to a Public Social Event: Characterizing Birthday Sharing on Twitter. ICWSM 2022 arXiv, link

  • Oprea S., S. Wilson, and W. Magdy. Should a Bot be Sarcastic? Understanding User Preferences Towards Sarcasm Generation. ACL 2022 link

  • Bahgat M., S. Wilson, and W. Magdy. LIWC-UD: Classifying Online Slang Terms into LIWC Categories. WebSci 2022 link

  • Abu Farha I., Wilson S., Oprea S. and W. Magdy. Sarcasm Detection is Way Too Easy! 🙃 An Empirical Comparison of Human and Machine Sarcasm Detection. EMNLP 2022 link

  • Kamila S., W. Magdy, S. Dutta and M. Wang. AX-MABSA: A Framework for Extremely Weakly Supervised Multi-label Aspect Based Sentiment Analysis. EMNLP 2022 link

  • Meaney J. A., S. Wilson, L. Chiruzzo, and W. Magdy. Don't Take it Personally: Analyzing Gender and Age Differences in Ratings of Online Humor. SocInfo 2022 arXiv, link Best Paper Award Runner-up

  • Tarighat F. S., W. Magdy, and M. Corley. Understanding Fillers May Facilitate Automatic Sarcasm Comprehension: A Structural Analysis of Twitter Data and a Participant Study. SemDial 2022 link

  • Keleg A. and W. Magdy. SMASH at Qur'an QA 2022: Creating Better Faithful Data Splits for Low-resourced Question Answering Scenarios. OSACT 2022 link

  • Abu Farha I., Oprea S., Wilson S. and W. Magdy. SemEval-2022 Task 6: iSarcasmEval, Intended Sarcasm Detection in English and Arabic. SemEval 2022 link

  • Abu Farha I. and W. Magdy. The Effect of Arabic Dialect Familiarity on Data Annotation. WANLP - EMNLP 2022 link Best Paper Award

 

2021

  • Abu Farha I. and W. Magdy. A Comparative Study of Effective Approaches for Arabic Sentiment Analysis. IP&M 58(2) link

  • Aldayel A. and W. Magdy. Stance Detection on Social Media: State of the Art and Trends. IP&M 58(4) link

  • Robertson A., W. Magdy, S. Goldwater. Black or White but never neutral: How readers perceive identity from yellow or skin-toned emoji. CSCW 2021 link, arXiv

  • Al Hariri Y., W. Magdy and M. Wolters. Atheists versus Theists: Religious Polarisation in Arab Online Communities. CSCW 2021 link

  • Robertson A., W. Magdy, S. Goldwater. Identity Signals in Emoji Do not Influence Perception of Factual Truth on Twitter. Emoji2021 - ICWSM 2021 link

  • Oprea S., S. Wilson, and W. Magdy. Chandler: An Explainable Sarcastic Response Generator. EMNLP 2021 link

  • Abu Farha I., Zaghouani W. and W. Magdy. Overview of the WANLP 2021 Shared Task on Sarcasm and Sentiment Detection in Arabic. WANLP - EACL 2021 link

  • Abu Farha I. and W. Magdy. Benchmarking Transformers for Arabic Sarcasm Detection. WANLP - EACL 2021 link

  • Meaney J. A., S. Wilson, L. Chiruzzo, A. Lopez and W. Magdy. SemEval 2021 Task 7: HaHackathon, Detecting and Rating Humor and Offense. SemEval 2021 link

 

2020

  • Robertson A., W. Magdy, S. Goldwater. Emoji Skin Tone Modifiers: Analyzing Variation in Usage on Social Media. ACM TSC 3(2) link

  • Chen L., W. Magdy and M. Wolters. The Effect of User Psychology on the Content of Social Media Posts: Originality and Transitions Matter. Frontiers in Psychology 11 link

  • Oprea S. and W. Magdy. iSarcasm: A Dataset of Intended Sarcasm. ACL 2020 link, arXiv

  • Oprea S. and W. Magdy. The Effect of Sociocultural Variables on Sarcasm Communication Online. CSCW 2020 link, arXiv

  • Abokhodair N., A. Elmadany and W. Magdy. Holy Tweets: Exploring the Sharing of the Quran on Twitter. CSCW 2020 link, arXiv

  • Kekulluoglu D., W. Magdy and K. Vaniea. Analysing Privacy Leakage of Life Events on Twitter. WebSci 2020 link

  • Chen L., W. Magdy, H. Whalley and M. Wolters. Examining the Role of Derived Mood Patterns in Predicting Depression Symptoms. WebSci 2020 link

  • Chen L., W. Magdy, H. Whalley and M. Wolters. It's Not Just About Sad Songs: The Effect of Depression on Posting Lyrics and Quotes. SocInfo 2020 link

  • Wilson S., W. Magdy, B. McGillivray and G. Tyson. Analyzing Temporal Relationships between Trending Terms on Twitter and Urban Dictionary Activity. WebSci 2020 link, arXiv

  • Wilson S., W. Magdy, B. McGillivray, K. Garimella, and G. Tyson. Urban Dictionary Embeddings for Slang NLP Applications. LREC 2020 link

  • Bahgat M., S. Wilson, W. Magdy. Towards Using Word Embedding Vector Space for Better Cohort Analysis. ICWSM 2020 link

  • Mubarak H., K. Darwish, W. Magdy, T. Elsayed and H. Al-Khalifa. Overview of OSACT4 Arabic Offensive Language Detection Shared Task. OSACT4 - LREC 2020 link

  • Abu Farha I. and W. Magdy. From Arabic Sentiment Analysis to Sarcasm Detection: The ArSarcasm Dataset. OSACT4 - LREC 2020 link

  • Abu Farha I. and W. Magdy. Multitask Learning for Arabic Offensive Language and Hate-Speech Detection. OSACT4 - LREC 2020 link

  • Meaney J. A., S. Wilson and W. Magdy. SMASH at SemEval-2020 Task 7: Optimizing the Hyperparameters of ERNIE 2.0 for Humor Ranking and Rating. SemEval 2020 link

  • Jinhang L., G. Longinos, S. Wilson and W. Magdy. Emoji and Self-Identity in Twitter Bios. NLP+CSS - ENMLP 2020 link

  • Wilson S., W. Magdy, B. McGillivray, and G. Tyson. Embedding Structured Dictionary Entries. Insights from Negative Results in NLP - EMNLP 2020 link

 

2019

  • Mourad A., F. Scholer, W. Magdy, and M. Sanderson. A Practical Guide for the Effective Evaluation of Twitter User Geolocation. ACM TSC 2(3) link, arXiv

  • Aldayel A. and W. Magdy. Your Stance is Exposed! Analysing Possible Factors for Stance Detection on Social Media. CSCW 2019 link, arXiv

  • Oprea S. and W. Magdy. Exploring Author Context for Detecting Intended vs Perceived Sarcasm. ACL 2019 link

  • Al Hariri Y., W. Magdy and M. Wolters. Arabs and Atheism: Religious Discussions in the Arab Twittersphere. SocInfo 2019 link, arXiv

  • Aldayel A. and W. Magdy. Assessing Sentiment of the Expressed Stance on Social Media. SocInfo 2019 link, arXiv

  • Algotiml B., A. Elmadany and W. Magdy. Arabic Tweet-Act: Speech Act Recognition for Arabic Asynchronous Conversations. WANLP - ACL 2019 link

  • Abu Farha I. and W. Magdy. Mazajak: An Online Arabic Sentiment Analyser. WANLP - ACL 2019 link, Live demo

 

2018

  • Mourad A., F. Scholer, W. Magdy, and M. Sanderson. How Well Did You Locate Me? Effective Evaluation of Twitter User Geolocation. ASONAM 2018 link

  • Cremarenco D. and W. Magdy. ClassStrength v2: An Adaptive Multilingual Tool for Tweet Classification. ASONAM 2018 link

  • Robertson A., W. Magdy, S. Goldwater. Self-Representation on Twitter Using Emoji Skin Color Modifiers. ICWSM 2018 link, arXiv, In Media

  • Alharbi R., W. Magdy, K. Darwish, A. AbdelAli, and H. Mubarak. Part-of-Speech Tagging for Arabic Gulf Dialect Using Bi-LSTM. LREC 2018 link

  • Darwish K., H. Mubarak, M. Eldesouki, A. Abdelali, Y. Samih, R. Alharbi, M. Attia, W. Magdy, L. Kallmeyer. Multi-Dialect Arabic POS Tagging: A CRF Approach. LREC 2018 link

  • Elmadany A., H. Mubarak, W. Magdy. ArSAS: An Arabic Speech-Act and Sentiment Corpus of Tweets. OSACT3 - LREC 2018 link

 

2017

  • Magdy W., Y. Elkhatib, G. Tyson, S. Joglekar, and N. Sastry. Fake it till you make it: Fishing for Catfishes. ASONAM 2017 link, arXiv, In Media

  • Darwish K., W. Magdy, and T. Zanouda. Improved Stance Prediction in a User Similarity Feature Space. ASONAM 2017 link

  • Magdy W. and M. Eldesouky. ClassStrength: A Multilingual Tool for Tweets Classification. ASONAM 2017 link

  • Darwish K., W. Magdy, and T. Zanouda. Trump vs. Hillary: What went Viral during the 2016 US Presidential Election. SocInfo 2017 link, arXiv, In Media

  • Darwish K., W. Magdy, A. Rahimi, N. Abukhodair, T. Baldwin. Predicting Online Islamophopic Behavior after #ParisAttacks. Journal of Web Science 3 (1), 2017 preprint

  • Mubarak H., K. Darwish, and W. Magdy. Abusive Language Detection on Arabic Social Media. ALW1 - ACL 2017 link

 

2016

  • Magdy W., K. Darwish, A. Rahimi, N. Abukhodair, T. Baldwin. #ISISisNotIslam or #DeportAllMuslims? Predicting Unspoken Views. Web Science 2016 link, In Media

  • Magdy W., T. Elsayed, and M. Hasanain. On the Evalaution of Tweet Timeline Generation Task. ECIR 2016 link

  • Magdy W. and T. Elsayed. Unsupervised Adaptive Microblog Filtering for Broad Dynamic Topics. IP&M 2016 link

  • Magdy W., K. Darwish, and I. Weber. #FailedRevolutions: Using Twitter to Study the Antecedents of ISIS Support. First Monday. link, arXiv, In Media

  • Nakov P., L. Marquez, A. Moschitti, W. Magdy, H. Mubarak, A. Friehat, J. Glass, and B. Randeree. SemEval-2016 Task 3: Community Question Answering. SemEval 2016 - NAACL link

 

2015

  • Borge-Holthoefer J., W. Magdy, K. Darwish, and I. Weber. Content and Network Dynamics Behind Egyptian Political Polarization on Twitter. CSCW 2015 arXiv, ACM

  • Magdy W., H. Sajjad, T. El-Ganainy and F. Sebastiani. Distant Supervision for Tweet Classification using YouTube Labels. ICWSM 2015 link

  • Magdy W., H. Sajjad, T. El-Ganainy and F. Sebastiani. Bridging Social Media via Distant Supervision. SNAM 5(1) link, arXiv

  • Marquez L., J. Glass, W. Magdy, A. Moschitti, P. Nakov, and B. Randeree. SemEval-2015 Task 3: Answer Selection in Community Question Answering. SemEval 2015 - ACL link

  • M. Nicosia, S. Filice, A. Barron-Cedeno, I. Saleh, H. Mubarak, W. Gao, P. Nakov, G. Da San Martino, A. Moschitti, K. Darwish, L. Marquz, S. Joty, and W. Magdy. QCRI: Answer Selection for Community Question Answering - Experiment for Arabic and English. SemEval 2015 - ACL link

  • Hasanain M., T. Elsayed, and W. Magdy. Improving Tweet Timeline Generation by Predicting Optimal Retrieval Depth. AIRS 2015 link1, link2 Best Paper Award

  • Ali A., W. Magdy, and S. Renals. Multi-Reference Evaluation for Dialectal Speech Recognition System: A Study for Egyptian ASR. ArabicNLP - ACL 2015 link

  • Ali A., W. Magdy, and S. Renals. Multi-Reference WER for Evaluating ASR for Language with No Orthographic Rules. ASRU 2015 link

  • Magdy W., K. Darwish, and I. Weber. "I like ISIS, but I want to watch Chris Nolan's new movie": Exploring ISIS Supporters on Twitter. Hypertext 2015 link

 

2014

  • Magdy W. and T. Elsayed. Adaptive Method for Following Dynamic Topics on Twitter. ICWSM 2014 link

  • Hasanain M., T. Elsayed, and W. Magdy. Identification of Answer-Seeking Questions in Arabic Microblogs. CIKM 2014 link

  • Elsawy E., M. Mokhtar, and W. Magdy. TweetMogaz v2: Identifying News Stories in Social Media. CIKM 2014 link

  • Magdy W., W. Gao, T. El-Ganainy, and Z. Wei. QCRI at TREC 2014: Applying the KISS principle for the TTG task in the Microblog Track. TREC 2014 (ranked 2nd / 13 groups in the TTG task)  link

  • El-Ganainy T., W. Magdy, and A. Rafea. Hyperlink-Extended Pseudo Relevance Feedback for Improved Microblog Retrieval. SoMeRA - SIGIR 2014 link

  • Wei Z., W. Gao, T. El-Ganainy, W. Magdy, K-F. Wong. Ranking Model Selection and Fusion for Effective Microblog. SoMeRA - SIGIR 2014 link

  • Darwish, K. and W. Magdy. Arabic Information Retrieval. Foundations and Trends in Information Retrieval 7, 4 (Feb. 2014), 239-342  link

 

2013

  • Magdy W. and G. J. F. Jones. Studying Machine Translation Technologies for Large-Data CLIR Tasks: A Patent Prior-Art Search Case Study. Springer, Information Retrieval, 2013  link

  • El-Ganainy T., Z. Wei, W. Magdy, W. Gao: QCRI at TREC 2013 Microblog Track. TREC 2013 (ranked 2nd / 20 groups)  link

  • A. Ali, W. Magdy, and S.Vogel. A Tool for Monitoring and Analyzing HealthCare Tweets. HSD workshop, SIGIR 2013  link1, link2

  • Magdy W. TweetMogaz: A News Portal of Tweets. SIGIR 2013  link

  • A. Kothari, W.Magdy, K. Darwish, A. Mourad, and A. Taei. Detecting Comments on News Articles in Microblogs. ICWSM 2013 link Best Dataset Award

 

2012

  • Eskevich M., W. Magdy, and G. J. F. Jones. New Metrics for Meaningful Evaluation of Informally Structured Speech Retrieval. ECIR 2012  link

  • Saad El-Din A., W. Magdy. Web-based Pseudo Relevance Feedback for Microblog Retrieval. TREC 2012  link

  • Magdy W. and A. Ali, and K. Darwish. A Summarization Tool for Time-Sensitive Social Media. CIKM 2012  link

  • K. Darwish, W.Magdy and A. Mourad. Language Processing for Arabic Microblog Retrieval. CIKM 2012  link

  • Piroi F., M. Lupu, A. Hanbury, W. Magdy, A. P. Sexton, I. Filippov. CLEF-IP 2012: Retrieval Experiments in the Intellectual Property Domain. CLEF 2012  link1, link2

 

2011

  • Magdy W. and G. J. F. Jones. A Study on Query Expansion Methods for Patent Retrieval. PAIR 2011 - CIKM 2011  link

  • Magdy W. and G. J. F. Jones. An Efficient Method for Using Machine Translation Technologies in Cross-Language Patent Search. CIKM 2011  link

  • Ganguly D., J. Leveling, W. Magdy, and G. J. F. Jones. Patent Query Reduction based on Pseudo-Relevant Documents. CIKM 2011  link

  • Leveling J., W. Magdy, and G. J. F. Jones. An Investigation of Decompounding for Cross-Language Patent Search. SIGIR 2011  link

  • Magdy W., P. Lopez, and G. J. F. Jones. Simple vs. Sophisticated Approaches for Patent Prior-Art Search. ECIR 2011  link

  • Magdy W. and G. J. F. Jones. Should MT Systems be Used as Black Boxes in CLIR?. ECIR 2011  link

 

2010

  • Magdy W. and K. Darwish. Omni Font OCR Error Correction with Effect on Retrieval. ISDA 2010  link

  • Magdy W. and G. J. F. Jones. Examining the Robustness of Evaluation Metrics for Patent Retrieval with Incomplete Relevance Judgements. CLEF 2010  link

  • Magdy W. and G. J. F. Jones. Applying the KISS Principle for the CLEF- IP 2010 Prior Art Candidate Patent Search Task. CLEF 2010 Working Notes  link

  • Leveling J., M. R. Ghorab, W. Magdy, G. J. F. Jones, and V. Wade. DCU-TCD@LogCLEF 2010: Re-ranking Document Collections and Query Performance Estimation. CLEF 2010 Working Notes  link

  • Magdy W. and G. J. F. Jones. PRES: A Score Metric for Evaluating Recall-Oriented Information Retrieval Applications. SIGIR 2010   link

  • Magdy W., J. Min, J. Leveling, and G. J. F. Jones. Building a Domain-Specific Document Collection for Evaluating Metadata Effect on Information Retrieval. LREC 2010  link

  • Magdy W. and G. J. F. Jones. A New Metric for Patent Retrieval Evaluation. AsPIRe'10 - ECIR 2010  link

 

2009

  • Magdy W., J. Leveling, and G. J. F. Jones. Exploring Structured Documents and Query Formulation Techniques for Patent Retrieval.CLEF 2009  link

  • Magdy W., J. Leveling, and G. J. F. Jones. DCU @ CLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval. CLEF 2009 Working Notes  link

  • Magdy W. , K. Darwish, and M. El-Saban. Efficient Language-Independent Retrieval of Printed Documents without OCR. SPIRE 2009 link

 

2008

  • Magdy W. and K. Darwish. Book Search: Indexing the Valuable Parts. BooksOnline'08 workshop - CIKM 2008 link1, link2

  • Magdy W. and K. Darwish. Effect of OCR Error Correction on Arabic Retrieval. Springer, Information Retrieval, 2008    link1,link2

 

2007

  • Magdy W. and K. Darwish. CMIC at INEX 2007. INEX, 2007    link

  • Darwish K. and W. Magdy. Error Correction vs. Query Garbling for Arabic OCR Document Retrieval. ACM TOIS, volume 26, 2007    link

  • Magdy W., K. Darwish, and M. Rashwan. Fusion of Multiple Corrupted Transmissions and its effect on Information Retrieval. ESOLE 2007    link

  • Magdy W., K. Darwish, O. Emam, and H. Hassan. Arabic Cross-Document Person Name Normalization. Semitic Languages workshop - ACL 2007, Prague    link

 

2006

  • Magdy W. and K. Darwish. Word-Based Correction for Retrieval of Arabic OCR Degraded Documents. SPIRE 2006    link

  • Magdy W. and K. Darwish. Arabic OCR Error Correction Using Character Segment Correction, Language Modeling, and Shallow Morphology. EMNLP 2006    link

  • Abdelsapor A., N. Adly, K. Darwish, O. Emam, W. Magdy, and M. Nagi. Building a Heterogeneous Information Retrieval Collection of Printed Arabic Documents. LREC 2006    link

 

Thesis

  • Masters Degree Thesis: "Statistical Methods for Error in Text Correction"    link

  • PhD Thesis: "Toward Higher Effectiveness for Recall-Oriented Information Retrieval: A Patent Retrieval Case Study"    link
    Supervisor: Prof. Gareth Jones
    Examiners: Prof. Alan Smeaton (Dublin City University) and Dr. Barrou Diallo (European Patent Office)

 

Reports

  • Cram L., R. Hill, C. Llewellyn, and W. Magdy. UK General Election 2017: a Twitter Analysis. 2017. arXiv

  • Magdy W. and K. Darwish. Trump vs. Hillary Analyzing Viral Tweets during US Presidential Elections 2016. 2016. arXiv

  • Magdy W., K. Darwish, N. Abokhodair. Quantifying Public Response towards Islam on Twitter after Paris Attacks. 2015 arXiv

  • Darwish K., W. Magdy. Attitudes towards Refugees in Light of the Paris Attacks. 2015 arXiv

 

Extended Abstracts

  • Chen L., W. Magdy and M. Wolters. Online Community Engagement when Talking About Infidelity: The Case of Reddit. IC2S2 2019

  • Robertson A., W. Magdy, S. Goldwater. Understanding Referential Aspects of Skin-toned Emoji on Social Media. IC2S2 2019

  • Aldayel A. and W. Magdy. It is more than what you Say! Leveraging User Online Activity for Improved Stance Detection. IC2S2 2019

 

 

 

 

 

 

[ Home | Publications | Patents | Media Attention]

 

Last Modified: Sep 2024