
 

Mazajak (مَزَاجَك)

  • A Dialectal Arabic Sentiment Analyser
  • Online tool here.
  • Related Publication:
    Abu Farha I. and W. Magdy. Mazajak: An Online Arabic Sentiment Analyser. WANLP - ACL 2019

 

Patent Retrieval Evaluation Score (PRES)

 

ArSAS: An Arabic Speech-Act and Sentiment Corpus of Tweets

  • A set of 21K Arabic tweets labeled for 4 classes of sentiment and 6 classes of speech-act.
  • Download data here.
  • Related Publication:
    Elmadany A., H. Mubarak, W. Magdy. ArSAS: An Arabic Speech-Act and Sentiment Corpus of Tweets. OSACT3 - LREC 2018

 

Tweets Manually Annotated into 14 Different Classes

  • 3,129 tweets manually annotated into 14 different categories. The number of tweets per class ranges from 179 to 261. The categories are the same as those used by YouTube.
  • Download data here.
  • Related Publications:
    Magdy W., H. Sajjad, T. El-Ganainy and F. Sebastiani. Distant Supervision for Tweet Classification using YouTube Labels. ICWSM 2015.
    Magdy W., H. Sajjad, T. El-Ganainy and F. Sebastiani. Bridging Social Media via Distant Supervision. SNAM 2015.

 

ClassStrength

  • A tool for classifying social text into 14 general-purpose categories in 5 languages: English, Arabic, French, German, and Russian
  • Download tool here.
  • Online classification tool here.
  • Related Publication:
    Magdy W. and M. Eldesouky. ClassStrength: A Multilingual Tool for Tweets Classification. ASONAM 2017

 

Microblog Filtering Data

 

Tweet Collection during Military Intervention in Egypt in July 2013

 

PornHub User Accounts

  • An anonymized set of almost 100,000 user accounts on the adult social platform PornHub. Each account contains the following information (details in the paper):
    - Profile: age, country, gender, interested in, ...
    - Activities: videos watched, posted, ...
    - Network: number of friends, subscribers, and accounts subscribed to, with the number of males/females in each.
    - Comments: all comments by user on other users' walls.
  • Download data here.
  • Related Publication:
    Magdy W., Y. Elkhatib, G. Tyson, S. Joglekar, and N. Sastry. Fake it till you make it: Fishing for Catfishes. ASONAM 2017

 

Media Attention

A list of articles about my published research can be found here.