About me
I'm currently a post-doc at the University of Edinburgh (in case the hosting of this page didn't give location away!). I hold a BSc in Computer Science and AI from the University of Sheffield (graduated 2004), and a PhD in Natural Language Processing from the same place (in 2009). In my spare time I'm an avid cinema fan, I love music (live and recorded), anybody who knows me would say I'm obsessive about food (I love to cook), and I enjoy playing a whole range of sports. I'm originally from York, went to school in Tadcaster (a little town halfway between York and Leeds, famous for little more than the fact that it has four breweries which made for eventful entertainment and a strange smell) and moved to Sheffield at the tender age of eighteen where I got my undergrad and doctorate. Seven years later I moved to Edinburgh, where I still reside.
Research Interests
I got my Phd in (statistical) classification methods for documents, encompassing related methods in statistics, machine learning and language processing, from the University of Sheffield. I'm now working on problems at the intersection of computer vision and language, particularly detecting out of context regions of images and modelling the effect these have on human viewing behaviour. I have also worked on distributional models of word meaning, and detecting outliers (anomalies) in document collections, as well as developing and adapting language processing techniques for use with Arabic, Persian and Chinese. I am currently on the SynProc (synchronous processing) project at Edinburgh; I have previously been involved in several MoD funded projects, including the REVEAL and CLUE contracts at Sheffield.
Publications
- Allison, B. (2008) An Improved Hierarchical Bayesian Model of Language for Document Classification. In Proceedings COLING’08, Manchester, UK
- Jabbari, S. & Allison, B., and Guthrie, L. (2008) An Empirical Bayesian Method for Detecting Out Of Context Words. In Proceedings TSD’08, Brno, Czech Republic
- Allison, B. (2008) Sentiment Detection Using Lexically-Based Classifiers. In Proceedings TSD’08, Brno, Czech Republic
- Allison, B. (2008) Revisiting the Use of Lexically-Based Features for Sentiment Detection. In Proceedings EMOT’08, Marrakech, Morocco
- Allison, B., & Guthrie, L. (2008) Authorship Attribution of E-Mail: Comparing Classifiers Over a New Corpus for Evaluation. In Proceedings LREC’08, Marrakech, Morocco
- Jabbari, S., Allison, B., and Guthrie, L., (2008) Using a Probabilistic Model of Context to Detect Word Obfuscation. In Proceedings LREC’08, Marrakech, Morocco
- Liu, W., Allison, B., and Guthrie, L., (2008) “Professor or screaming beast? Detecting Anomalous Words in Chinese. In Proceedings LREC’08, Marrakech, Morocco
- Guthrie, D., Guthrie, L., Allison, B., and Wilks, Y., (2007) Unsupervised Anomaly Detection. In: Proceedings IJCAI'07, Hyderbad, India
- Allison, B., Guthrie, L. and Guthrie, D., (2006) Another Look at the Data Sparsity Problem. In Proceegings TSD2006, Brno, Czech Republic
- Jabbari, S., Allison, B., Guthrie, D., and Guthrie, L. (2006) Towards the Orwellian Nightmare: Separation of Business and Personal Emails. In: Proceedings ACL 2006, Sydney, Australia
- Guthrie, D., Allison, B., Liu, W., Guthrie, L., and Wilks, Y., (2006) A Closer Look at Skip-Gram Modelling. In: Proceedings LREC'2006, Genoa, Italy
Resources
- Enron Authorship corpus: a 9-way authorship attribution problem derived from the Enron corpus (from the LREC 2008 paper above) Download(.tar.gz)
- A corpus of Business and Personal mails for classification (from the ACL 2006 paper above). Download(.tar.gz)
- For those who are really interested, take a look at my cv
Contact
Ben Allison
School of Informatics
Informatics Forum
10 Crichton Street
Edinburgh
EH8 9AB
UK
Tel.: +44 (0)131 651 5661
Email: (my first initial)(my surname) {at} inf.ed.ac.uk
(note there are no spaces or punctuation between the first intial and the surname)