Publications


2019

Hamdy Mubarak, Ahmed Abdelali, Kareem Darwish, Mohamed Eldesouki, Younes Samih, Hassan Sajjad (2019): A System for Diacritizing Four Varieties of Arabic. In Proceedings of EMNLP 2019.

Younes Samih, H Mubarak, A Abdelali, M Attia, M Eldesouki, K Darwish (2019): QC-GO Submission for MADAR Shared Task: Arabic Fine-Grained Dialect Identification. In Proceedings of the Fourth Arabic Natural Language Processing Workshop, 290-294. [Secured third place in Subtask 2]

M Attia, Younes Samih, A Elkahky, H Mubarak, A Abdelali, K Darwish (2019): POS Tagging for Improving Code-Switching Identification in Arabic. In Proceedings of the Fourth Arabic Natural Language Processing Workshop, 18-29.

Hamdy Mubarak, Ahmed Abdelali, Hassan Sajjad, Younes Samih and Kareem Darwish (2019): Highly Effective Arabic Diacritization using Sequence to Sequence Modeling. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019): Human Language Technologies, Volume 1. Minneapolis, Minnesota, USA.

2018

Mohammed Attia, Younes Samih, and Wolfgang Maier (2018): GHHT at CALCS 2018: Named Entity Recognition on Code-switched Data. In Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching, Melbourne, Australia. Association for Computational Linguistics. [Secured second place for Egyptian Arabic/MSA] [URL]

Mohammed Attia, Younes Samih, Ali Elkahky, and Laura Kallmeyer (2018): Multilingual Multi-class Sentiment Classification Using Convolutional Neural Networks. In 11th edition of the Language Resources and Evaluation Conference, 7-12 May 2018, Miyazaki (Japan). [code]

Kareem Darwish, Hamdy Mubarak, Ahmed Abdelali, Mohamed Eldesouki, Younes Samih, Randah Alharbi, Mohammed Attia, Walid Magdy and Laura Kallmeyer (2018): Multi-Dialect Arabic POS Tagging: A CRF Approach. In 11th edition of the Language Resources and Evaluation Conference, 7-12 May 2018, Miyazaki (Japan).

Kareem Darwish, Ahmed Abdelali, Hamdy Mubarak, Younes Samih, Mohammed Attia (2018): Diacritization of Moroccan and Tunisian Arabic Dialects: A CRF Approach. Proceedings of the 4th Arabic Natural Language Processing Workshop (WANLP-2018), the 11th edition of the Language Resources and Evaluation Conference, 7-12 May 2018, Miyazaki (Japan).

Abdelali, A., Attia, M., Samih, Y., Darwish, K., & Mubarak, H. (2018). Diacritization of Maghrebi Arabic Sub-Dialects. arXiv preprint arXiv:1810.06619.

Mohammed Attia, Younes Samih, Manaal Faruqui and Wolfgang Maier (2018): Discovering Discriminative Attributes in Distributional Semantics. In Proceedings of the SemEval-2018 workshop at NAACL 2018.

Tatiana Bladier, Andreas van Cranenburgh, Younes Samih, Laura Kallmeyer (2018): German and French Neural Supertagging Experiments for LTAG Parsing. In ACL 2018 Student Research Workshop.

Rafael Ehren, Timm Lichte, Younes Samih (2018): Mumpitz at PARSEME Shared Task 2018: A Bidirectional LSTM for the Identification of Verbal Multiword Expressions. In Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018), August 25-26, 2018, COLING 2018 (Santa Fe, USA), ACL, pages 261-267.

2017

Younes Samih, “Dialectal Arabic Processing Using Deep Learning,” PhD Thesis, Heinrich Heine University Düsseldorf, Düsseldorf, Germany, 2017. [bib]

Younes Samih, Mohamed Eldesouki, Mohammed Attia, Kareem Darwish, Ahmed Abdelali, Hamdy Mubarak and Laura Kallmeyer (2017). Learning from Relatives: Unified Dialectal Arabic Segmentation. Proceedings of CoNLL 2017, the SIGNLL Conference on Computational Natural Language Learning, co-located with ACL 2017 in Vancouver, Canada. [Dataset] [Annotation Guidelines]

Younes Samih, Mohammed Attia, Mohamed Eldesouki, Hamdy Mubarak, Ahmed Abdelali, Laura Kallmeyer and Kareem Darwish (2017): A Neural Architecture for Dialectal Arabic Segmentation. Proceedings of The Third Arabic Natural Language Processing Workshop (WANLP-2017), EACL 2017, Valencia, Spain, 46-54. [Dataset]

Eldesouki, Mohamed, Younes Samih, Ahmed Abdelali, Mohammed Attia, Hamdy Mubarak, Kareem Darwish, and Laura Kallmeyer (2017). Arabic Multi-Dialect Segmentation: bi-LSTM-CRF vs. SVM. In: CoRR abs/1708.05891.

2016

Younes Samih, Suraj Maharjan, Mohammed Attia, Laura Kallmeyer and Thamar Solorio (2016): Multilingual Code-switching Identification via LSTM Recurrent Neural Networks. In the Proceedings of the Second Workshop on Computational Approaches to Code Switching, EMNLP, Austin, Texas, USA, November 2016. [Secured first place for Egyptian Arabic/MSA and second place for Spanish/English code-switching identification] [link]

Younes Samih, Wolfgang Maier and Laura Kallmeyer (2016): SAWT: Sequence Annotation Web Tool. In the Proceedings of the Second Workshop on Computational Approaches to Code Switching, EMNLP, Austin, Texas, USA, November 2016.

Attia, M., Maharjan, S., Samih, Y., Kallmeyer, L., and Solorio, T. (2016): Detecting Semantic Relations via Word Embeddings. In Proceedings of the 5th CogALex workshop at COLING 2016. [Secured first place for Task 1 and second place for Task 2] [link]

Younes Samih and Wolfgang Maier (2016): An Arabic-Moroccan Darija Code-Switched Corpus. In the Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016).

Younes Samih and Wolfgang Maier (2016): Detecting Code-Switching in Moroccan Arabic. In the Proceedings of SocialNLP @ IJCAI-2016, New York, USA.

2015

Une métagrammaire de l’interface morpho-sémantique dans les verbes en arabe (Petitjean, Simon, Samih, Younes and Lichte, Timm), In Actes de la 22e conférence sur le Traitement Automatique des Langues Naturelles, Association pour le Traitement Automatique des Langues, 2015.

Arabic spelling error detection and correction (Mohammed Attia, Pavel Pecina, Younes Samih, Khaled Shaalan and Josef van Genabith), Natural Language Engineering, 22(5), 751-773. doi:10.1017/S1351324915000030

2014

XMG: a tool for implementing frames (Lichte, Timm, Petitjean, Simon, Kallmeyer, Laura and Samih, Younes), In Concept Types and Frames (CTF 2014), 25–27 August 2014, University of Düsseldorf, 2014. [Link]

2013

Synchronous Regular Relations and Morphological Analysis (Wurm, Christian and Samih, Younes), In Proceedings of the 11th International Conference on Finite State Methods and Natural Language Processing, 2013.

2012

Arabic Word Generation and Modelling for Spell Checking. (Shaalan, Khaled F, Samih, Younes, Attia, Mohammed, Pecina, Pavel and van Genabith, Josef), In LREC, 2012.

Conversion of Procedural Morphologies to Finite-State Morphologies: a Case Study of Arabic (Hulden, Mans and Samih, Younes), In 10th International Workshop on Finite State Methods and Natural Language Processing, 2012.

Improved Spelling Error Detection and Correction for Arabic. (Attia, Mohammed, Pecina, Pavel, Samih, Younes, Shaalan, Khaled F and van Genabith, Josef), In COLING (Posters), 2012.

The Floating Arabic Dictionary: An Automatic Method for Updating a Lexical Database through the Detection and Lemmatization of Unknown Words (Attia, Mohammed, Samih, Younes, Shaalan, Khaled and van Genabith, Josef), In COLING 2012, 2012.

2011

FTrace: a tool for finite-state morphology (Kilbury, James, Bontcheva, Katina and Samih, Younes), In 9th International Workshop on Finite State Methods and Natural Language Processing, 2011.

Historical-Comparative Reconstruction in Finite-State Technology (Kilbury, J., Bontcheva, K., Mamerow, N., & Samih, Y.), In 9th International Tbilisi Symposium on Language, Logic and Computation, 2011, pp. 26-30.