2023
-
Younes Samih, Laura Kallmeyer (2023), Unsupervised Semantic Frame Induction Revisited, in Proceedings of IWCS 2023: the 15th International Conference on Computational Semantics, Université de Lorraine, France (20-23 June 2023) [Data]
- David Arps, Laura Kallmeyer, Younes Samih and Hassan Sajjad (2023): Multilingual Nonce Dependency Treebanks: Understanding how LLMs represent and process syntactic structure. arXiv.
2022
- Patent: Hamdy S. Mubarak, Kareem Mohamed Darwish, Ahmed Abdelali, Hassan Sajjad, Younes Samih (2022), Method and System for Diacritizing Arabic Text, US Patent App. 17/598,633
- Arps, David, Younes Samih, Laura Kallmeyer & Hassan Sajjad. 2022. Probing for Constituency Structure in Neural Language Models. In Findings of the Association for Computational Linguistics: EMNLP 2022.
2021
- Hamdy Mubarak, Ahmed Abdelali, Kareem Darwish, Younes Samih (2021), Automatic Expansion and Retargeting of Arabic Offensive Language Training, arXiv:2111.09574
-
Younes Samih, Kareem Darwish (2021). A Few Topical Tweets are Enough for Effective User-Level Stance Detection, in Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021), pages 2637–2646, April 19–23, 2021.
-
H. Mubarak, Ammar Rashed, Kareem Darwish, Younes Samih, Ahmed Abdelali (2021). Arabic Offensive Language on Twitter: Analysis and Experiments. In Proceedings of the 6th Arabic Natural Language Processing Workshop at the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021).
-
Ahmed Abdelali, Hamdy Mubarak, Younes Samih, Sabit Hassan and Kareem Darwish (2021), QADI: Arabic Dialect Identification in the Wild. In Proceedings of the 6th Arabic Natural Language Processing Workshop at the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021).
-
Ahmed Abdelali, Sabit Hassan, Hamdy Mubarak, Kareem Darwish, Younes Samih (2021). Pre-Training BERT on Arabic Tweets: Practical Considerations
-
Esther Seyffarth, Younes Samih, Laura Kallmeyer and Hassan Sajjad (2021), Implicit representations of event properties within contextual language models: Searching for “causativity neurons”. In Proceedings of the IWCS 2021 Groningen: the 14th International Conference on Computational Semantics (16-18 June 2021)
2020
-
Shammur Chowdhury, Younes Samih, Mohamed Eldesouki, Ahmed Ali. 2020. Effects of Dialectal Code-Switching on Speech Modules: A Study using Egyptian Arabic Broadcast Speech. In INTERSPEECH 2020, Shanghai, China [code].
-
Sabit Hassan, Younes Samih, Hamdy Mubarak, and Ahmed Abdelali. 2020. ALT at SemEval-2020 Task 12: Arabic and English Offensive Language Identification in Social Media. In Proceedings of the International Workshop on Semantic Evaluation (SemEval).
-
Sabit Hassan, Younes Samih, Hamdy Mubarak, Ahmed Abdelali, Ammar Rashed, and Shammur Chowdhury (2020): ALT Submission for OSACT Shared Task on Offensive Language Detection. In Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, pages 61–65. Language Resources and Evaluation Conference (LREC 2020), Marseille, 11–16 May 2020.
-
Kareem Darwish, Mohammed Attia, Hamdy Mubarak, Younes Samih, Ahmed Abdelali, Lluís Màrquez, Mohamed Eldesouki and Laura Kallmeyer (2020): Effective Multi-Dialectal Arabic POS Tagging. Natural Language Engineering (NLE 2020). Cambridge University Press, 14 April 2020.
-
Suwon Shon, Ahmed Ali, Younes Samih, Hamdy Mubarak, James Glass (2020): ADI17: A Fine-Grained Arabic Dialect Identification Dataset. In ICASSP 2020: The 45th International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, May 4-8.
2019
-
Hamdy Mubarak, Ahmed Abdelali, Kareem Darwish, Mohamed Eldesouki, Younes Samih, Hassan Sajjad (2019): A System for Diacritizing Four Varieties of Arabic. In Proceedings of EMNLP 2019.
-
Ahmed Ali, Suwon Shon, Younes Samih, Hamdy Mubarak, Ahmed Abdelali, James Glass, Steve Renals, Khalid Choukri (2019): The MGB-5 Challenge: Recognition and Dialect Identification of Dialectal Arabic Speech. In 2019 IEEE Automatic Speech Recognition and Understanding Conference (ASRU 2019).
-
Younes Samih, H Mubarak, A Abdelali, M Attia, M Eldesouki, K Darwish (2019): QC-GO Submission for MADAR Shared Task: Arabic Fine-Grained Dialect Identification. In Proceedings of the Fourth Arabic Natural Language Processing Workshop, pages 290-294.
-
M Attia, Younes Samih, A Elkahky, H Mubarak, A Abdelali, K Darwish (2019): POS Tagging for Improving Code-Switching Identification in Arabic. In Proceedings of the Fourth Arabic Natural Language Processing Workshop, pages 18-29.
-
Hamdy Mubarak, Ahmed Abdelali, Hassan Sajjad, Younes Samih and Kareem Darwish (2019): Highly Effective Arabic Diacritization using Sequence to Sequence Modeling. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019): Human Language Technologies, Volume 1. Minneapolis, Minnesota, USA.
2018
-
Mohammed Attia, Younes Samih, and Wolfgang Maier (2018): GHHT at CALCS 2018: Named Entity Recognition on Code-switched Data. In Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching, Melbourne, Australia. Association for Computational Linguistics. [Secured the second place for Egyptian Arabic/MSA] [URL]
-
Mohammed Attia, Younes Samih, Ali Elkahky, and Laura Kallmeyer (2018): Multilingual Multi-class Sentiment Classification Using Convolutional Neural Networks. In 11th edition of the Language Resources and Evaluation Conference, 7-12 May 2018, Miyazaki (Japan). [code]
-
Kareem Darwish, Hamdy Mubarak, Ahmed Abdelali, Mohamed Eldesouki, Younes Samih, Randah Alharbi, Mohammed Attia, Walid Magdy and Laura Kallmeyer (2018): Multi-Dialect Arabic POS Tagging: A CRF Approach. In 11th edition of the Language Resources and Evaluation Conference, 7-12 May 2018, Miyazaki (Japan).
-
Kareem Darwish, Ahmed Abdelali, Hamdy Mubarak, Younes Samih, Mohammed Attia (2018): Diacritization of Moroccan and Tunisian Arabic Dialects: A CRF Approach. Proceedings of The 4th Arabic Natural Language Processing Workshop (WANLP-2018), the 11th edition of the Language Resources and Evaluation Conference, 7-12 May 2018, Miyazaki (Japan).
-
Abdelali, A., Attia, M., Samih, Y., Darwish, K., & Mubarak, H. (2018). Diacritization of Maghrebi Arabic Sub-Dialects. arXiv preprint arXiv:1810.06619.
-
Mohammed Attia, Younes Samih, Manaal Faruqui and Wolfgang Maier (2018), Discovering Discriminative Attributes in Distributional Semantics, in the proceedings of the SemEval-2018 workshop at NAACL 2018.
-
Tatiana Bladier, Andreas van Cranenburgh, Younes Samih, Laura Kallmeyer (2018): German and French Neural Supertagging Experiments for LTAG Parsing. ACL 2018 Student Research Workshop
-
Rafael Ehren, Timm Lichte, Younes Samih (2018): Mumpitz at PARSEME Shared Task 2018: A Bidirectional LSTM for the Identification of Verbal Multiword Expressions. In Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018), August 25-26, 2018, COLING 2018 (Santa Fe, USA), ACL, pages 261-267.
2017
-
Younes Samih, “Dialectal Arabic Processing Using Deep Learning,” PhD Thesis, Heinrich Heine University Düsseldorf, Düsseldorf, Germany, 2017. [bib]
-
Younes Samih, Mohamed Eldesouki, Mohammed Attia, Kareem Darwish, Ahmed Abdelali, Hamdy Mubarak and Laura Kallmeyer (2017). Learning from Relatives: Unified Dialectal Arabic Segmentation. Proceedings of CoNLL 2017, the SIGNLL Conference on Computational Natural Language Learning, co-located with ACL 2017 in Vancouver, Canada. [Dataset] [Annotation Guidelines]
-
Younes Samih, Mohammed Attia, Mohamed Eldesouki, Hamdy Mubarak, Ahmed Abdelali, Laura Kallmeyer and Kareem Darwish (2017): A Neural Architecture for Dialectal Arabic Segmentation. Proceedings of The Third Arabic Natural Language Processing Workshop (WANLP-2017), EACL 2017, Valencia, Spain, 46-54. [Dataset]
-
Eldesouki, Mohamed, Younes Samih, Ahmed Abdelali, Mohammed Attia, Hamdy Mubarak, Kareem Darwish, and Laura Kallmeyer (2017). Arabic Multi-Dialect Segmentation: bi-LSTM-CRF vs. SVM. In: CoRR abs/1708.05891
2016
-
Younes Samih, Suraj Maharjan, Mohammed Attia, Laura Kallmeyer and Thamar Solorio (2016): Multilingual Code-switching Identification via LSTM Recurrent Neural Networks. In the Proceedings of the Second Workshop on Computational Approaches to Code Switching, EMNLP, Austin, Texas, USA, November 2016. (Secured the first place for Egyptian Arabic/MSA and the second place for Spanish/English code-switching identification) [ link ]
-
Younes Samih, Wolfgang Maier and Laura Kallmeyer (2016): SAWT: Sequence Annotation Web Tool. In the Proceedings of the Second Workshop on Computational Approaches to Code Switching, EMNLP, Austin, Texas, USA, November 2016.
-
Attia, M., Maharjan, S., Samih, Y., Kallmeyer, L., and Solorio, T. (2016): Detecting Semantic Relations via Word Embeddings. In Proceedings of the 5th CogALex workshop at COLING 2016. (Secured the first place for Task 1 and the second place for Task 2) [ link ]
-
Younes Samih and Wolfgang Maier (2016): Detecting code-switching in Moroccan Arabic. In the Proceedings of SocialNLP @ IJCAI-2016, New York, USA.
2015
-
Une métagrammaire de l’interface morpho-sémantique dans les verbes en arabe (Petitjean, Simon, Samih, Younes and Lichte, Timm), In Actes de la 22e conférence sur le Traitement Automatique des Langues Naturelles, Association pour le Traitement Automatique des Langues, 2015.
-
Arabic spelling error detection and correction (Mohammed Attia, Pavel Pecina, Younes Samih, Khaled Shaalan and Josef van Genabith), Natural Language Engineering, 22(5), 751-773. doi:10.1017/S1351324915000030
2014
-
XMG: a tool for implementing frames (Lichte, Timm, Petitjean, Simon, Kallmeyer, Laura and Samih, Younes), In Concept Types and Frames (CTF 2014), 25–27 August 2014, University of Düsseldorf, 2014. [Link]
2013
-
Synchronous Regular Relations and Morphological Analysis (Wurm, Christian and Samih, Younes), In Proceedings of the 11th International Conference on Finite State Methods and Natural Language Processing, 2013.
2012
-
Arabic Word Generation and Modelling for Spell Checking. (Shaalan, Khaled F, Samih, Younes, Attia, Mohammed, Pecina, Pavel and van Genabith, Josef), In LREC, 2012.
-
Conversion of Procedural Morphologies to Finite-State Morphologies: a Case Study of Arabic (Hulden, Mans and Samih, Younes), In 10th International Workshop on Finite State Methods and Natural Language Processing, 2012.
-
Improved Spelling Error Detection and Correction for Arabic. (Attia, Mohammed, Pecina, Pavel, Samih, Younes, Shaalan, Khaled F and van Genabith, Josef), In COLING (Posters), 2012.
-
The Floating Arabic Dictionary: An Automatic Method for Updating a Lexical Database through the Detection and Lemmatization of Unknown Words (Attia, Mohammed, Samih, Younes, Shaalan, Khaled and van Genabith, Josef), In COLING 2012, 2012.
2011
-
FTrace: a tool for finite-state morphology (Kilbury, James, Bontcheva, Katina and Samih, Younes), In 9th International Workshop on Finite State Methods and Natural Language Processing, 2011.
-
Historical-Comparative Reconstruction in Finite-State Technology (Kilbury, J., Bontcheva, K., Mamerow, N., & Samih, Y.), In 9th International Tbilisi Symposium on Language, Logic and Computation, 2011, pp. 26-30.