350 руб

Журнал «Информационно-измерительные и управляющие системы» №11 за 2009 г.
Статья в номере:
Морфологический анализатор для арабского языка (SAMA1)
Авторы:
Боумедин Шаннаг
аспирант Санкт-Петербургского института информатики РАН.
В. В. Александров
д. т. н., профессор, зав. лаб. автоматизации научных исследований Санкт-Петербургского института информатики РАН.
E-mail: alexandr@iias.spb.su
Аннотация:
Предложен алгоритм морфологического анализатора арабского языка (SАМА1) для обнаружения корней арабских слов. Данный анализатор был проверен на базе данных 24013 корней (существительных и глаголов), взятых из арабского тезауруса, арабских книг и научных работ. Экспериментальные работы, проведенные с SАМА1 показали точность выделения корней около 98%.
Страницы: 60-62
Список источников
- Abdelali, A., Cowie, J., and Soliman, S. H., Arabic information retrieval perspectives // Proceedings of JEP-TALN 2004 Arabic Language Processing. Fez 19-22. April. 2004.
- Larkey, L., Ballesteros, L., and Connell, M., Improving stemming for arabic information retrieval: Light stemming and co-occurrence analysis. In SIGIR 2002. P. 269-274.
- Larkey, L., Ballesteros, L., and Connell, M., Improving stemming for Arabic information retrieval: light stemming and co-occurrence analysis. SIGIR 2002, Finland. P. 275-282.
- Gey, F., Oard, D., The TREC-2001 cross-language information retrieval track: searching Arabic using English, French or Arabic queries. NIST, TREC 2001 Proceedings, P. 16-25.
- Larkey, L., and Connell, M., Arabic information retrieval at UMass in TREC-10. In: Voorhees, E.M. and Harman, D.K. (Eds.) The Tenth Text Retrieval Conference, TREC 2001 NIST Special Publication 500-250. 2002. P. 562-570.
- Al-Fedaghi, S. and Al-Anzi, New algorithm to generate Arabic root-pattern forms. In Proceedings of the 11th national computer conference. King Fahd University of Petroleum&Minerals, Dhahran, Saudi Arabia. 1989. P. 391-400.
- Al-Shalabi, R., Design and Implementation of an Arabic Morphological System to Support Natural Language Processing. PHD thesis, Computer Science. Chicago. 1996.
- Beesley, K. R., Arabic finite-state morphological analysis and generation. In COLING-96: Proceedings of the 16th international conference on computational linguistics. 1996. V.1. P. 89-94.
- Khoja,S. and Garside. Stemming Arabic text. Computing Department Lancaster University, Lancaster, 1999.
- http://www.comp.lancs.ac.uk/computing/users/khoja/stemmer.
- Darwish, K., Doermann, D., Jones, R., Oard, D., and Rautiainen, M. TREC-10 experiments at Maryland: CLIR and video. In TREC 2001. Gaithersburg: NIST, 2001.
- www.LearnArabicOnline.com
- Khoja, S., Garside R., and Knowles, G., An Arabic tagset for the morphosyntactic tagging of Arabic corpus linguistics, Lancaster University, Lancaster, UK. 2001.
- Lavie, A., Peterson, E., Probst, K., Wintner S., and Eytani, Y., Rapid prototyping of a transfer-based Hebrew-to-English Machine Translation system. Proceedings of the TMI-04. 2004.
- Morneau, R., Designing an artificial language: Arabic morphology. 1994.
- Goweder, A. and De Roeck, A., Assessment of a significant Arabic corpus.Presented at the Arabic NLP Workshop at ACL/EACL 2001. Toulouse. France. 2001.
- Larkey, L., S., Ballesteros, L., and Connell, M. E., Improving Stemming for Arabic Information Retrieval: Light Stemming and Co-occurrence Analysis // In SIGIR-02, August 11-15. 2002. Tampere. Finland. 2002 P. 275-282.
- Hayder, K. Al Ameed, Shaikha O. Al Ketbi, Amna A. Al Kaabi, Khadija S. Al Shebli, Naila F. Al Shamsi, Noura H. Al Nuaimi, Shaikha S. Al Muhairi, Arabic light stemmer: anew enhanced approach // Software Engineering Dept. College of Information Technology, UAE University,PO. Box 17555. Al-Ain. UAE.
- ChenA. and Gey, F., Building an Arabic Stemmer for Information Retrieval School of Information Management and Systems University of California at Berkeley. CA 94720-4600, USA.